DCAgent/eval-Qwen3-Coder-30B-A3B-Instruct_16concurrency_openhands_eval_c_terminal-bench-2.0 Updated about 6 hours ago
DCAgent/eval-NVIDIA-Nemotron-3-Nano-30B-A3B-BF16_16concurrency_openhands_eval_c_terminal67fe5eed Updated about 8 hours ago
DCAgent/eval-NVIDIA-Nemotron-3-Nano-30B-A3B-BF16_16concurrency_openhands_eval_c_OpenThoue429c793 Updated about 8 hours ago
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_learning-rate_0a0458a3 Viewer • Updated about 19 hours ago • 764
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_learning-rate_c267e2e6 Viewer • Updated about 23 hours ago • 509
DCAgent/eval-terminal-bench-2.0-claude-haiku-4-5-20251001-20260115_165217 Viewer • Updated 1 day ago • 272
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_learning-rate_a60f4588 Viewer • Updated 1 day ago • 371