DCAgent2/terminal_bench_2_exp_tas_timeout_multiplier_1_0_traces_20260219_163757 Updated 34 minutes ago
DCAgent2/terminal_bench_2_Kimi_K2T_neulab_agenttuning_kg_sandboxes_maxeps_32k_20260219_163802 Viewer • Updated about 6 hours ago • 263
DCAgent2/dev_set_v2_exp_swd_r2egym_wo_docker_glm_4_7_traces_20260221_125413 Viewer • Updated about 6 hours ago • 295
DCAgent2/dev_set_v2_glm46_Toolscale_tasks_traces_20260221_125415 Viewer • Updated about 7 hours ago • 297
DCAgent2/dev_set_v2_rl_rl_conf_qwen_8b_ll_lr1e_5_bs64_yaml_mode_path_r2eg_nl2b_stac_bugs24471e1b Viewer • Updated about 7 hours ago • 297
DCAgent2/terminal_bench_2_Kimi_K2T_neulab_agenttuning_mind2web_sandboxes_maxeps_32k_202646ecac48 Viewer • Updated about 11 hours ago • 264
DCAgent2/terminal_bench_2_exp_tas_timeout_multiplier_4_0_traces_20260219_163755 Viewer • Updated about 12 hours ago • 262
DCAgent2/dev_set_v2_Kimi_K2T_neulab_agenttuning_kg_sandboxes_maxeps_32k_20260221_005345 Viewer • Updated about 13 hours ago • 297
DCAgent2/dev_set_v2_Kimi_K2T_neulab_agenttuning_webshop_sandboxes_maxeps_32k_20260221_005349 Viewer • Updated about 14 hours ago • 295
DCAgent2/dev_set_v2_Kimi_K2T_ling_coder_sft_sandboxes_1_maxeps_32k_20260221_005347 Viewer • Updated about 14 hours ago • 297
DCAgent2/terminal_bench_2_exp_swd_r2egym_wo_docker_glm_4_7_traces_20260219_163808 Viewer • Updated about 14 hours ago • 262
DCAgent2/dev_set_v2_GLM_4_7_r2egym_sandboxes_maxeps_131k_20260220_005159 Viewer • Updated about 15 hours ago • 290
DCAgent2/dev_set_v2_Kimi_K2T_neulab_agenttuning_mind2web_sandboxes_maxeps_32k_20260221_005343 Viewer • Updated about 15 hours ago • 297
DCAgent2/dev_set_v2_GLM_4_7_swesmith_sandboxes_with_tests_oracle_verified_120s_maxeps_13e27d735c Viewer • Updated about 16 hours ago • 288
DCAgent2/terminal_bench_2_exp_tas_timeout_multiplier_8_0_traces_20260219_163753 Viewer • Updated about 17 hours ago • 264
DCAgent2/dev_set_v2_GLM_4_7_stackexchange_tezos_sandboxes_maxeps_131k_20260220_005201 Viewer • Updated about 17 hours ago • 291
DCAgent2/dev_set_v2_exp_psu_stackoverflow_10K_glm_4_7_traces_20260221_005320 Viewer • Updated about 17 hours ago • 295
DCAgent2/dev_set_v2_exp_tas_timeout_multiplier_0_25_traces_20260221_005341 Viewer • Updated about 17 hours ago • 297
DCAgent2/dev_set_v2_exp_tas_optimal_combined_traces_20260221_005327 Viewer • Updated about 17 hours ago • 296
DCAgent2/terminal_bench_2_glm46_Toolscale_tasks_traces_20260219_163810 Viewer • Updated about 17 hours ago • 261
DCAgent2/dev_set_v2_exp_tas_timeout_multiplier_1_0_traces_20260221_005339 Viewer • Updated about 17 hours ago • 296
DCAgent2/dev_set_v2_exp_tas_timeout_multiplier_4_0_traces_20260221_005337 Viewer • Updated about 17 hours ago • 297
DCAgent2/dev_set_v2_swesmith_sandboxes_with_tests_gpt_5_mini_passed_glm_4_7_traces_2026050cfb723 Viewer • Updated about 17 hours ago • 297
DCAgent2/dev_set_v2_exp_psu_stackoverflow_316_glm_4_7_traces_20260221_005326 Viewer • Updated about 18 hours ago • 297
DCAgent2/dev_set_v2_exp_tas_timeout_multiplier_8_0_traces_20260221_005335 Viewer • Updated about 18 hours ago • 296
DCAgent2/dev_set_v2_rl_think_npfg_code_contests_900s_45_20260220_005140 Viewer • Updated about 19 hours ago • 290
DCAgent2/dev_set_v2_exp_psu_stackoverflow_3K_glm_4_7_traces_20260221_005322 Viewer • Updated about 19 hours ago • 297
DCAgent2/dev_set_v2_exp_gfi_staqc_askllm_filtered_10K_glm_4_7_traces_jupiter_20260221_005312 Viewer • Updated about 19 hours ago • 286
DCAgent2/dev_set_v2_exp_syh_r2egym_swesmith_mixed_glm_4_7_traces_locetash_20260220_005150 Viewer • Updated about 19 hours ago • 297
DCAgent2/dev_set_v2_perturbed_docker_exp_freelancer_tasks_glm_4_7_traces_20260221_005310 Viewer • Updated about 19 hours ago • 295