Constrained Diffusion Tasks Datasets used in the paper "Constrained Decoding of Diffusion LLMs for Context-Free Grammars". eth-sri/json-mode-eval-extended Viewer • Updated Nov 15, 2025 • 272 • 463 eth-sri/smiles-eval Viewer • Updated Aug 15, 2025 • 167 • 47 zai-org/humaneval-x Updated Oct 25, 2022 • 1.5k • 94 eth-sri/HumanEval-MRI-Cpp Viewer • Updated Aug 15, 2025 • 473 • 8 • 1
SWT-Bench Variations of the SWT-Bench pre-formatted dataset used for the paper "SWT-Bench: Testing and Validating Real-World Bug-Fixes with Code Agents" eth-sri/SWT-bench_bm25_27k_zsb Viewer • Updated Feb 25, 2025 • 2.2k • 27 eth-sri/SWT-bench_Lite_bm25_27k_zsb Viewer • Updated Feb 25, 2025 • 299 • 23 eth-sri/SWT-bench_Verified_bm25_27k_zsp Viewer • Updated Feb 25, 2025 • 433 • 1.38k eth-sri/SWT-bench_bm25_27k_zsp Viewer • Updated Feb 25, 2025 • 2.52k • 10 • 2
Constrained Diffusion Tasks Datasets used in the paper "Constrained Decoding of Diffusion LLMs for Context-Free Grammars". eth-sri/json-mode-eval-extended Viewer • Updated Nov 15, 2025 • 272 • 463 eth-sri/smiles-eval Viewer • Updated Aug 15, 2025 • 167 • 47 zai-org/humaneval-x Updated Oct 25, 2022 • 1.5k • 94 eth-sri/HumanEval-MRI-Cpp Viewer • Updated Aug 15, 2025 • 473 • 8 • 1
SWT-Bench Variations of the SWT-Bench pre-formatted dataset used for the paper "SWT-Bench: Testing and Validating Real-World Bug-Fixes with Code Agents" eth-sri/SWT-bench_bm25_27k_zsb Viewer • Updated Feb 25, 2025 • 2.2k • 27 eth-sri/SWT-bench_Lite_bm25_27k_zsb Viewer • Updated Feb 25, 2025 • 299 • 23 eth-sri/SWT-bench_Verified_bm25_27k_zsp Viewer • Updated Feb 25, 2025 • 433 • 1.38k eth-sri/SWT-bench_bm25_27k_zsp Viewer • Updated Feb 25, 2025 • 2.52k • 10 • 2