ScaleAI/audiomc
Viewer
•
Updated
•
452
•
276
•
4
None defined yet.
Agentic Rubrics as Contextual Verifiers for SWE Agents
ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents