Running Agents 2 Qworld Evaluation Criteria Generator ๐ 2 Generate evaluation criteria for any question
Running Agents 1 Automated Evaluation For VMCBench ๐ 1 This is a automated evaluation for VMCBench test and dev set