Running Agents 2 Qworld Evaluation Criteria Generator ๐ Generate evaluation criteria for any question
Running Agents 1 Automated Evaluation For VMCBench ๐ This is a automated evaluation for VMCBench test and dev set