Answer questions using a language model
Run your agent to answer questions and get scored
Generate code and AI responses with tool assistance