allenai/asta-bench-submissions
Building breakthrough AI to solve the world's biggest problems.
Meta-Reinforcement Learning with Self-Reflection for Agentic Search
TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics