Agent Bazaar: Enabling Economic Alignment in Multi-Agent Marketplaces Paper • 2605.17698 • Published 3 days ago • 4
Continual Harness: Online Adaptation for Self-Improving Foundation Agents Paper • 2605.09998 • Published 9 days ago • 17
Odysseus: Scaling VLMs to 100+ Turn Decision-Making in Games via Reinforcement Learning Paper • 2605.00347 • Published 19 days ago • 16
The PokeAgent Challenge: Competitive and Long-Context Learning at Scale Paper • 2603.15563 • Published Mar 16 • 11
GameDevBench: Evaluating Agentic Capabilities Through Game Development Paper • 2602.11103 • Published Feb 11 • 15
LLM Economist: Large Population Models and Mechanism Design in Multi-Agent Generative Simulacra Paper • 2507.15815 • Published Jul 21, 2025 • 7