Training Language Models via Neural Cellular Automata Paper • 2603.10055 • Published 8 days ago • 7
Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model Paper • 2603.05438 • Published 12 days ago • 36
DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints Paper • 2601.18137 • Published Jan 26 • 35
Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows Paper • 2512.13168 • Published Dec 15, 2025 • 52
RoboChallenge: Large-scale Real-robot Evaluation of Embodied Policies Paper • 2510.17950 • Published Oct 20, 2025 • 9