DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models Paper • 2603.26164 • Published 8 days ago • 150
SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization Paper • 2604.02268 • Published 3 days ago • 76
Superpositional Gradient Descent: Harnessing Quantum Principles for Model Training Paper • 2511.01918 • Published Nov 1, 2025 • 13
Emergent Social Intelligence Risks in Generative Multi-Agent Systems Paper • 2603.27771 • Published 6 days ago • 48
On Token's Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models Paper • 2603.27481 • Published 7 days ago • 34
Marco DeepResearch: Unlocking Efficient Deep Research Agents via Verification-Centric Design Paper • 2603.28376 • Published 5 days ago • 18
MuSEAgent: A Multimodal Reasoning Agent with Stateful Experiences Paper • 2603.27813 • Published 6 days ago • 22
PRBench: End-to-end Paper Reproduction in Physics Research Paper • 2603.27646 • Published 6 days ago • 28
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 83 items • Updated about 1 hour ago • 516