NarraScore: Bridging Visual Narrative and Musical Dynamics via Hierarchical Affective Control Paper • 2602.09070 • Published 17 days ago • 44
Sanity Checks for Sparse Autoencoders: Do SAEs Beat Random Baselines? Paper • 2602.14111 • Published 11 days ago • 56
GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning Paper • 2602.12099 • Published 14 days ago • 57
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published 17 days ago • 206
On Data Engineering for Scaling LLM Terminal Capabilities Paper • 2602.21193 • Published 1 day ago • 75
PyVision-RL: Forging Open Agentic Vision Models via RL Paper • 2602.20739 • Published 2 days ago • 24
From Perception to Action: An Interactive Benchmark for Vision Reasoning Paper • 2602.21015 • Published 1 day ago • 21
Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models Paper • 2602.07026 • Published 24 days ago • 136
SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise Paper • 2602.12783 • Published 13 days ago • 145
view article Article Scaling Self Supervised Learning for Histology: introducing Phikon Oct 31, 2023 • 6
Agent READMEs: An Empirical Study of Context Files for Agentic Coding Paper • 2511.12884 • Published Nov 17, 2025 • 26
HeartMuLa: A Family of Open Sourced Music Foundation Models Paper • 2601.10547 • Published Jan 15 • 44
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14, 2025 • 147