Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning • Paper • 2602.07845 • Published 3 days ago • 57
MOVA: Towards Scalable and Synchronized Video-Audio Generation • Paper • 2602.08794 • Published 1 day ago • 139
AgentCPM-Report: Interleaving Drafting and Deepening for Open-Ended Deep Research • Paper • 2602.06540 • Published 5 days ago • 15
Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models • Paper • 2602.07026 • Published 9 days ago • 125
QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining • Paper • 2602.07085 • Published 5 days ago • 169
VLS: Steering Pretrained Robot Policies via Vision-Language Models • Paper • 2602.03973 • Published 7 days ago • 22
FS-Researcher: Test-Time Scaling for Long-Horizon Research Tasks with File-System-Based Agents • Paper • 2602.01566 • Published 9 days ago • 46
Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations • Paper • 2602.05885 • Published 5 days ago • 28
PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss • Paper • 2602.02493 • Published 8 days ago • 41
DFlash: Block Diffusion for Flash Speculative Decoding • Paper • 2602.06036 • Published 5 days ago • 40
Baichuan-M2 • Collection • Beyond the Model: Scaling Medical Capability with a Large Verifier System • 6 items • Updated 2 days ago • 4
Baichuan-M3 • Collection • Modeling Clinical Inquiry for Reliable Medical Decision-Making • 7 items • Updated 2 days ago • 16