Evaluating the Expressive Appropriateness of Speech in Rich Contexts Paper • 2605.09413 • Published 5 days ago • 5
What Matters for Diffusion-Friendly Latent Manifold? Prior-Aligned Autoencoders for Latent Diffusion Paper • 2605.07915 • Published 7 days ago • 8
Vec-LUT: Vector Table Lookup for Parallel Ultra-Low-Bit LLM Inference on Edge Devices Paper • 2512.06443 • Published Dec 6, 2025 • 3
OxyGen: Unified KV Cache Management for Vision-Language-Action Models under Multi-Task Parallelism Paper • 2603.14371 • Published Mar 15 • 4
$OneMillion-Bench: How Far are Language Agents from Human Experts? Paper • 2603.07980 • Published Mar 9 • 27
MOVA: Towards Scalable and Synchronized Video-Audio Generation Paper • 2602.08794 • Published Feb 9 • 159
UltraVoice: Scaling Fine-Grained Style-Controlled Speech Conversations for Spoken Dialogue Models Paper • 2510.22588 • Published Oct 26, 2025 • 1
Qwen-Image-Edit-2511-LoRAs-Fast 🎃: Demo of the Collection of Qwen Image Edit LoRAs • Running on Zero • MCP • Featured • 1.42k
Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning Paper • 2512.07461 • Published Dec 8, 2025 • 79
ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation Paper • 2512.03036 • Published Dec 2, 2025 • 22