Rethinking Training Dynamics in Scale-wise Autoregressive Generation Paper • 2512.06421 • Published Dec 6, 2025 • 7
HyperAlign: Hypernetwork for Efficient Test-Time Alignment of Diffusion Models Paper • 2601.15968 • Published 15 days ago • 6
Little By Little: Continual Learning via Self-Activated Sparse Mixture-of-Rank Adaptive Learning Paper • 2506.21035 • Published Jun 26, 2025
Adaptive Rank, Reduced Forgetting: Knowledge Retention in Continual Learning Vision-Language Models with Dynamic Rank-Selective LoRA Paper • 2412.01004 • Published Dec 1, 2024
Mind the Gap: Improving Success Rate of Vision-and-Language Navigation by Revisiting Oracle Success Routes Paper • 2308.03244 • Published Aug 7, 2023
SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts Paper • 2412.05552 • Published Dec 7, 2024 • 6
SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts Paper • 2412.05552 • Published Dec 7, 2024 • 6