Computer-Vison - a cedhons Collection

cedhons 's Collections

Computer-Vison

updated May 6, 2025

Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play

Paper • 2505.02707 • Published May 5, 2025 • 85
MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention Routing

Paper • 2505.02823 • Published May 5, 2025 • 5
PixelHacker: Image Inpainting with Structural and Semantic Consistency

Paper • 2504.20438 • Published Apr 29, 2025 • 44
Improving Editability in Image Generation with Layer-wise Memory

Paper • 2505.01079 • Published May 2, 2025 • 29
A Survey of Interactive Generative Video

Paper • 2504.21853 • Published Apr 30, 2025 • 46
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution

Paper • 2505.00497 • Published May 1, 2025 • 17
Generative AI for Character Animation: A Comprehensive Survey of Techniques, Applications, and Future Directions

Paper • 2504.19056 • Published Apr 27, 2025 • 18
ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction

Paper • 2504.21855 • Published Apr 30, 2025 • 13