Computer-Vison
updated
Voila: Voice-Language Foundation Models for Real-Time Autonomous
Interaction and Voice Role-Play
Paper
•
2505.02707
•
Published
•
85
MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset
via Attention Routing
Paper
•
2505.02823
•
Published
•
5
PixelHacker: Image Inpainting with Structural and Semantic Consistency
Paper
•
2504.20438
•
Published
•
44
Improving Editability in Image Generation with Layer-wise Memory
Paper
•
2505.01079
•
Published
•
29
A Survey of Interactive Generative Video
Paper
•
2504.21853
•
Published
•
46
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High
Resolution
Paper
•
2505.00497
•
Published
•
17
Generative AI for Character Animation: A Comprehensive Survey of
Techniques, Applications, and Future Directions
Paper
•
2504.19056
•
Published
•
18
ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D
Physics Modeling for Complex Motion and Interaction
Paper
•
2504.21855
•
Published
•
13