Part-X-MLLM: Part-aware 3D Multimodal Large Language Model Paper • 2511.13647 • Published Nov 17 • 70
Video2Roleplay: A Multimodal Dataset and Framework for Video-Guided Role-playing Agents Paper • 2509.15233 • Published Sep 17 • 2
PLA4D: Pixel-Level Alignments for Text-to-4D Gaussian Splatting Paper • 2405.19957 • Published May 30, 2024 • 10
Entangled View-Epipolar Information Aggregation for Generalizable Neural Radiance Fields Paper • 2311.11845 • Published Nov 20, 2023 • 1
PLA4D: Pixel-Level Alignments for Text-to-4D Gaussian Splatting Paper • 2405.19957 • Published May 30, 2024 • 10