Minghong Cai

onevfall
https://onevfall.github.io/personal_page/
  • onevfall
  • minghong-cai-425bb4274

AI & ML interests

Video generation, Video editing

Recent Activity

upvoted a paper 1 day ago
SemanticGen: Video Generation in Semantic Space
upvoted a paper 7 days ago
Alchemist: Unlocking Efficiency in Text-to-Image Model Training via Meta-Gradient Data Selection
upvoted a paper 21 days ago
ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Organizations

None yet

authored a paper 3 months ago

VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning

Paper • 2510.08555 • Published Oct 9 • 63
authored a paper 12 months ago

DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation

Paper • 2412.18597 • Published Dec 24, 2024 • 20