SkyReels-V4: Multi-modal Video-Audio Generation, Inpainting and Editing model Paper • 2602.21818 • Published Feb 25 • 55
mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding Paper • 2403.12895 • Published Mar 19, 2024 • 32