SkyReels-V4: Multi-modal Video-Audio Generation, Inpainting and Editing model Paper โข 2602.21818 โข Published 2 days ago โข 46
mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding Paper โข 2403.12895 โข Published Mar 19, 2024 โข 32