Submitted by weimin wang
Ovi: Twin Backbone Cross-Modal Fusion for Audio-Video Generation
Character.AI