-
SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models
Paper • 2407.15841 • Published • 39 -
Stable Audio Open
Paper • 2407.14358 • Published • 26 -
PlacidDreamer: Advancing Harmony in Text-to-3D Generation
Paper • 2407.13976 • Published • 5 -
Efficient Audio Captioning with Encoder-Level Knowledge Distillation
Paper • 2407.14329 • Published • 5
Joe
pushkin05
·
AI & ML interests
None yet
Recent Activity
liked a dataset 12 days ago
Voxel51/gaussian_splatting published a model 3 months ago
pushkin05/trm liked a model 5 months ago
arcprize/trm_arc_prize_verification