audeering/wav2vec2-large-robust-12-ft-emotion-msp-dim Audio Classification • 0.2B • Updated Sep 19, 2024 • 983k • 166
Running on CPU Upgrade Agents Featured 1.36k Open ASR Leaderboard 🏆 1.36k Compare speech‑to‑text models across multiple benchmarks
Vietnamese speech dataset Collection for any speech-related tasks including but not limited to: speech-to-text & text-to-speech, speech classification, speaker verification, etc. • 34 items • Updated 15 days ago • 47
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 521k • 1.6k