TowerVision Extending Tower capabilities to the vision modality utter-project/TowerVision-2B Image-Text-to-Text • 3B • Updated Nov 5, 2025 • 94 • 6 utter-project/TowerVision-9B Image-Text-to-Text • 10B • Updated Nov 5, 2025 • 30 • 4 utter-project/TowerVideo-2B Video-Text-to-Text • 3B • Updated Oct 28, 2025 • 10 • 3 utter-project/TowerVideo-9B Video-Text-to-Text • 10B • Updated Oct 28, 2025 • 10 • 4
mHuBERT-147 models Compact yet powerful multilingual speech representation models based on the HuBERT architecture. utter-project/mHuBERT-147-base-1st-iter Feature Extraction • 94.4M • Updated Jan 23, 2025 • 19 • 2 utter-project/mHuBERT-147-base-2nd-iter Feature Extraction • 94.4M • Updated Jan 23, 2025 • 13.4k • • 3 utter-project/mHuBERT-147 Feature Extraction • 94.4M • Updated 20 days ago • 45.9k • • 105
utter-project/mHuBERT-147-base-2nd-iter Feature Extraction • 94.4M • Updated Jan 23, 2025 • 13.4k • • 3
Spire Extending Tower to the speech modality. Spire models are multimodal LLMs capable of transcribing and translating English into 9 different languages. utter-project/SpireBase 7B • Updated Sep 9, 2025 • 5 • 4 utter-project/SpireFull 7B • Updated Sep 9, 2025 • 22 • 2 utter-project/SpireNoBlocks 7B • Updated Sep 9, 2025 • 11 utter-project/SpireNoPseudo 7B • Updated Sep 9, 2025 • 6
EuroLLM utter-project/EuroLLM-1.7B Text Generation • Updated Nov 27, 2024 • 14.1k • 110 utter-project/EuroLLM-9B Text Generation • 9B • Updated Dec 9, 2024 • 1.9k • 164 utter-project/EuroLLM-1.7B-Instruct Text Generation • 2B • Updated Dec 16, 2024 • 13k • 99 utter-project/EuroLLM-9B-Instruct Text Generation • 9B • Updated Dec 9, 2024 • 25.7k • 212
TowerVision Extending Tower capabilities to the vision modality utter-project/TowerVision-2B Image-Text-to-Text • 3B • Updated Nov 5, 2025 • 94 • 6 utter-project/TowerVision-9B Image-Text-to-Text • 10B • Updated Nov 5, 2025 • 30 • 4 utter-project/TowerVideo-2B Video-Text-to-Text • 3B • Updated Oct 28, 2025 • 10 • 3 utter-project/TowerVideo-9B Video-Text-to-Text • 10B • Updated Oct 28, 2025 • 10 • 4
Spire Extending Tower to the speech modality. Spire models are multimodal LLMs capable of transcribing and translating English into 9 different languages. utter-project/SpireBase 7B • Updated Sep 9, 2025 • 5 • 4 utter-project/SpireFull 7B • Updated Sep 9, 2025 • 22 • 2 utter-project/SpireNoBlocks 7B • Updated Sep 9, 2025 • 11 utter-project/SpireNoPseudo 7B • Updated Sep 9, 2025 • 6
mHuBERT-147 models Compact yet powerful multilingual speech representation models based on the HuBERT architecture. utter-project/mHuBERT-147-base-1st-iter Feature Extraction • 94.4M • Updated Jan 23, 2025 • 19 • 2 utter-project/mHuBERT-147-base-2nd-iter Feature Extraction • 94.4M • Updated Jan 23, 2025 • 13.4k • • 3 utter-project/mHuBERT-147 Feature Extraction • 94.4M • Updated 20 days ago • 45.9k • • 105
utter-project/mHuBERT-147-base-2nd-iter Feature Extraction • 94.4M • Updated Jan 23, 2025 • 13.4k • • 3
EuroLLM utter-project/EuroLLM-1.7B Text Generation • Updated Nov 27, 2024 • 14.1k • 110 utter-project/EuroLLM-9B Text Generation • 9B • Updated Dec 9, 2024 • 1.9k • 164 utter-project/EuroLLM-1.7B-Instruct Text Generation • 2B • Updated Dec 16, 2024 • 13k • 99 utter-project/EuroLLM-9B-Instruct Text Generation • 9B • Updated Dec 9, 2024 • 25.7k • 212