Running 42 LFM2.5 1.2B Thinking WebGPU 💧 42 Run LFM2.5-1.2B-Thinking directly in your browser on WebGPU
zai-org/GLM-ASR-Nano-2512 Automatic Speech Recognition • 2B • Updated Dec 24, 2025 • 278k • 347
Running on A10G 1.87k GGUF My Repo 🦙 1.87k Quantize a Hugging Face model to GGUF and create a repo
google/embeddinggemma-300m-qat-q4_0-unquantized Sentence Similarity • Updated Sep 25, 2025 • 1.79k • 42
meituan-longcat/LongCat-Flash-Chat Text Generation • 562B • Updated Sep 24, 2025 • 27.4k • 526