ibm-granite/granite-embedding-97m-multilingual-r2 Feature Extraction • 97.4M • Updated 13 days ago • 6.06k • 91
Article Model2Vec: Distill a Small Fast Model from any Sentence Transformer Pringled • Oct 14, 2024 • 104
Post 8591 We made a guide on how to run open LLMs in Claude Code, Codex and OpenClaw. Use Gemma 4 and Qwen3.6 GGUFs for local agentic coding on 24GB RAM. Run with self-healing tool calls, code execution, and web search via the Unsloth API endpoint and llama.cpp. Guide: https://unsloth.ai/docs/basics/api 🔥 25 ❤️ 7
Article LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone! medmekk, marcsun13 • Mar 7, 2025 • 97