Inference Optimization
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
models 172
inference-optimization/Mistral3_speculator_dummy
2B • Updated
• 11
inference-optimization/Qwen3-235B-A22B-Thinking-2507.w4a16
32B • Updated
inference-optimization/Qwen3-235B-A22B-Instruct-2507.w4a16
32B • Updated
inference-optimization/Qwen3-32B-from-Qwen3-235B_resps-speculators.eagle3-ckpt0
2B • Updated
• 44
inference-optimization/gpt-oss-120b-from-qwen235b-ckpt5-speculator.eagle3
0.9B • Updated
• 32
inference-optimization/gpt-oss-120b-from-qwen235b-ckpt4-speculator.eagle3
0.9B • Updated
• 32
inference-optimization/gpt-oss-120b-ckpt4-speculator.eagle3
0.9B • Updated
• 30
inference-optimization/gpt-oss-120b-ckpt3-speculator.eagle3
0.9B • Updated
• 46
inference-optimization/Qwen3-Coder-Next.w4a16
Text Generation • 12B • Updated
• 1.64k
inference-optimization/DeepSeek-R1-NVFP4-FP8-BLOCK
397B • Updated
• 54