Inference Optimization
community
Models (178)
inference-optimization/gpt-oss-20b-from-gpt-oss-120b-ckpt1-speculator.eagle3
0.9B • Updated • 20
inference-optimization/gpt-oss-20b-from-gpt-oss-120b-ckpt0-speculator.eagle3
0.9B • Updated • 43
inference-optimization/Llama-3.1-8B-Instruct-NVFP4-DDP8
5B • Updated • 12
inference-optimization/Qwen3-235B-A22B-Thinking-2507.w8a8
235B • Updated • 12
inference-optimization/Qwen3-235B-A22B-Instruct-2507.w8a8
235B • Updated • 11
inference-optimization/Qwen3-Next-80B-A3B-Instruct_mtp_speculator
Text Generation • 2B • Updated • 57
inference-optimization/Mistral-Small-4-119B-2603-BF16
119B • Updated • 11
inference-optimization/Mistral3_speculator_dummy
2B • Updated • 22
inference-optimization/Phi-4-reasoning-vision-15B-FP8-dynamic
15B • Updated
inference-optimization/Qwen3-235B-A22B-Thinking-2507.w4a16
32B • Updated • 11