nm-testing/Apertus-8B-Instruct-2509-NVFP4
5B • Updated • 2
nm-testing/Llama-4-Scout-17B-16E-Instruct-FP8-BLOCK
108B • Updated • 6
nm-testing/tinysmokellama-3.2
354k • Updated • 94.8k
nm-testing/Qwen3-Next-80B-A3B-Instruct-NVFP4
Updated • 1.74k
• 2
nm-testing/Llama-3.2-1B-Instruct-quip-w4a16
0.8B • Updated • 7.33k
nm-testing/Llama-3.2-1B-Instruct-group-activations
1B • Updated • 2
nm-testing/qwen3-80b-fp8-dynamic
80B • Updated • 2
nm-testing/gemma-3-4b-it-s_q-W4A8-G512
5B • Updated • 7
nm-testing/llama3.3-70B-speculators.09-10-2025-eagle3
2B • Updated • 1
nm-testing/Llama-3.2-1B-Instruct-quipv-w4a16
0.7B • Updated • 5
nm-testing/Llama-3.2-1B-Instruct-quip
2B • Updated • 23
nm-testing/Llama-3.2-1B-Instruct-spinquantR1R2-online
0.7B • Updated • 2
nm-testing/Qwen3-Coder-30B-A3B-Instruct-W4A16-awq
5B • Updated • 12.8k
• 4
nm-testing/llama4-scout-17b-eagle3-dummy-drafter
nm-testing/Llama-3.2-1B-Instruct-spinquantR1R2R4-w4a16
0.7B • Updated • 7.35k
nm-testing/Llama-3.1-8B-Instruct-quip-w4a16
2B • Updated • 4
nm-testing/Meta-Llama-3-8B-Instruct-spinquantR3-FP8_asym-attn
8B • Updated • 2
nm-testing/Meta-Llama-3-8B-Instruct-spinquantR3
8B • Updated • 9
nm-testing/gemma-3n-2b-quantized.w4a16-test
4B • Updated • 2
nm-testing/Meta-Llama-3-8B-Instruct-NVFP4-FP8-Dynamic
6B • Updated • 5
nm-testing/TinyLlama-1.1B-Chat-v1.0-NVFP4-FP8-Dynamic
0.8B • Updated • 2
nm-testing/Llama-3.2-1B-Instruct-lc_min_hack-hadamard-w4a16
0.7B • Updated • 4
nm-testing/Llama-3.2-1B-Instruct-sq_min_hack-hadamard-w4a16
0.7B • Updated • 4
nm-testing/Llama-3.2-1B-Instruct-sq_min_hack-eye-w4a16
0.7B • Updated • 2
nm-testing/Llama-3.2-1B-Instruct-lc_min_hack-eye-w4a16
0.7B • Updated • 3
nm-testing/Meta-Llama-3-8B-Instruct-quip-w4a16
2B • Updated • 7
nm-testing/gemma-3n-E2B-it-W4A16-G128
4B • Updated • 3
nm-testing/block-quantization-fp8-qwen3-0.6B
0.8B • Updated • 2
nm-testing/Llama-3.1-8B-Instruct-speculator.eagle3-converted
Text Generation
• 1.0B • Updated • 770
nm-testing/gemma-3n-2B-it-w4a16
4B • Updated • 3