NM Testing

company

Recent Activity

nm-autobot updated a model about 10 hours ago

nm-testing/Qwen3-30B-A3B-NVFP4-AWQ-e2e

nm-autobot updated a model about 11 hours ago

nm-testing/nvfp4_moe-e2e

nm-autobot updated a model about 11 hours ago

nm-testing/fp8_dynamic_moe-e2e

nm-testing's models (521)

nm-testing/Mixtral-8x7B-Instruct-v0.1-W8A16-quantized

47B • Updated Oct 9, 2024 • 17

nm-testing/Mixtral-8x7B-Instruct-v0.1-W4A16-quantized

47B • Updated Oct 9, 2024 • 17

nm-testing/tinyllama-oneshot-w8a8-dynamic-token-v2-asym

Text Generation • 1B • Updated Oct 9, 2024 • 69

nm-testing/Qwen2-1.5B-Instruct-FP8W8

Text Generation • 2B • Updated Oct 9, 2024 • 7

nm-testing/Meta-Llama-3-8B-Instruct-W4A16-ACTORDER-compressed-tensors-test

Text Generation • 8B • Updated Oct 9, 2024 • 11

nm-testing/Meta-llama3-8b-Instruct-quant-FP8

Text Generation • 8B • Updated Oct 9, 2024 • 5

nm-testing/Meta-llama3-8b-Instruct-SmoothQuant-Fp8

Text Generation • 8B • Updated Oct 9, 2024 • 6

nm-testing/Meta-Llama-3-8B-Instruct-nonuniform-test

Text Generation • 8B • Updated Oct 9, 2024 • 27.5k

nm-testing/nonuniform

Text Generation • 8B • Updated Oct 9, 2024 • 7

nm-testing/Meta-Llama-3-8B-Instruct-Non-Uniform-compressed-tensors

Text Generation • 8B • Updated Oct 9, 2024 • 10

nm-testing/Meta-Llama-3-8B-Instruct-W8A8-FP8-Channelwise-compressed-tensors

Text Generation • 8B • Updated Oct 9, 2024 • 8 • 2

nm-testing/Meta-Llama-3-8B-Instruct-FP8-K-V

Text Generation • 8B • Updated Oct 9, 2024 • 8

nm-testing/Qwen2-0.5B-Instruct

Text Generation • 0.6B • Updated Oct 9, 2024 • 3

nm-testing/Meta-Llama-3-8B-Instruct-W4A16-compressed-tensors-test

Text Generation • 8B • Updated Oct 9, 2024 • 41

nm-testing/Meta-Llama-3-8B-FP8-compressed-tensors-test-bos

Text Generation • 8B • Updated Oct 9, 2024 • 7

nm-testing/Meta-Llama-3-8B-FP8-compressed-tensors-test

Text Generation • 8B • Updated Oct 9, 2024 • 18.5k

nm-testing/Meta-Llama-3-8B-Instruct-W4-Group128-A16-Test

Text Generation • 8B • Updated Oct 9, 2024 • 6

nm-testing/Meta-Llama-3-8B-Instruct-W8-Channel-A8-Dynamic-Per-Token-Test

Text Generation • 8B • Updated Oct 9, 2024 • 17

nm-testing/tinyllama-oneshot-w8a16-per-channel

Text Generation • 1B • Updated Oct 9, 2024 • 1.03k

nm-testing/Meta-Llama-3-8B-Instruct-W8A8-Dyn-Per-Token-2048-Samples

Text Generation • 8B • Updated Oct 9, 2024 • 96

nm-testing/Meta-Llama-3-8B-Instruct-W8A8-Dyn-Per-Token

Text Generation • 8B • Updated Oct 9, 2024 • 5

nm-testing/llama-3-instruct-w8a8-dyn-per-token-test

Text Generation • 8B • Updated Oct 9, 2024 • 8

nm-testing/tinyllama-oneshot-w8-channel-a8-tensor

Text Generation • 1B • Updated Oct 9, 2024 • 1.18k

nm-testing/tinyllama-oneshot-w8a8-channel-dynamic-token-v2

Text Generation • 1B • Updated Oct 9, 2024 • 23.8k

nm-testing/tinyllama-oneshot-w8w8-test-static-shape-change

Text Generation • 1B • Updated Oct 9, 2024 • 70k

nm-testing/tinyllama-oneshot-w4a16-channel-v2

Text Generation • 1B • Updated Oct 9, 2024 • 25.7k • 1

nm-testing/tinyllama-oneshot-w4a16-group128-v2

Text Generation • 1B • Updated Oct 9, 2024 • 13.3k

nm-testing/tinyllama-oneshot-w8a8-static-v2

Text Generation • 1B • Updated Oct 9, 2024 • 40

nm-testing/tinyllama-oneshot-w8a8-dynamic-token-v2

Text Generation • 1B • Updated Oct 9, 2024 • 18.5k

nm-testing/tinyllama-marlin24-w4a16-group128

Text Generation • 0.3B • Updated Oct 9, 2024 • 4