nm-testing/llama2.c-stories110M-gsm8k-recipe_w4a16_actorder_weight-compressed 0.1B • Updated Mar 12, 2025 • 1.41k
nm-testing/Meta-Llama-3-8B-Instruct-FP8-channel-output-activation-kv_cache-qkv_proj 8B • Updated Mar 10, 2025
nm-testing/llama2.c-stories42M-gsm8k-quantized-only-uncompressed 58.2M • Updated Feb 12, 2025 • 2.18k