CompactAI committed
Commit d244c8b · verified · 1 Parent(s): 1f1306f

Upload folder using huggingface_hub
README.md CHANGED
@@ -5,44 +5,36 @@ tags:
  - python
  - optimized
  - wanda
- - activation-pruning
  base_model: Qwen/Qwen2.5-3B-Instruct
  pipeline_tag: text-generation
  ---

  # Qwen2.5-3B-Instruct-python-aggressive

- > 🎯 **PYTHON-optimized** | 📦 **Aggressive** pruning | ⚡ **20% weights pruned**

- This model is an **aggressively pruned** version of [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct), specialized for **PYTHON** tasks using activation-aware weight pruning (Wanda-style).

- ## ✨ Key Features
-
- - **Specialization**: Optimized for Python tasks
- - **Pruning Method**: Wanda-style (|W| × |activation|) importance scoring
- - **Size Reduction**: 20% weights pruned
- - **Use Case**: Maximum compression for edge deployment
-
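The `|W| × |activation|` scoring named above can be sketched roughly as follows. This is an illustrative reconstruction of Wanda-style pruning, not the actual code used to produce this model, and it uses a single global threshold for simplicity where the published method ranks weights within per-output comparison groups:

```python
import numpy as np

def wanda_prune(W, X, ratio):
    """Zero out the `ratio` fraction of weights with the smallest
    |W| * ||activation|| importance score (simplified, global threshold)."""
    act_norm = np.linalg.norm(X, axis=0)       # per-input-feature norm from calibration data
    score = np.abs(W) * act_norm               # broadcasts the norm over output rows
    k = int(ratio * W.size)
    threshold = np.partition(score.ravel(), k)[k]  # (k+1)-th smallest score
    return np.where(score >= threshold, W, 0.0)

# Toy weight matrix and calibration activations (stand-ins for real layers).
rng = np.random.default_rng(0)
W = rng.normal(size=(8, 16))
X = rng.normal(size=(32, 16))
W_pruned = wanda_prune(W, X, ratio=0.2)
```

The key property is that a small weight multiplied by a large typical activation can outrank a large weight on a dead input, which is why calibration activations matter.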
- ## 📊 Performance Comparison

  | Category | Original | Pruned | Change |
  |----------|----------|--------|--------|
- | **Python** | 40.0% | 13.3% ⭐ | ↓ 26.7% |
- | Html | 6.7% | 0.0% | ↓ 6.7% |
- | Trivia | 88.9% | 73.3% | ↓ 15.6% |
- | Math | 57.8% | 62.2% | ↑ 4.4% |
- | Reasoning | 33.3% | 28.9% | ↓ 4.4% |
- | Medical | 93.3% | 84.4% | ↓ 8.9% |
- | Linux | 95.6% | 93.3% | ↓ 2.2% |
- | Writing | 62.2% | 60.0% | ↓ 2.2% |

- **Average**: 59.7% → 51.9% (-7.8%)

- **Python Retention**: 33.3% of original performance

  ![Comparison Graph](comparison_graph.png)

- ## 🚀 Quick Start

  ```python
  from transformers import AutoModelForCausalLM, AutoTokenizer
@@ -50,31 +42,20 @@ from transformers import AutoModelForCausalLM, AutoTokenizer
  model = AutoModelForCausalLM.from_pretrained("CompactAI/Qwen2.5-3B-Instruct-python-aggressive")
  tokenizer = AutoTokenizer.from_pretrained("CompactAI/Qwen2.5-3B-Instruct-python-aggressive")

- # Example usage
  inputs = tokenizer("Your prompt here", return_tensors="pt")
  outputs = model.generate(**inputs, max_new_tokens=100)
  print(tokenizer.decode(outputs[0], skip_special_tokens=True))
  ```

- ## 📋 Technical Details

  | Property | Value |
  |----------|-------|
  | Base Model | [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct) |
  | Specialization | Python |
  | Prune Mode | Aggressive |
- | Pruning Method | Activation-based weight pruning (Wanda) |
- | Weight Reduction | 20% weights pruned |
-
- ## 🔗 Related Models

- This model is part of the **Qwen2.5-3B-Instruct** pruned model collection. Variants:
- - **Safe** - Conservative pruning (~10-20%), high accuracy retention
- - **Aggressive** - Maximum compression (~40-50%), best for edge deployment

- ## 📜 License
-
- This model inherits the license from the base model [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct).
-
- ---
- *Generated by ZANNPS [Zeto Automatic Neural Network Pruning System]*
 
  - python
  - optimized
  - wanda
  base_model: Qwen/Qwen2.5-3B-Instruct
  pipeline_tag: text-generation
  ---

  # Qwen2.5-3B-Instruct-python-aggressive

+ > 🎯 **PYTHON-optimized** | 📦 **Aggressive** pruning | ⚡ **35% weights pruned**

+ This model is an **aggressively pruned** version of [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct).

+ ## Performance Comparison

  | Category | Original | Pruned | Change |
  |----------|----------|--------|--------|
+ | **Python** | 92.3% | 84.6% ⭐ | ↓ 7.7% |
+ | Html | 40.0% | 30.0% | ↓ 10.0% |
+ | Trivia | 100.0% | 86.7% | ↓ 13.3% |
+ | Math | 100.0% | 100.0% | → |
+ | Reasoning | 91.7% | 83.3% | ↓ 8.3% |
+ | Medical | 64.3% | 35.7% | ↓ 28.6% |
+ | Linux | 69.2% | 61.5% | ↓ 7.7% |
+ | Writing | 54.5% | 36.4% | ↓ 18.2% |

+ **Average**: 76.5% → 64.8% (-11.7%)

+ **Python Retention**: 91.7%
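Read as pruned-over-original score for the Python row, the retention figure checks out; a quick sanity check (this is my reading of the metric, the card does not publish the formula):

```python
# Retention = pruned Python score / original Python score, as percent.
original, pruned = 92.3, 84.6
retention = pruned / original * 100
print(f"Python retention: {retention:.1f}%")  # → Python retention: 91.7%
```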
  ![Comparison Graph](comparison_graph.png)

+ ## Quick Start

  ```python
  from transformers import AutoModelForCausalLM, AutoTokenizer

  model = AutoModelForCausalLM.from_pretrained("CompactAI/Qwen2.5-3B-Instruct-python-aggressive")
  tokenizer = AutoTokenizer.from_pretrained("CompactAI/Qwen2.5-3B-Instruct-python-aggressive")

  inputs = tokenizer("Your prompt here", return_tensors="pt")
  outputs = model.generate(**inputs, max_new_tokens=100)
  print(tokenizer.decode(outputs[0], skip_special_tokens=True))
  ```
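To confirm the advertised 35% pruning ratio locally, counting exactly-zero entries in the loaded weights is a quick check. A minimal sketch — `zero_fraction` is our helper, not part of transformers, and the toy array stands in for real `state_dict()` tensors:

```python
import numpy as np

def zero_fraction(arrays):
    """Fraction of exactly-zero entries across a list of weight arrays."""
    total = sum(a.size for a in arrays)
    zeros = sum(int((a == 0).sum()) for a in arrays)
    return zeros / total

# Toy stand-in for real weights: zero out 35% of a random matrix.
rng = np.random.default_rng(0)
w = rng.normal(size=(20, 20))
w.ravel()[: int(0.35 * w.size)] = 0.0
print(f"sparsity: {zero_fraction([w]):.2f}")  # → sparsity: 0.35
```

On the real checkpoint you would pass something like `[p.detach().numpy() for p in model.parameters()]`, expecting a value near 0.35 across the pruned linear layers.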

+ ## Technical Details

  | Property | Value |
  |----------|-------|
  | Base Model | [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct) |
  | Specialization | Python |
  | Prune Mode | Aggressive |
+ | Weight Reduction | 35% weights pruned |

+ ## License

+ This model inherits the license from the base model.
comparison_graph.png CHANGED
model-00001-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:3053d788c32f12320275840c58dd6893508dda79b6ae02d90eb1bfbfa1c29393
+ oid sha256:f1f0de7a63cc12c5699235711a61bd485a2406bb055b05cd548cc55d2cf73c1c
  size 3995916600
model-00002-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:b3e7bf62f1bb6cfb8dcaef62af3bfcbc739ea854e47f2955093ce867f12a2634
+ oid sha256:de96d4bc81ea58673a20ed6314e52c0eb10fcd5bf44d38f000338af96084b70c
  size 2176009944
tokenizer.json CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:51354673edf4300eb841665e1fb684cc1badea87c49d5de6ef09981151683508
+ oid sha256:7b3e3adf18710ac3bd97b384b0d01b58205c4c5cd37c6c56d24c8fff86b0561d
  size 11422159