CompactAI commited on
Commit
e70a888
·
verified ·
1 Parent(s): 60c461e

Upload folder using huggingface_hub

Browse files
Files changed (4) hide show
  1. README.md +16 -35
  2. comparison_graph.png +0 -0
  3. model.safetensors +1 -1
  4. tokenizer.json +1 -1
README.md CHANGED
@@ -5,7 +5,6 @@ tags:
5
  - python
6
  - optimized
7
  - wanda
8
- - activation-pruning
9
  base_model: LGAI-EXAONE/EXAONE-4.0-1.2B
10
  pipeline_tag: text-generation
11
  ---
@@ -14,35 +13,28 @@ pipeline_tag: text-generation
14
 
15
  > 🎯 **PYTHON-optimized** | 📦 **Safe** pruning | ⚡ **1% weights pruned**
16
 
17
- This model is a **conservatively pruned** version of [LGAI-EXAONE/EXAONE-4.0-1.2B](https://huggingface.co/LGAI-EXAONE/EXAONE-4.0-1.2B), specialized for **PYTHON** tasks using activation-aware weight pruning (Wanda-style).
18
 
19
- ## Key Features
20
-
21
- - **Specialization**: Optimized for Python tasks
22
- - **Pruning Method**: Wanda-style (|W| × |activation|) importance scoring
23
- - **Size Reduction**: 1% weights pruned
24
- - **Use Case**: High accuracy retention, ideal for production use
25
-
26
- ## 📊 Performance Comparison
27
 
28
  | Category | Original | Pruned | Change |
29
  |----------|----------|--------|--------|
30
- | **Python** | 20.0% | 20.0% ⭐ | → |
31
- | Html | 6.7% | 0.0% | 6.7% |
32
- | Trivia | 26.7% | 33.3% | 6.7% |
33
- | Math | 60.0% | 60.0% | → |
34
- | Reasoning | 60.0% | 60.0% | → |
35
- | Medical | 73.3% | 73.3% | → |
36
- | Linux | 93.3% | 93.3% | → |
37
- | Writing | 60.0% | 60.0% | → |
38
 
39
- **Average**: 50.0% → 50.0% (+0.0%)
40
 
41
- **Python Retention**: 100.0% of original performance
42
 
43
  ![Comparison Graph](comparison_graph.png)
44
 
45
- ## 🚀 Quick Start
46
 
47
  ```python
48
  from transformers import AutoModelForCausalLM, AutoTokenizer
@@ -50,31 +42,20 @@ from transformers import AutoModelForCausalLM, AutoTokenizer
50
  model = AutoModelForCausalLM.from_pretrained("CompactAI/EXAONE-4.0-1.2B-python-safe")
51
  tokenizer = AutoTokenizer.from_pretrained("CompactAI/EXAONE-4.0-1.2B-python-safe")
52
 
53
- # Example usage
54
  inputs = tokenizer("Your prompt here", return_tensors="pt")
55
  outputs = model.generate(**inputs, max_new_tokens=100)
56
  print(tokenizer.decode(outputs[0], skip_special_tokens=True))
57
  ```
58
 
59
- ## 📋 Technical Details
60
 
61
  | Property | Value |
62
  |----------|-------|
63
  | Base Model | [LGAI-EXAONE/EXAONE-4.0-1.2B](https://huggingface.co/LGAI-EXAONE/EXAONE-4.0-1.2B) |
64
  | Specialization | Python |
65
  | Prune Mode | Safe |
66
- | Pruning Method | Activation-based weight pruning (Wanda) |
67
  | Weight Reduction | 1% weights pruned |
68
 
69
- ## 🔗 Related Models
70
-
71
- This model is part of the **EXAONE-4.0-1.2B** pruned model collection. Variants:
72
- - **Safe** - Conservative pruning (~10-20%), high accuracy retention
73
- - **Aggressive** - Maximum compression (~40-50%), best for edge deployment
74
 
75
- ## 📜 License
76
-
77
- This model inherits the license from the base model [LGAI-EXAONE/EXAONE-4.0-1.2B](https://huggingface.co/LGAI-EXAONE/EXAONE-4.0-1.2B).
78
-
79
- ---
80
- *Generated by ZANNPS [Zeto Automatic Neural Network Pruning System]*
 
5
  - python
6
  - optimized
7
  - wanda
 
8
  base_model: LGAI-EXAONE/EXAONE-4.0-1.2B
9
  pipeline_tag: text-generation
10
  ---
 
13
 
14
  > 🎯 **PYTHON-optimized** | 📦 **Safe** pruning | ⚡ **1% weights pruned**
15
 
16
+ This model is a **conservatively pruned** version of [LGAI-EXAONE/EXAONE-4.0-1.2B](https://huggingface.co/LGAI-EXAONE/EXAONE-4.0-1.2B).
17
 
18
+ ## Performance Comparison
 
 
 
 
 
 
 
19
 
20
  | Category | Original | Pruned | Change |
21
  |----------|----------|--------|--------|
22
+ | **Python** | 76.9% | 76.9% ⭐ | → |
23
+ | Html | 20.0% | 20.0% | |
24
+ | Trivia | 0.0% | 0.0% | |
25
+ | Math | 80.0% | 80.0% | → |
26
+ | Reasoning | 0.0% | 0.0% | → |
27
+ | Medical | 0.0% | 0.0% | → |
28
+ | Linux | 0.0% | 0.0% | → |
29
+ | Writing | 0.0% | 0.0% | → |
30
 
31
+ **Average**: 22.1% → 22.1% (+0.0%)
32
 
33
+ **Python Retention**: 100.0%
34
 
35
  ![Comparison Graph](comparison_graph.png)
36
 
37
+ ## Quick Start
38
 
39
  ```python
40
  from transformers import AutoModelForCausalLM, AutoTokenizer
 
42
  model = AutoModelForCausalLM.from_pretrained("CompactAI/EXAONE-4.0-1.2B-python-safe")
43
  tokenizer = AutoTokenizer.from_pretrained("CompactAI/EXAONE-4.0-1.2B-python-safe")
44
 
 
45
  inputs = tokenizer("Your prompt here", return_tensors="pt")
46
  outputs = model.generate(**inputs, max_new_tokens=100)
47
  print(tokenizer.decode(outputs[0], skip_special_tokens=True))
48
  ```
49
 
50
+ ## Technical Details
51
 
52
  | Property | Value |
53
  |----------|-------|
54
  | Base Model | [LGAI-EXAONE/EXAONE-4.0-1.2B](https://huggingface.co/LGAI-EXAONE/EXAONE-4.0-1.2B) |
55
  | Specialization | Python |
56
  | Prune Mode | Safe |
 
57
  | Weight Reduction | 1% weights pruned |
58
 
59
+ ## License
 
 
 
 
60
 
61
+ This model inherits the license from the base model.
 
 
 
 
 
comparison_graph.png CHANGED
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1cd8feb5a5590a991f07bec7ae7f01ec2758d60707a84482c411390698410b3b
3
  size 2558820960
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ee58214bfb1aacb566e54820e60a427826b7669459ec20422c5124f94d9abb42
3
  size 2558820960
tokenizer.json CHANGED
@@ -2,7 +2,7 @@
2
  "version": "1.0",
3
  "truncation": {
4
  "direction": "Right",
5
- "max_length": 64512,
6
  "strategy": "LongestFirst",
7
  "stride": 0
8
  },
 
2
  "version": "1.0",
3
  "truncation": {
4
  "direction": "Right",
5
+ "max_length": 65386,
6
  "strategy": "LongestFirst",
7
  "stride": 0
8
  },