Text Generation
PEFT
TensorBoard
Safetensors
Arabic
English
Generated from Trainer
trl
grpo
math
reasoning
R1
conversational
Instructions to use Omartificial-Intelligence-Space/Fanar-Math-R1-GRPO with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use Omartificial-Intelligence-Space/Fanar-Math-R1-GRPO with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("QCRI/Fanar-1-9B-Instruct") model = PeftModel.from_pretrained(base_model, "Omartificial-Intelligence-Space/Fanar-Math-R1-GRPO") - Notebooks
- Google Colab
- Kaggle
Training in progress, step 302
Browse files
adapter_model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 8967968
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a499924c0b563b49f4f8ad3f22079a2d94d2810d9603b207d2d8b229f68618dd
|
| 3 |
size 8967968
|
runs/Jun14_01-40-48_lambda-hyperplane/events.out.tfevents.1749854532.lambda-hyperplane
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:787689959f50e72ff0f3208e66a6751873e04ff528d571a3929cfcb7772d67b0
|
| 3 |
+
size 22877
|