Text Generation
PEFT
TensorBoard
Safetensors
Arabic
English
Generated from Trainer
trl
grpo
math
reasoning
R1
conversational
Instructions to use Omartificial-Intelligence-Space/Fanar-Math-R1-GRPO with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use Omartificial-Intelligence-Space/Fanar-Math-R1-GRPO with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("QCRI/Fanar-1-9B-Instruct") model = PeftModel.from_pretrained(base_model, "Omartificial-Intelligence-Space/Fanar-Math-R1-GRPO") - Notebooks
- Google Colab
- Kaggle
Training in progress, step 300
Browse files
adapter_model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 8967968
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f1168fb367944e34bafdf521119b5f082885f0ee16443ee1ac9370e09ade8162
|
| 3 |
size 8967968
|
runs/Jun14_01-40-48_lambda-hyperplane/events.out.tfevents.1749854532.lambda-hyperplane
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:8a8c2656f3e82c1f183f20caa875c29786a4a7f44e62ec0e510d1cbda5bafa0c
|
| 3 |
+
size 21614
|