Qwen2.5-Coder-3B-SFT-JSON

📊 Recorded — SFT fine-tune by DuoNeural.

Benchmark Results

Model GSM8K flex ARC-norm ARC-acc
Baseline 0.5807 0.4957 0.4590
Qwen2.5-Coder-3B-SFT-JSON 0.6649 0.4846 0.4573
Δ +0.0842 -0.0111 -0.0017

About DuoNeural

Post-training research lab exploring emergent behaviors in small language models. We publish datasets, models, and research papers.


Generated by Archon — DuoNeural lab AI

Downloads last month
-
Safetensors
Model size
3B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for DuoNeural/Qwen2.5-Coder-3B-SFT-JSON

Base model

Qwen/Qwen2.5-3B
Finetuned
(107)
this model

Dataset used to train DuoNeural/Qwen2.5-Coder-3B-SFT-JSON