koutch/paper_llama_1.json_train_dpo_v1_train_code • Text Generation • 8B • Updated about 22 hours ago • 24
koutch/paper_qwen_1.json_train_dpo_v1_train_code • Text Generation • 4B • Updated about 23 hours ago • 29
koutch/paper_smol_1.json_train_dpo_v1_train_code • Text Generation • 3B • Updated about 24 hours ago • 22
koutch/paper_llama_llama3.1-8b_train_sft_all_train_code • Text Generation • 8B • Updated 1 day ago • 106