koutch/qwen_qwen3-instruct-4b_train_sft_train_para Text Generation • 4B • Updated 22 minutes ago • 66
koutch/paper_llama_llama3.1-8b_train_sft_all_train_code Text Generation • 8B • Updated 1 day ago • 125
koutch/paper_qwen_qwen3-instruct-4b_train_sft_all_train_code Text Generation • 4B • Updated 1 day ago • 101
koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_code Text Generation • 4B • Updated 1 day ago • 125
koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_para Text Generation • 4B • Updated 1 day ago • 189
koutch/paper_qwen_qwen3-instruct-4b_train_grpo_v1_train_code Text Generation • 4B • Updated 6 days ago • 4
koutch/paper_llama_llama3.1-8b_train_sft_train_thought Text Generation • 8B • Updated 8 days ago • 28