·
AI & ML interests
None yet
Organizations
flyingbugs/bi_unlearn_wmdp
Text Generation
• 7B • Updated • 3
flyingbugs/OpenR1-Qwen-math-7B-SFT-mid-only
Text Generation
• 8B • Updated • 2
flyingbugs/qwen-65-open-r1
Text Generation
• 8B • Updated • 1
flyingbugs/GeneralThought-195K-65-qwen7b
Text Generation
• 8B • Updated • 2
flyingbugs/limo-solutions-deepseek-qwen-7b
Text Generation
• 8B • Updated • 1
flyingbugs/deepseek-distilled-qwen-7b-rl
Text Generation
• 8B • Updated • 2
flyingbugs/Qwen2.5-Math-7B-limo-32b
Text Generation
• 8B • Updated • 2
flyingbugs/Qwen2.5-math-1.5B-Open-R1-Distill-eos-new
Text Generation
• 2B • Updated • 3
flyingbugs/Qwen2.5-1.5B-Open-R1-Distill-eos-epic-new
Text Generation
• 2B • Updated • 4
flyingbugs/Qwen2.5-math-1.5B-Open-R1-Distill-eos
Text Generation
• 2B • Updated • 2
flyingbugs/Qwen2.5-1.5B-Open-R1-Distill-eos-epic
Text Generation
• 2B • Updated • 2
flyingbugs/OpenR1-Qwen-7B-SFT-65
Text Generation
• 333k • Updated • 1
flyingbugs/OlympicCoder-7B
333k • Updated flyingbugs/Qwen2.5-1.5B-Open-R1-Distill-eos
Text Generation
• 2B • Updated • 1
flyingbugs/granite3.3-8b-reinforce_plus-math_different_reward_global_step60_hf
Text Generation
• 8B • Updated • 1
flyingbugs/granite3.3-8b-math-pku-rlhf-reinforce-plus
Text Generation
• 8B • Updated • 1
flyingbugs/granite_pku_saferlhf_reinforce_plus_plus
Text Generation
• 8B • Updated flyingbugs/granite_star_1_limo_1e5
Text Generation
• 8B • Updated • 2
flyingbugs/granite_star_1_limo
Text Generation
• 8B • Updated • 1
flyingbugs/granite_star_1
Text Generation
• 8B • Updated • 3
flyingbugs/Qwen2.5-Math-7B-OpenR1-Math-220k-add-aime
Text Generation
• 8B • Updated • 1
flyingbugs/Qwen2.5-Math-7B-OpenR1-Math-220k-pruned-keep-0.5-end-start-0.5-add-aime
Text Generation
• 8B • Updated • 4
flyingbugs/Qwen2.5-Math-7B-OpenR1-Math-220k-random-perturbation-head
Text Generation
• 8B • Updated • 1
flyingbugs/Qwen2.5-Math-7B-OpenR1-Math-220k-random-perturbation-full
Text Generation
• 8B • Updated • 1
flyingbugs/Qwen2.5-Math-7B-OpenR1-Math-220k-random-perturbation-tail
Text Generation
• 8B • Updated • 1
flyingbugs/Qwen2.5-Math-7B-OpenR1-Math-220k-keep-0.5-end-start-0.5-random-perturbation
Text Generation
• 8B • Updated • 1
flyingbugs/Qwen2.5-Math-7B-OpenR1-Math-220k-pruned-keep-0.75-end-start-0.0
Text Generation
• 8B • Updated • 10
flyingbugs/Qwen2.5-Math-7B-OpenR1-Math-220k-random-perturbation-middle
Text Generation
• 8B • Updated • 1
flyingbugs/Qwen2.5-Math-7B-OpenR1-Math-220k-pruned-think-mid
Text Generation
• 8B • Updated • 2
flyingbugs/Qwen2.5-Math-7B-s1k
Text Generation
• 8B • Updated • 1