arxiv:2412.03123
Jinghan Jia
flyingbugs
models 146
flyingbugs/bi_unlearn_wmdp
Text Generation • 7B • Updated • 3
flyingbugs/OpenR1-Qwen-math-7B-SFT-mid-only
Text Generation • 8B • Updated • 2
flyingbugs/qwen-65-open-r1
Text Generation • 8B • Updated • 1
flyingbugs/GeneralThought-195K-65-qwen7b
Text Generation • 8B • Updated • 2
flyingbugs/limo-solutions-deepseek-qwen-7b
Text Generation • 8B • Updated • 1
flyingbugs/deepseek-distilled-qwen-7b-rl
Text Generation • 8B • Updated • 2
flyingbugs/Qwen2.5-Math-7B-limo-32b
Text Generation • 8B • Updated • 2
flyingbugs/Qwen2.5-math-1.5B-Open-R1-Distill-eos-new
Text Generation • 2B • Updated • 3
flyingbugs/Qwen2.5-1.5B-Open-R1-Distill-eos-epic-new
Text Generation • 2B • Updated • 4
flyingbugs/Qwen2.5-math-1.5B-Open-R1-Distill-eos
Text Generation • 2B • Updated • 2
datasets 83
flyingbugs/OpenR1-Math-220k-pruned-mid
Viewer • Updated • 93.7k • 18
flyingbugs/GeneralThought-195K-65
Viewer • Updated • 127k • 27
flyingbugs/limo-solutions-deepseek
Viewer • Updated • 817 • 20
flyingbugs/star1_rlhf_train
Viewer • Updated • 1k • 5
flyingbugs/limo-deepseek32b-responses
Viewer • Updated • 817 • 25
flyingbugs/OpenR1-Math-220k-random-0.65-subset
Viewer • Updated • 60.9k • 26
flyingbugs/pku_safe_rlhf_
Viewer • Updated • 73.9k • 29
flyingbugs/aime_2024
Viewer • Updated • 30 • 16
flyingbugs/pure_math
Viewer • Updated • 17.4k • 13
flyingbugs/pku_safe_rlhf_combined_math
Viewer • Updated • 91.3k • 20