GenRM/gutenberg-dpo-v0.1-jondurbin
Viewer
• Updated • 918 • 43
GenRM/HelpSteer2-DPO-Atsunori
Viewer
• Updated • 7.59k • 9
GenRM/MetaMath_DPO_FewShot-abacusai
Viewer
• Updated • 395k • 57
GenRM/reddit-dpo-nbeerbower
Viewer
• Updated • 76.9k • 6
GenRM/function-calling-v0.2-with-r1-cot-AymanTarig
Viewer
• Updated • 58k • 7
GenRM/Magpie-Reasoning-V2-250K-CoT-Deepseek-R1-Llama-70B-Magpie-Align
Viewer
• Updated • 250k • 10
GenRM/dolphin-r1-cognitivecomputations
Updated • 22
GenRM/Bespoke-Stratos-17k-bespokelabs
Viewer
• Updated • 16.7k • 26
GenRM/OpenThoughts-114k-open-thoughts
Viewer
• Updated • 114k • 42
GenRM/R1-Distill-SFT-ServiceNow-AI
Viewer
• Updated • 172k • 7
GenRM/Magpie-Gemma2-Pro-200K-Filtered-Magpie-Align
Viewer
• Updated • 200k • 7
GenRM/filtered_DeepSeek-R1-Distill-Llama-8B-avrecum
Viewer
• Updated • 600 • 7
Updated • 10
GenRM/ultrafeedback_binarized_cleaned-allenai
Preview
• Updated • 7
GenRM/orca_dpo_pairs-Intel
Viewer
• Updated • 12.9k • 7
GenRM/distilabel-math-preference-dpo-argilla
Viewer
• Updated • 2.42k • 8
GenRM/Math-Step-DPO-10K-xinlai
Viewer
• Updated • 10.8k • 8
GenRM/Code-Preference-Pairs-Vezora
Viewer
• Updated • 54k • 15
GenRM/Magpie-Air-DPO-100K-v0.1-Magpie-Align
Viewer
• Updated • 100k • 30
GenRM/Magpie-Llama-3.1-Pro-DPO-100K-v0.1-Magpie-Align
Viewer
• Updated • 100k • 12
GenRM/magpie-ultra-v1.0-argilla
Preview
• Updated • 17
Viewer
• Updated • 8.5k • 30
GenRM/darkside-dpo-openvoid
Viewer
• Updated • 541 • 8