Recent Activity
models 260
ScaleML-RLHF/Llama-1B-em-raftpp-iter4
1B • Updated • 1
ScaleML-RLHF/Llama-1B-em-raftpp-iter10
1B • Updated • 1
ScaleML-RLHF/Llama-3B-em-raftpp-iter6
4B • Updated • 2
ScaleML-RLHF/Llama-3B-em-raftpp-iter5
4B • Updated • 2
ScaleML-RLHF/Llama-3B-em-grpo-iter8
4B • Updated • 2
ScaleML-RLHF/Llama-3B-em-raftpp-iter4
4B • Updated • 1
ScaleML-RLHF/Llama-3B-em-grpo-iter7
4B • Updated • 3
ScaleML-RLHF/Llama-3B-em-raftpp-iter3
4B • Updated • 2
ScaleML-RLHF/Llama-3B-em-raftpp-iter2
4B • Updated • 1
ScaleML-RLHF/Llama-3B-grpo-step120
4B • Updated • 1
datasets 17
• Updated • 455k • 22
ScaleML-RLHF/numina_math_15
• Updated • 10k • 8
ScaleML-RLHF/numina_math_14
• Updated • 10k • 10
ScaleML-RLHF/numina_math_13
• Updated • 9.99k • 8
ScaleML-RLHF/numina_math_12
• Updated • 10k • 6
ScaleML-RLHF/numina_math_11
Updated • 10
ScaleML-RLHF/numina_math_10
• Updated • 9.98k • 7
ScaleML-RLHF/numina_math_9
• Updated • 9.99k • 13
ScaleML-RLHF/numina_math_8
• Updated • 9.99k • 9
ScaleML-RLHF/numina_math_7
• Updated • 10k • 28