AI & ML interests
None yet
Organizations
skyai798/STAR-1_DeepSeek-R1-Distill-Llama-8B_sft-complete-dpo
Text Generation
•
8B
•
Updated
•
7
8B
•
Updated
•
1
Text Generation
•
8B
•
Updated
•
4
skyai798/llama-dpo-r2-new
Updated
skyai798/qwen2-dpo-r1-1v2
8B
•
Updated
•
1
skyai798/qwen2_safe_40000_helpful_40000_qwen_beta_0.2_lr_1.0e-6_seed_17
Updated
skyai798/qwen2_safe_20000_helpful_40000_qwen_beta_0.2_lr_5.0e-7_seed_120
Updated
Text Generation
•
8B
•
Updated
•
6
skyai798/saferlhf_ultra_sft
Text Generation
•
8B
•
Updated
•
11
skyai798/safety_v2_math_v1
Text Generation
•
8B
•
Updated
•
4
skyai798/safety-math-mix-sft
8B
•
Updated
skyai798/openmathinstruct2-mix-sft
Text Generation
•
8B
•
Updated
•
3