-
AmberYifan/qwen2.5-7b-instruct-full-pretrain-control-tweet-1m-en-sft
Text Generation • 8B • Updated -
AmberYifan/qwen2.5-7b-instruct-full-pretrain-junk-tweet-1m-en-sft
Text Generation • 8B • Updated • 1 -
AmberYifan/qwen2.5-7b-instruct-full-pretrain-mix-high-tweet-1m-en-sft
Text Generation • 8B • Updated -
AmberYifan/qwen2.5-7b-instruct-full-pretrain-mix-mid-tweet-1m-en-sft
Text Generation • 8B • Updated
Yifan Wang
AmberYifan
AI & ML interests
None yet
Recent Activity
updated a model 3 days ago
AmberYifan/qwen3-8b_ultrafeedback_grpo_structure_only_step38 published a model 3 days ago
AmberYifan/qwen3-8b_ultrafeedback_grpo_structure_only_step38 updated a model 3 days ago
AmberYifan/qwen3-8b_openrubrics_v2_grpo_structure_only_step60Organizations
LLMs Can Get "Brain Rot"!
-
AmberYifan/qwen2.5-7b-instruct-full-pretrain-control-tweet-1m-en-sft
Text Generation • 8B • Updated -
AmberYifan/qwen2.5-7b-instruct-full-pretrain-junk-tweet-1m-en-sft
Text Generation • 8B • Updated • 1 -
AmberYifan/qwen2.5-7b-instruct-full-pretrain-mix-high-tweet-1m-en-sft
Text Generation • 8B • Updated -
AmberYifan/qwen2.5-7b-instruct-full-pretrain-mix-mid-tweet-1m-en-sft
Text Generation • 8B • Updated
DRIFT
Learning from Abundant User Dissatisfaction in Real-World Preference Learning
models 131
AmberYifan/qwen3-8b_ultrafeedback_grpo_structure_only_step38
8B • Updated • 17
AmberYifan/qwen3-8b_openrubrics_v2_grpo_structure_only_step60
8B • Updated • 17
AmberYifan/qwen3-8b_aime_ttrl_step45
8B • Updated • 17
AmberYifan/qwen3-8b_aime_grpo_structure_only_step45
8B • Updated • 11
AmberYifan/qwen3-8b_aime_grpo_structure_only_new_split_step45
8B • Updated • 17
AmberYifan/qwen3-8b_aime_grpo_acc_structure_step45
8B • Updated • 18
AmberYifan/qwen3-8b_aime_grpo_acc_only_step45
8B • Updated • 16
AmberYifan/qwen3-8b-base_aime_grpo_structure_only_step45
8B • Updated • 18
AmberYifan/qwen3-8b-base_aime_grpo_acc_structure_step45
8B • Updated • 20
AmberYifan/qwen3-8b-base_aime_grpo_acc_only_step45
8B • Updated • 16
datasets 28
AmberYifan/seed-data
Viewer • Updated • 491 • 23
AmberYifan/dsat-data
Viewer • Updated • 10.6k • 30
AmberYifan/sat-data
Viewer • Updated • 4.43k • 19
AmberYifan/mistral-v0.1-spin-hhrlhf
Viewer • Updated • 5.5k • 6
AmberYifan/sft-spin-filter
Updated • 6
AmberYifan/sft-spin-kcenter-5k
Viewer • Updated • 5.5k • 17
AmberYifan/gsm8k-sft
Viewer • Updated • 8.79k • 79
AmberYifan/sft-spin-v
Viewer • Updated • 50.5k • 11
AmberYifan/safeRLHF-SFT
Viewer • Updated • 83.4k • 23
AmberYifan/SPIN-trans-DPOformat
Viewer • Updated • 55k • 72