arxiv:2601.10201
Jiarui Yao
FlippyDora
AI & ML interests
None yet
Recent Activity
published
a model about 7 hours ago
rb-dev/rubrics_train_data upvoted a paper about 11 hours ago
Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models submitted
a paper
about 11 hours ago
Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models