arxiv:2601.10201
Jiarui Yao
FlippyDora
AI & ML interests
None yet
Recent Activity
published a model about 5 hours ago: rb-dev/rubrics_train_data
upvoted a paper about 10 hours ago: Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models
submitted a paper about 10 hours ago: Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models