Yumeng Li
Yumeng
AI & ML interests
Generative Models, Vision-Language Models, Out-of-Distribution Generalization
Recent Activity
upvoted an article about 11 hours ago
A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond
liked a Space 6 months ago
nanotron/ultrascale-playbook
liked a dataset 8 months ago
TIGER-Lab/OmniEdit-Filtered-1.2M
Organizations
None yet