Langlin Huang

shrango

·

https://shrango.github.io/

AI & ML interests

LLM Reasoning, Machine Translation

Recent Activity

commentedon a paper about 3 hours ago

Your Teacher Can't Help You Here: Combating Supervision Fidelity Decay in On-Policy Distillation

updated a collection 18 days ago

updated a collection 18 days ago

View all activity

Organizations

commented a paper about 3 hours ago

Your Teacher Can't Help You Here: Combating Supervision Fidelity Decay in On-Policy Distillation

Paper • 2605.30833 • Published May 29 •

updated a collection 18 days ago

LoPE

LoPE experiment checkpoints (global_step_200) • 19 items • Updated 18 days ago