Ivan Rubachev
puhsu
AI & ML interests
Tabular data, computer vision, generative models, interpretability, careful empirical deep learning research
Recent Activity
upvoted a paper about 12 hours ago
One-Step Gradient Delay is Not a Barrier for Large-Scale Asynchronous Pipeline Parallel LLM Pretraining
upvoted a paper about 1 month ago
Unsupervised Process Reward Models
liked a dataset about 2 months ago
criteo/CriteoPrivateAd