Pion: A Spectrum-Preserving Optimizer via Orthogonal Equivalence Transformation • Paper • 2605.12492 • Published 3 days ago • 4
Stabilizing Rubric Integration Training via Decoupled Advantage Normalization • Paper • 2603.26535 • Published Mar 27 • 3