ICIP/ADR
Decoupling Reasoning and Confidence: Resurrecting Calibration in Reinforcement Learning from Verifiable Rewards
DeepPresenter: Environment-Grounded Reflection for Agentic Presentation Generation