PlanningBench: Generating Scalable and Verifiable Planning Data for Evaluating and Training Large Language Models Paper • 2605.20873 • Published 6 days ago • 4
Safety Alignment as Continual Learning: Mitigating the Alignment Tax via Orthogonal Gradient Projection Paper • 2602.07892 • Published 14 days ago • 2
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 145 items • Updated 4 days ago • 29
Mix-Quant: Quantized Prefilling, Precise Decoding for Agentic LLMs Paper • 2605.20315 • Published 7 days ago • 28
A Survey of Large Audio Language Models: Generalization, Trustworthiness, and Outlook Paper • 2605.20266 • Published 8 days ago • 56
It Takes Two: Complementary Self-Distillation for Contextual Integrity in LLMs Paper • 2605.20258 • Published 8 days ago • 29
You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories Paper • 2605.21468 • Published 6 days ago • 47
OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond Paper • 2605.19660 • Published 7 days ago • 39 • 3
Mega-ASR: Towards In-the-wild^2 Speech Recognition via Scaling up Real-world Acoustic Simulation Paper • 2605.19833 • Published 7 days ago • 127
$δ$-mem: Efficient Online Memory for Large Language Models Paper • 2605.12357 • Published 14 days ago • 120 • 5
LiteFrame: Efficient Vision Encoders Unlock Frame Scaling in Video LLMs Paper • 2605.17260 • Published 9 days ago • 24
Language-Switching Triggers Take a Latent Detour Through Language Models Paper • 2605.18646 • Published 8 days ago • 4
PEEK: Context Map as an Orientation Cache for Long-Context LLM Agents Paper • 2605.19932 • Published 7 days ago • 7
Where Does Authorship Signal Emerge in Encoder-Based Language Models? Paper • 2605.19908 • Published 7 days ago • 5
CopT: Contrastive On-Policy Thinking with Continuous Spaces for General and Agentic Reasoning Paper • 2605.20075 • Published 7 days ago • 4