Harvard-DCML/boomerang-qwen3-2.3B
Text Generation • 3B • Updated
• 106 • 1
Data-Centric ML
A Critical Look at Targeted Instruction Selection: Disentangling What Matters (and What Doesn't)
Boomerang Distillation Enables Zero-Shot Model Size Interpolation