Qwen/Qwen3.5-397B-A17B
Image-Text-to-Text
•
403B
•
Updated
•
270
None defined yet.
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration
Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models