Qwen/WebWorldData
OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models
RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation