Qwen/DeepPlanning
Viewer • Updated • 2.14k • 675 • 194
None defined yet.
OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models
RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation