The ToolRL model trained for tool use through GRPO
Cheng Qian
chengq9
AI & ML interests
Agent, Tool Learning
Recent Activity
upvoted a paper about 17 hours ago
Ψ-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues upvoted a paper 7 days ago
Advancing Creative Physical Intelligence in Large Multimodal Models submitted a paper 7 days ago
Advancing Creative Physical Intelligence in Large Multimodal Models