AQ-MedAI/PulseMind-72B
Image-Text-to-Text • 73B • Updated • 40
Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO
GroupRank: A Groupwise Reranking Paradigm Driven by Reinforcement Learning