-
Unified Reward Model for Multimodal Understanding and Generation
Paper • 2503.05236 • Published • 124 -
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning
Paper • 2505.03318 • Published • 94 -
CodeGoat24/UnifiedReward-Think-qwen35-9b
9B • Updated • 275 -
CodeGoat24/UnifiedReward-Think-qwen35-27b
3.05M • Updated • 565
SII-Yibin Wang
CodeGoat24
AI & ML interests
I'm part of Shanghai Innovation Institute, focusing on Multimodal RL and Generation.
Recent Activity
updated a model about 15 hours ago
CodeGoat24/UnifiedReward-Flex-qwen3vl-8b updated a collection 3 days ago
UnifiedReward Flex updated a model 3 days ago
CodeGoat24/UnifiedReward-Flex-qwen35-9bOrganizations
UnifiedReward Flex
-
Unified Personalized Reward Model for Vision Generation
Paper • 2602.02380 • Published • 20 -
CodeGoat24/FLUX.2-klein-base-9B-UnifiedReward-Flex-lora
Text-to-Image • Updated • 499 • 21 -
CodeGoat24/Wan2.2-T2V-A14B-UnifiedReward-Flex-lora
Text-to-Video • Updated • 189 • 12 -
CodeGoat24/Wan2.1-T2V-14B-UnifiedReward-Flex-lora
Text-to-Video • Updated • 11 • 6
UnifiedReward 2.0 Qwen3.5 Models
-
Unified Reward Model for Multimodal Understanding and Generation
Paper • 2503.05236 • Published • 124 -
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning
Paper • 2505.03318 • Published • 94 -
CodeGoat24/UnifiedReward-Think-qwen35-9b
9B • Updated • 275 -
CodeGoat24/UnifiedReward-Think-qwen35-27b
3.05M • Updated • 565
UnifiedReward Flex
-
Unified Personalized Reward Model for Vision Generation
Paper • 2602.02380 • Published • 20 -
CodeGoat24/FLUX.2-klein-base-9B-UnifiedReward-Flex-lora
Text-to-Image • Updated • 499 • 21 -
CodeGoat24/Wan2.2-T2V-A14B-UnifiedReward-Flex-lora
Text-to-Video • Updated • 189 • 12 -
CodeGoat24/Wan2.1-T2V-14B-UnifiedReward-Flex-lora
Text-to-Video • Updated • 11 • 6
spaces 4
pinned
Running
Agents
3
UniGenBench Leaderboard (Chinese Long)
🏅
UniGenBench: a unified T2I generation benchmark.
pinned
Running
Agents
3
UniGenBench Leaderboard (Chinese)
🏅
UniGenBench: a unified T2I generation benchmark.
pinned
Running
Agents
7
UniGenBench Leaderboard (English)
🏅
UniGenBench: a unified T2I generation benchmark.
pinned
Running
Agents
3
UniGenBench Leaderboard (English Long)
🏅
UniGenBench: a unified T2I generation benchmark.
models 55
CodeGoat24/UnifiedReward-Flex-qwen3vl-8b
9B • Updated • 276
CodeGoat24/UnifiedReward-Flex-qwen35-9b
9B • Updated • 112
CodeGoat24/UnifiedReward-Flex-qwen3vl-4b
5B • Updated • 47
CodeGoat24/UnifiedReward-Flex-qwen35-27b
3.05M • Updated • 7
CodeGoat24/UnifiedReward-Flex-qwen35-4b
5B • Updated • 6 • 2
CodeGoat24/UnifiedReward-Think-qwen35-27b
3.05M • Updated • 565
CodeGoat24/UnifiedReward-Think-qwen35-4b
5B • Updated • 10 • 2
CodeGoat24/UnifiedReward-Think-qwen3vl-2b
2B • Updated • 3
CodeGoat24/UnifiedReward-Think-qwen3vl-4b
4B • Updated • 2
CodeGoat24/UnifiedReward-Think-qwen3vl-8b
9B • Updated • 973 • 2
datasets 14
CodeGoat24/UniGenBench-Eval-Images
Preview • Updated • 179 • 4
CodeGoat24/UnifiedReward-Flex-SFT-90K
Viewer • Updated • 1.39M • 402 • 3
CodeGoat24/UniGenBench
Updated • 64 • 3
CodeGoat24/UnifiedReward-2.0-T2X-score-data
Viewer • Updated • 337k • 246
CodeGoat24/VIDEOGEN
Viewer • Updated • 50.9k • 29
CodeGoat24/ShareGPTVideo-DPO
Viewer • Updated • 101k • 65
CodeGoat24/VideoFeedback
Viewer • Updated • 73.2k • 90
CodeGoat24/VideoDPO
Viewer • Updated • 29k • 197
CodeGoat24/OIP
Viewer • Updated • 21.4k • 75
CodeGoat24/LLaVA-Critic-113k
Preview • Updated • 54