Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
13
8
Fuxiang Zhang
sicer
Follow
21world's profile picture
ZhenghaiXue's profile picture
chriszhouwei's profile picture
3 followers
·
1 following
mansicer
AI & ML interests
None yet
Recent Activity
authored
a paper
about 13 hours ago
Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems
upvoted
a
paper
about 18 hours ago
Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems
upvoted
a
paper
27 days ago
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs
View all activity
Organizations
sicer
's models
5
Sort: Recently updated
sicer/arc-agi-legacy
Updated
Mar 20, 2025
sicer/gbc-backup
Updated
Mar 20, 2025
sicer/grpo-Qwen2.5-Math-7B-math-level3to5-foundation
8B
•
Updated
Mar 19, 2025
•
1
sicer/grpo-Qwen2.5-Math-7B-2epoch-limr-true-math-false-multi-turn
8B
•
Updated
Mar 19, 2025
•
1
sicer/Qwen2.5-Math-7B-starting-point
8B
•
Updated
Mar 19, 2025