LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards
Papers
MAIC-UI: Making Interactive Courseware with Generative UI
WildReward: Learning Reward Models from In-the-Wild Human Interactions
Models 84
THU-KEG/LongTraceRL-30B
Reinforcement Learning • 31B • Updated • 17 • 1
THU-KEG/LongTraceRL-8B
Reinforcement Learning • Updated • 1
THU-KEG/LongTraceRL-4B
Reinforcement Learning • 4B • Updated • 23 • 1
THU-KEG/DeepDive-30B-A3B-C-GRPO
31B • Updated • 5
THU-KEG/DeepDive-4B-C-GRPO
4B • Updated • 4
THU-KEG/DeepDive-30B-A3B-SFT
31B • Updated • 3
THU-KEG/DeepDive-4B-SFT
4B • Updated • 9
THU-KEG/WildReward-8B
Text Classification • 8B • Updated • 37 • 3
THU-KEG/WildReward-4B
Text Classification • 4B • Updated • 15 • 4
THU-KEG/LLaDA-8B-BGPO-sudoku
Reinforcement Learning • 8B • Updated • 5 • 1
Datasets 23
THU-KEG/LongTraceRL
Viewer • Updated • 2.82k • 30
THU-KEG/CaRR-DeepDive
Preview • Updated • 105 • 1
THU-KEG/WildFB
Updated • 44 • 3
THU-KEG/AgentIF
Viewer • Updated • 707 • 324 • 7
THU-KEG/DeepPrune
Preview • Updated • 18 • 2
THU-KEG/LinguaLens-Data
Viewer • Updated • 7.25k • 20 • 2
THU-KEG/RM-Bench
Viewer • Updated • 1.33k • 1.78k • 11
THU-KEG/LongWriter-Zero-RLData
Viewer • Updated • 8.61k • 100 • 21
THU-KEG/Arena-Write
Viewer • Updated • 595 • 66 • 5
THU-KEG/LongStory
Viewer • Updated • 5.28k • 50 • 3