sirynoma's picture

In a Training Loop 🔄

sirynoma

uavleeva

·

Suchotin

AI & ML interests

None yet

Recent Activity

updated a collection 43 minutes ago

Multitask RLVR using GRPO (HSE Project)

updated a model about 1 hour ago

uavleeva/grpo_code_run_001

updated a model about 1 hour ago

uavleeva/grpo_sudoku_run_001

View all activity

Organizations

Collections 1

models 6

uavleeva/grpo_code_run_001

Updated 43 minutes ago

uavleeva/grpo_sudoku_run_001

Updated about 1 hour ago

uavleeva/grpo_sudoku_run_003

Updated 2 days ago

uavleeva/grpo_sql_run_002

Updated 3 days ago

uavleeva/grpo_math_run_level3_all_rewards_001

Updated 3 days ago

uavleeva/grpo_math_run_level3_accformat_001

Updated 3 days ago

datasets 1

uavleeva/text2sql_synthetics

Updated about 3 hours ago