In a Training Loop 🔄

sirynoma

uavleeva

·

Suchotin

AI & ML interests

None yet

Recent Activity

updated a collection 2 days ago

Multitask RLVR using GRPO (HSE Project)

updated a collection 2 days ago

Multitask RLVR using GRPO (HSE Project)

updated a collection 2 days ago

Multitask RLVR using GRPO (HSE Project)

uavleeva's models (13)

uavleeva/grpo_merged_math_sql_code_ties_001

Text Generation • Updated 2 days ago • 6

uavleeva/grpo_mixed_run_002

Updated 2 days ago

uavleeva/grpo_sql_run_005

Updated 2 days ago

uavleeva/grpo_merged_math_sql_code_linear_001

Text Generation • Updated 2 days ago

uavleeva/grpo_code_run_002

Updated 2 days ago

uavleeva/grpo_mixed_run_004

Updated 2 days ago

uavleeva/grpo_math_run_level3_all_rewards_001

Updated 2 days ago

uavleeva/grpo_sql_run_002

Updated 2 days ago

uavleeva/grpo_sql_run_004

Updated 3 days ago

uavleeva/grpo_mixed_run_001

Updated 3 days ago

uavleeva/grpo_sudoku_run_003

Updated 4 days ago

uavleeva/grpo_math_run_level3_accformat_001

Updated 4 days ago

uavleeva/grpo_code_run_001

Updated 4 days ago
