Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
In a Training Loop ๐
sirynoma
uavleeva
Follow
0 followers
ยท
1 following
Suchotin
AI & ML interests
None yet
Recent Activity
updated
a collection
2 days ago
Multitask RLVR using GRPO (HSE Project)
updated
a collection
2 days ago
Multitask RLVR using GRPO (HSE Project)
updated
a collection
2 days ago
Multitask RLVR using GRPO (HSE Project)
View all activity
Organizations
uavleeva
's models
13
Sort:ย Recently updated
uavleeva/grpo_merged_math_sql_code_ties_001
Text Generation
โข
Updated
2 days ago
โข
6
uavleeva/grpo_mixed_run_002
Updated
2 days ago
uavleeva/grpo_sql_run_005
Updated
2 days ago
uavleeva/grpo_merged_math_sql_code_linear_001
Text Generation
โข
Updated
2 days ago
uavleeva/grpo_code_run_002
Updated
2 days ago
uavleeva/grpo_mixed_run_004
Updated
2 days ago
uavleeva/grpo_math_run_level3_all_rewards_001
Updated
2 days ago
uavleeva/grpo_sql_run_002
Updated
2 days ago
uavleeva/grpo_sql_run_004
Updated
3 days ago
uavleeva/grpo_mixed_run_001
Updated
3 days ago
uavleeva/grpo_sudoku_run_003
Updated
4 days ago
uavleeva/grpo_math_run_level3_accformat_001
Updated
4 days ago
uavleeva/grpo_code_run_001
Updated
4 days ago