3 2

summerfall

YancyLee

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation

liked a model 6 days ago

openbmb/MiniCPM-SALA

liked a model 12 days ago

openbmb/MiniCPM-o-4_5

Organizations

None yet

upvoted a paper 4 days ago

Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation

Paper • 2602.12125 • Published 5 days ago • 56

liked a model 6 days ago

openbmb/MiniCPM-SALA

Text Generation • 9B • Updated 6 days ago • 3.86k • 456

liked a model 12 days ago

openbmb/MiniCPM-o-4_5

Any-to-Any • 9B • Updated 4 days ago • 50.6k • 856

upvoted a paper 27 days ago

DARC: Decoupled Asymmetric Reasoning Curriculum for LLM Evolution

Paper • 2601.13761 • Published 28 days ago • 16

upvoted a paper 4 months ago

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

Paper • 2510.14943 • Published Oct 16, 2025 • 40

updated a model 11 months ago

YancyLee/ProactiveAgent

Updated Mar 21, 2025 • 6

published a model 11 months ago

YancyLee/ProactiveAgent

Updated Mar 21, 2025 • 6
