garyzhang's picture

2 10 4

garyzhang

xiaoniqiu

·

garyzhang99

AI & ML interests

LLM, Agents

Recent Activity

upvoted a paper 3 days ago

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

upvoted a paper 2 months ago

Multi-Docker-Eval: A `Shovel of the Gold Rush' Benchmark on Automatic Environment Building for Software Engineering

updated a dataset 4 months ago

datajuicer/geometry_sft

View all activity

Organizations

Papers 1

arxiv:2508.11408

models 0

None public yet

datasets 0

None public yet

xiaoniqiu (garyzhang)

garyzhang's picture

2 10 4

garyzhang

xiaoniqiu

·

garyzhang99

AI & ML interests

LLM, Agents

Recent Activity

upvoted a paper 3 days ago

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

upvoted a paper 2 months ago

Multi-Docker-Eval: A `Shovel of the Gold Rush' Benchmark on Automatic Environment Building for Software Engineering

updated a dataset 4 months ago

datajuicer/geometry_sft

View all activity

Organizations

Papers 1

arxiv:2508.11408

models 0

None public yet

datasets 0

None public yet