arxiv:2508.11408
garyzhang
xiaoniqiu
AI & ML interests
LLM, Agents
Recent Activity
upvoted a paper 3 days ago
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models
updated a dataset 4 months ago
datajuicer/geometry_sft