arxiv:2503.09662
Zikai Zhou
Klayand
AI & ML interests
Knowledge Distillation, Generated Models
Recent Activity
upvoted
a
paper
about 17 hours ago
Late-to-Early Training: LET LLMs Learn Earlier, So Faster and Better
upvoted
a
paper
3 days ago
Mano: Restriking Manifold Optimization for LLM Training
upvoted
a
paper
4 days ago
PISA: Piecewise Sparse Attention Is Wiser for Efficient Diffusion Transformers
Organizations
None yet