Beyond Fixed Frames: Dynamic Character-Aligned Speech Tokenization Paper • 2601.23174 • Published 9 days ago • 2
Learning Rate Matters: Vanilla LoRA May Suffice for LLM Fine-tuning Paper • 2602.04998 • Published 4 days ago • 3
Late-to-Early Training: LET LLMs Learn Earlier, So Faster and Better Paper • 2602.05393 • Published 3 days ago • 4 • 3
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 53 items • Updated about 8 hours ago • 3
Late-to-Early Training: LET LLMs Learn Earlier, So Faster and Better Paper • 2602.05393 • Published 3 days ago • 4
DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers Paper • 2602.02016 • Published 6 days ago • 6
HUMAN-WRITTEN & LEGALLY-SOURCED* Collection Datasets written by humans and/or reverse-engineered from text with deterministic algorithms. No illegal scraping or unethical synthesis. *Mostly. • 155 items • Updated 1 day ago • 1
Privileged Information Distillation for Language Models Paper • 2602.04942 • Published 4 days ago • 20
HUMAN-WRITTEN & LEGALLY-SOURCED* Collection Datasets written by humans and/or reverse-engineered from text with deterministic algorithms. No illegal scraping or unethical synthesis. *Mostly. • 155 items • Updated 1 day ago • 1
HUMAN-WRITTEN & LEGALLY-SOURCED* Collection Datasets written by humans and/or reverse-engineered from text with deterministic algorithms. No illegal scraping or unethical synthesis. *Mostly. • 155 items • Updated 1 day ago • 1
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 53 items • Updated about 8 hours ago • 3
Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing Paper • 2602.03845 • Published 5 days ago • 24
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 53 items • Updated about 8 hours ago • 3
Search-R2: Enhancing Search-Integrated Reasoning via Actor-Refiner Collaboration Paper • 2602.03647 • Published 5 days ago • 7
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 53 items • Updated about 8 hours ago • 3
Horizon-LM: A RAM-Centric Architecture for LLM Training Paper • 2602.04816 • Published 4 days ago • 16
Chronicals: A High-Performance Framework for LLM Fine-Tuning with 3.51x Speedup over Unsloth Paper • 2601.02609 • Published Jan 6 • 1 • 2
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 53 items • Updated about 8 hours ago • 3