lkevincc0/Step-3.5-Flash-REAP-128B-A11B • Text Generation • 121B • Updated about 11 hours ago • 77 • 8
Routing the Lottery: Adaptive Subnetworks for Heterogeneous Data • Paper • 2601.22141 • Published 13 days ago • 2
TTCS: Test-Time Curriculum Synthesis for Self-Evolving • Paper • 2601.22628 • Published 13 days ago • 34
ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas • Paper • 2601.21558 • Published 13 days ago • 58
Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning • Paper • 2512.07461 • Published Dec 8, 2025 • 78