TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents Paper โข 2602.02196 โข Published 21 days ago โข 33
$ฯ$-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation Paper โข 2503.13288 โข Published Mar 17, 2025 โข 51
MUR: Momentum Uncertainty guided Reasoning for Large Language Models Paper โข 2507.14958 โข Published Jul 20, 2025 โข 47