CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech Paper โข 2506.02863 โข Published Jun 3, 2025 โข 8
SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline Paper โข 2505.19314 โข Published May 25, 2025 โข 4
Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits Paper โข 2505.14648 โข Published May 20, 2025 โข 9