issai/ms-swift_qwen3_vla_4b_whisper_original_init_thinking_10_qwen3_vla_bs_2_g4_ch-16348 1.07M • Updated about 6 hours ago
issai/ms-swift_qwen3_vla_4b_whisper_original_init_thinking_10_qwen3_vla_bs_2_g4_ch-16348 1.07M • Updated about 6 hours ago
T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground Paper • 2512.10430 • Published Dec 11, 2025 • 115