Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition Paper • 2305.05084 • Published May 8, 2023 • 3
Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 7 days ago • 74
Article Transformers v5: Simple model definitions powering the AI ecosystem +2 24 days ago • 253
Treble10: A high-quality dataset for far-field speech recognition, dereverberation, and enhancement Paper • 2510.23141 • Published Oct 27 • 4
Article Open ASR Leaderboard: Trends and Insights with New Multilingual & Long-Form Tracks +2 Nov 21 • 24
VibeVoice Collection Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated 21 days ago • 179
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5 • 241
Article Llasa Goes RL: Training LLaSA with GRPO for Improved Prosody and Expressiveness Nov 5 • 10
Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation Paper • 2510.06961 • Published Oct 8 • 10
Article huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning +2 Oct 27 • 73