InstructLR: A Scalable Approach to Create Instruction Dataset for Under-Resourced Languages Paper • 2512.02213 • Published Dec 1, 2025 • 1
DisfluencySpeech -- Single-Speaker Conversational Speech Dataset with Paralanguage Paper • 2406.08820 • Published Jun 13, 2024 • 2
Article Fine-Tune MMS Adapter Models for low-resource ASR patrickvonplaten • Jun 19, 2023 • 27
Where Are We At with Automatic Speech Recognition for the Bambara Language? Paper • 2602.09785 • Published Feb 10 • 1
Article Synthetic dataset generation techniques: Self-Instruct davanstrien • May 15, 2024 • 23
Hugging Face Changelog Connect Your MCP Client to the Hugging Face Hub Jun 6, 2025 • 114
Article Recipe: Preparing Multilingual Speech Datasets for TTS Training PHBJT • Nov 4, 2024 • 20