Trained and optimized for fluent speech, speech AI serves people who stutter (PWS) poorly: it often cuts them off before they finish speaking and transcribes their speech with error rates roughly four times higher than average. The growing deployment of voice AI in automated phone menus, AI-conducted job interviews, and everyday devices poses tangible risks to PWS.

Even when automatic speech recognition (ASR) systems do manage to transcribe stuttered speech, the resulting transcripts often strip out disfluencies such as filler words, stigmatizing stuttering and denying PWS the option to have their disfluencies preserved and normalized in transcripts.

To address the fluency biases embedded in today’s speech AI technology, we turned to our community and engaged in grassroots AI efforts with and by the stuttering community. Together, we developed datasets, metrics, tools, and techniques to measure, understand, and reduce fluency biases in existing ASR models.
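One common way to quantify such a bias is to compare word error rates (WER) between fluent and stuttered speech samples. The sketch below is a minimal, self-contained illustration of that idea; the transcript pairs are hypothetical examples, not data from this project, and real evaluations would use matched corpora and a standard toolkit.

```python
# Minimal sketch of measuring a fluency bias in ASR output by comparing
# word error rate (WER) across fluent and stuttered speech samples.
# The reference/hypothesis pairs below are hypothetical illustrations.

def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level Levenshtein distance / reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    # dp[i][j] = edit distance between ref[:i] and hyp[:j]
    dp = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        dp[i][0] = i
    for j in range(len(hyp) + 1):
        dp[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            dp[i][j] = min(dp[i - 1][j] + 1,        # deletion
                           dp[i][j - 1] + 1,        # insertion
                           dp[i - 1][j - 1] + cost) # substitution
    return dp[-1][-1] / len(ref)

# Hypothetical (reference, ASR hypothesis) pairs for each group.
# The stuttered reference keeps the repetition, since transcripts
# should be able to preserve disfluencies rather than erase them.
fluent_pairs = [("please call me back", "please call me back")]
stuttered_pairs = [("please p- please call me back",
                    "please please call back")]

fluent_wer = sum(wer(r, h) for r, h in fluent_pairs) / len(fluent_pairs)
stuttered_wer = sum(wer(r, h) for r, h in stuttered_pairs) / len(stuttered_pairs)
print(f"fluent WER: {fluent_wer:.2f}, stuttered WER: {stuttered_wer:.2f}")
```

A gap between the two averages, over a large enough sample, is one concrete signal of the fluency bias described above.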

Our working process with the grassroots stuttering community also showcases an alternative model for AI development: one that builds and amplifies capacity within marginalized communities to challenge the existing concentration of AI power, allowing us to envision a technological future that serves broader public interests rather than the profits and dominance of a few.


Publications

Datasets, tools, and models

All of the following technical assets were produced, and are maintained, by people who stutter.

Grants

We thank the following funders for supporting our work on fair speech AI for people who stutter:

Press