Limitations in speech recognition for young adults with down syndrome

被引:1
作者
Cibrian, Franceli L. [1 ]
Chen, Yingchen 'Yuki' [1 ]
Anderson, Kayla [1 ]
Abrahamsson, Cecilia Marie [1 ]
Motti, Vivian Genaro [2 ]
机构
[1] Chapman Univ, Fowler Sch Engn, One Univ Dr, Orange, CA 92866 USA
[2] George Mason Univ, Dept Informat Sci & Technol, 4400 Univ Dr, Fairfax, VA 22030 USA
关键词
Speech recognition; Natural language; Smart home devices; Neurodiverse users; Down syndrome; CHILDREN; YOUTUBE; AUTISM; VIDEO;
D O I
10.1007/s10209-025-01197-4
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Speech recognition has the potential to make technology more accessible to users. However, the accuracy of speech recognition remains limited for users with disabilities, including those with Down Syndrome, and the types and frequencies of recognition errors are poorly understood. This paper characterizes these problems, focusing on errors occurring when recognizing Down Syndrome speech. We analyze the transcripts from six speech recognition algorithms (Google, IBM, Otter.ai, Microsoft, AssemblyAI, OpenAI) using the audio content of 15 individuals with Down Syndrome (331 dialogues; 3428 words). Our analysis shows: (1) significant difference in speech recognition accuracy for people with Down Syndrome compared to neurotypical users; (2) the best algorithm for recognizing Down Syndrome speech is OpenAI (Word Accuracy = 67%; F1-score = 0.944); and (3) there is a prevalence of deletion errors followed by substitutions and insertions. These findings have implications for enhancing speech recognition for the next-generation voice assistants to meet the needs of users with Down Syndrome.
引用
收藏
页数:19
相关论文
共 93 条
[1]   "Siri Talks at You": An Empirical Investigation of Voice-Activated Personal Assistant (VAPA) Usage by Individuals Who Are Blind [J].
Abdolrahmani, Ali ;
Kuber, Ravi ;
Branham, Stacy M. .
ASSETS'18: PROCEEDINGS OF THE 20TH INTERNATIONAL ACM SIGACCESS CONFERENCE ON COMPUTERS AND ACCESSIBILITY, 2018, :249-258
[2]  
Armstrong Thomas., 2010, Neurodiversity: Discovering the extraordinary gifts of autism, ADHD, dyslexia, and other brain differences
[3]  
Ashok V., 2015, P 12 WEB ALL C
[4]  
Azenkot S., 2013, P 15 INT ACM SIGACCE, p11:1, DOI [10.1145/2513383.2513440, DOI 10.1145/2513383.2513440]
[5]   Use of Voice Activated Interfaces by People with Intellectual Disability [J].
Balasuriya, Saminda Sundeepa ;
Sitbon, Laurianne ;
Bayor, Andrew A. ;
Hoogstrate, Maria ;
Brereton, Margot .
PROCEEDINGS OF THE 30TH AUSTRALIAN COMPUTER-HUMAN INTERACTION CONFERENCE (OZCHI 2018), 2018, :102-112
[6]  
Bender Emily M., 2018, Transactions of the Association for Computational Linguistics, V6, P587, DOI [10.1162/tacl_a_00041, DOI 10.1162/TACL_A_00041]
[7]   Communication Breakdowns Between Families and Alexa [J].
Beneteau, Erin ;
Richards, Olivia K. ;
Zhang, Mingrui ;
Kientz, Julie A. ;
Yip, Jason ;
Hiniker, Alexis .
CHI 2019: PROCEEDINGS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,
[8]   Understanding the long-term use of smart speaker assistants [J].
Bentley, Frank ;
Silverman, Max ;
Wirasinghe, Rushani ;
White, Brooke ;
Lottridge, Danielle .
Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 2018, 2 (03)
[9]   Diversity for Design: A Framework for Involving Neurodiverse Children in the Technology Design Process [J].
Benton, Laura ;
Vasalou, Asimina ;
Khaled, Rilla ;
Johnson, Hilary ;
Gooch, Daniel .
32ND ANNUAL ACM CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI 2014), 2014, :3747-3756
[10]  
Bhuiyan H, 2017, IEEE I C SIGNAL IMAG, P474, DOI 10.1109/ICSIPA.2017.8120658