Comparing Humans and Automatic Speech Recognition Systems in Recognizing Dysarthric Speech

Cited by: 0
Authors
Mengistu, Kinfe Tadesse [1]
Rudzicz, Frank [1]
Affiliations
[1] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada
Source
ADVANCES IN ARTIFICIAL INTELLIGENCE | 2011 / Vol. 6657
Keywords
speech recognition; dysarthric speech; intelligibility
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Speech is a complex process that requires control and coordination of articulation, breathing, voicing, and prosody. Dysarthria is a manifestation of an inability to control and coordinate one or more of these aspects, which results in poorly articulated and barely intelligible speech. Hence individuals with dysarthria are rarely understood by human listeners. In this paper, we compare and evaluate how well dysarthric speech can be recognized by an automatic speech recognition (ASR) system and by naive adult human listeners. The results show that, despite the encouraging performance of ASR systems and contrary to the claims in other studies, human listeners on average perform better in recognizing single-word dysarthric speech. In particular, the mean word recognition accuracy of speaker-adapted monophone ASR systems on stimuli produced by six dysarthric speakers is 68.39%, while the mean percentage of correct responses from 14 naive human listeners on the same speech is 79.78%, as evaluated using a single-word multiple-choice intelligibility test.
Pages: 291-300
Page count: 10
Related papers
  • [1] Evaluation of an Automatic Speech Recognition Platform for Dysarthric Speech
    Calvo, Irene
    Tropea, Peppino
    Vigano, Mauro
    Scialla, Maria
    Cavalcante, Agnieszka B.
    Grajzer, Monika
    Gilardone, Marco
    Corbo, Massimo
    FOLIA PHONIATRICA ET LOGOPAEDICA, 2021, 73 (05) : 432 - 441
  • [2] Speech Technology for Automatic Recognition and Assessment of Dysarthric Speech: An Overview
    Bhat, Chitralekha
    Strik, Helmer
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2025, 68 (02) : 547 - 577
  • [3] SPEECH RECOGNITION-BASED FEATURE EXTRACTION FOR ENHANCED AUTOMATIC SEVERITY CLASSIFICATION IN DYSARTHRIC SPEECH
    Choi, Yerin
    Lee, Jeehyun
    Ko, Myoung-Wan
    2024 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2024 : 953 - 960
  • [4] The relationship between perceptual disturbances in dysarthric speech and automatic speech recognition performance
    Tu, Ming
    Wisler, Alan
    Berisha, Visar
    Liss, Julie M.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 140 (05) : EL416 - EL422
  • [5] Automatic Prediction of Speech Evaluation Metrics for Dysarthric Speech
    Laaridh, Imed
    Ben Kheder, Waad
    Fredouille, Corinne
    Meunier, Christine
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017 : 1834 - 1838
  • [6] Debiased Automatic Speech Recognition for Dysarthric Speech via Sample Reweighting with Sample Affinity Test
    Kim, Eungbeom
    Chae, Yunkee
    Sim, Jaeheon
    Lee, Kyogu
    INTERSPEECH 2023, 2023 : 1508 - 1512
  • [7] Use of Speech Impairment Severity for Dysarthric Speech Recognition
    Geng, Mengzhe
    Jin, Zengrui
    Wang, Tianzi
    Hu, Shujie
    Deng, Jiajun
    Cui, Mingyu
    Li, Guinan
    Yu, Jianwei
    Xie, Xurong
    Liu, Xunying
    INTERSPEECH 2023, 2023 : 2328 - 2332
  • [8] Measuring the intelligibility of dysarthric speech through automatic speech recognition in a pluricentric language
    Xue, Wei
    Cucchiarini, Catia
    van Hout, Roeland
    Strik, Helmer
    SPEECH COMMUNICATION, 2023, 148 : 23 - 30
  • [9] Multi-Stage DNN Training for Automatic Recognition of Dysarthric Speech
    Yilmaz, Emre
    Ganzeboom, Mario
    Cucchiarini, Catia
    Strik, Helmer
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017 : 2685 - 2689
  • [10] Optimization of dysarthric speech recognition
    Chen, FX
    Kostov, A
    PROCEEDINGS OF THE 19TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOL 19, PTS 1-6: MAGNIFICENT MILESTONES AND EMERGING OPPORTUNITIES IN MEDICAL ENGINEERING, 1997, 19 : 1436 - 1439