Automated Speech Production Assessment of Hard of Hearing Children

Cited by: 4
Author
Czap, Laszlo [1]
Affiliation
[1] Univ Miskolc, Inst Automat & Infocommun, H-3515 Miskolc, Hungary
Keywords
Acoustic signal processing; automatic speech quality assessment; hard of hearing speech; pronunciation analysis; VOICE; FEATURES
DOI
10.1109/JSTSP.2019.2949389
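Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract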
A new method for the automated speech production assessment (ASPA) of hearing impaired children is presented in this paper, providing feedback about the pronunciation quality of words and sentences uttered during unsupervised practice in the course of speech development. A database of the sounds produced by hearing impaired subjects was set up and assessed with a subjective test. The Mean Opinion Score (MOS) obtained in this way constituted the reference for automated assessment. The essence of the ASPA method is the joint assessment of sound and rhythm errors. After several methods were tested, the output activity of the neural networks trained to classify speech sounds was used to assess sound correctness. Dynamic time warping, adapted to the speech of the hearing impaired, was used to determine rhythm errors. ASPA provides input data for an expert system for the selection of the next word to be practiced. The novelty of the procedure is that it provides a method for the assessment of non-typifiable pronunciation errors. Results were compared with individual expert assessment and subjective tests. Automated assessment surpassed the overwhelming majority of subjective assessors and approximated the correctness of individual expert assessment. Our ASPA method is implemented in our "Speech Assistant" application, which also provides a language-independent sound visualization module and is successfully applied to assist the hearing impaired.
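The abstract names dynamic time warping as the tool for finding rhythm errors but does not describe how it was adapted to the speech of the hearing impaired. The sketch below is therefore only textbook DTW, not the paper's method. It assumes one scalar feature per frame, and the function names `dtw_path` and `rhythm_deviation`, together with the idea of scoring rhythm by how far the warping path strays from the diagonal, are illustrative assumptions.

```python
import math


def dtw_path(ref, test):
    """Textbook DTW on two non-empty 1-D sequences; returns (cost, path).

    Assumption: one scalar feature per frame. Real speech features are
    usually vectors (e.g. cepstral coefficients) with a vector distance.
    """
    n, m = len(ref), len(test)
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(ref[i - 1] - test[j - 1])
            cost[i][j] = d + min(cost[i - 1][j - 1],
                                 cost[i - 1][j],
                                 cost[i][j - 1])

    # Backtrack from the end cell to the start cell along cheapest moves.
    i, j = n, m
    path = [(i - 1, j - 1)]
    while (i, j) != (1, 1):
        _, i, j = min((cost[i - 1][j - 1], i - 1, j - 1),
                      (cost[i - 1][j], i - 1, j),
                      (cost[i][j - 1], i, j - 1))
        path.append((i - 1, j - 1))
    path.reverse()
    return cost[n][m], path


def rhythm_deviation(path, n, m):
    """Hypothetical rhythm measure: mean distance of the warping path
    from the diagonal, in normalized time (0 = identical timing)."""
    rn, rm = max(n - 1, 1), max(m - 1, 1)
    return sum(abs(i / rn - j / rm) for i, j in path) / len(path)


reference = [0, 1, 2, 3]
stretched = [0, 0, 0, 1, 2, 3]  # same sounds, first one held too long
cost, path = dtw_path(reference, stretched)
deviation = rhythm_deviation(path, len(reference), len(stretched))
print(cost, round(deviation, 3))
```

In this toy case the alignment cost is zero because every frame finds a matching frame, while the deviation is non-zero because the first sound is prolonged. That separation is one way to picture the joint assessment of sound and rhythm errors the abstract describes.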
Pages: 380-389
Page count: 10