Automated Speech Production Assessment of Hard of Hearing Children

Cited by: 4

Author:

Czap, Laszlo [1]

Affiliation:

[1] Univ Miskolc, Inst Automat & Infocommun, H-3515 Miskolc, Hungary

Source:

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING | 2020 / Vol. 14 / No. 02

Keywords:

Acoustic signal processing; automatic speech quality assessment; hard of hearing speech; pronunciation analysis; VOICE; FEATURES;

DOI:

10.1109/JSTSP.2019.2949389

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

A new method for the automated speech production assessment (ASPA) of hearing impaired children is presented in this paper, providing feedback about the pronunciation quality of words and sentences uttered during unsupervised practice in the course of speech development. A database of the sounds produced by hearing impaired subjects was set up and assessed with a subjective test. The Mean Opinion Score (MOS) obtained in this way constituted the reference for automated assessment. The essence of the ASPA method is the joint assessment of sound and rhythm errors. After several methods were tested, the output activity of the neural networks trained to classify speech sounds was used to assess sound correctness. Dynamic time warping, adapted to the speech of the hearing impaired, was used to determine rhythm errors. ASPA provides input data for an expert system for the selection of the next word to be practiced. The novelty of the procedure is that it provides a method for the assessment of non-typifiable pronunciation errors. Results were compared with individual expert assessment and subjective tests. Automated assessment surpassed the overwhelming majority of subjective assessors and approximated the correctness of individual expert assessment. Our ASPA method is implemented in our "Speech Assistant" application, which also provides a language-independent sound visualization module and is successfully applied to assist the hearing impaired.
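The abstract names dynamic time warping (DTW) as the tool for detecting rhythm errors. The record does not describe how the paper adapts DTW to the speech of the hearing impaired, so the following is only a minimal sketch of textbook DTW, assuming hypothetical one-dimensional feature sequences and an absolute-difference local cost. The function name `dtw` and the toy inputs are illustrative and not taken from the paper.

```python
def dtw(reference, utterance):
    """Textbook DTW between two 1-D sequences (illustrative, not the paper's variant).

    Returns the accumulated alignment cost and the warping path as a list of
    (reference_index, utterance_index) pairs.
    """
    n, m = len(reference), len(utterance)
    inf = float("inf")
    # cost[i][j] = best accumulated cost aligning reference[:i] with utterance[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(reference[i - 1] - utterance[j - 1])
            cost[i][j] = local + min(cost[i - 1][j - 1],  # match
                                     cost[i - 1][j],      # reference frame repeated
                                     cost[i][j - 1])      # utterance frame repeated
    # Backtrack from the end to recover the warping path.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        diag, up, left = cost[i - 1][j - 1], cost[i - 1][j], cost[i][j - 1]
        best = min(diag, up, left)
        if best == diag:
            i, j = i - 1, j - 1
        elif best == up:
            i -= 1
        else:
            j -= 1
    path.reverse()
    return cost[n][m], path


if __name__ == "__main__":
    # Hypothetical toy contours: the utterance holds the second value too long.
    distance, path = dtw([1, 2, 3], [1, 2, 2, 3])
    print(distance, path)
```

In a sketch like this, stretches where the warping path leaves the diagonal indicate frames that were held longer or shorter than in the reference, which is one plausible way to read timing deviations off an alignment.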


Pages: 380-389

Page count: 10
