Self-conducted speech audiometry using automatic speech recognition: Simulation results for listeners with hearing loss

被引：6

作者：

Ooster, Jasper ^{[1
,3
]}

Tuschen, Laura ^{[2
]}

Meyer, Bernd T. ^{[1
,3
]}

机构：

[1] Carl von Ossietzky Univ Oldenburg, Commun Acoust, D-26129 Oldenburg, Germany

[2] Fraunhofer Inst Digital Media Technol IDMT, Oldenburg Branch Hearing Speech & Audio Technol HS, D-26129 Oldenburg, Germany

[3] Cluster Excellence Hearing4all, Oldenburg, Germany

来源：

COMPUTER SPEECH AND LANGUAGE | 2023年 / 78卷

关键词：

Speech audiometry; Automatic speech recognition; Matrix sentence test; Unsupervised measurement; NOISE; INTELLIGIBILITY; VALIDATION; SENTENCES; TESTS;

D O I：

10.1016/j.csl.2022.101447

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Speech-in-noise tests are an important tool for assessing hearing impairment, the successful fitting of hearing aids, as well as for research in psychoacoustics. An important drawback of many speech-based tests is the requirement of an expert to be present during the measurement, in order to assess the listener's performance. This drawback may be largely overcome through the use of automatic speech recognition (ASR), which utilizes automatic response logging. However, such an unsupervised system may reduce the accuracy due to the introduction of potential errors. In this study, two different ASR systems are compared for automated testing: A system with a feed-forward deep neural network (DNN) from a previous study (Ooster et al., 2018), as well as a state-of-the-art system utilizing a time-delay neural network (TDNN). The dynamic measurement procedure of the speech intelligibility test was simulated considering the subjects' hearing loss and selecting from real recordings of test participants. The ASR systems' performance is investigated based on responses of 73 listeners, ranging from normal -hearing to severely hearing-impaired as well as read speech from cochlear implant listeners. The feed-forward DNN produced accurate testing results for NH and unaided HI listeners but a decreased measurement accuracy was found in the simulation of the adaptive measurement procedure when considering aided severely HI listeners, recorded in noisy environments with a loudspeaker setup. The TDNN system produces error rates of 0.6% and 3.0% for deletion and insertion errors, respectively. We estimate that the SRT deviation with this system is below 1.38 dB for 95% of the users. This result indicates that a robust unsupervised conduction of the matrix sentence test is possible with a similar accuracy as with a human supervisor even when considering noisy conditions and altered or disordered speech from elderly severely HI listeners and listeners with a CI.

引用

页数：14

共 50 条

[1] Predicting Speech Perception in Older Listeners with Sensorineural Hearing Loss Using Automatic Speech Recognition
Fontan, Lionel
Cretin-Maitenaz, Tom
Fullgrabe, Christian
TRENDS IN HEARING, 2020, 24 : 2331216520914769
[2] Automatic Speech Recognition Predicts Speech Intelligibility and Comprehension for Listeners With Simulated Age-Related Hearing Loss
Fontan, Lionel
Ferrane, Isabelle
Farinas, Jerome
Pinquier, Julien
Tardieu, Julien
Magnen, Cynthia
Gaillard, Pascal
Aumont, Xavier
Fullgrabee, Christian
JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2017, 60 (09): : 2394 - 2405
[3] Localization in speech mixtures by listeners with hearing loss
Best, Virginia
Carlile, Simon
Kopco, Norbert
van Schaik, Andre
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2011, 129 (05) : E210 - E215
[4] Effect of simulated hearing loss on automatic speech recognition for an android robot-patient
Roehl, Jan Hendrik
Guenther, Ulf
Hein, Andreas
Cauchi, Benjamin
FRONTIERS IN ROBOTICS AND AI, 2024, 11
[5] Effect of Audibility and Suprathreshold Deficits on Speech Recognition for Listeners With Unilateral Hearing Loss
Bost, Tim J. M.
Versfeld, Niek J.
Goverts, S. Theo
EAR AND HEARING, 2019, 40 (04) : 1025 - 1034
[6] Speech intelligibility in different types of audiograms and speech audiometry by using the simulated hearing loss on the speech material with normal hearing people
Suskovic, Davor
Fajt, Sinisa
Olujic, Vladimir
AUTOMATIKA, 2021, 62 (01) : 118 - 126
[7] Automated Speech Audiometry: Can It Work Using Open-Source Pre-Trained Kaldi-NL Automatic Speech Recognition?
Araiza-Illan, Gloria
Meyer, Luke
Truong, Khiet P.
Baskent, Deniz
TRENDS IN HEARING, 2024, 28
[8] The irrelevant speech effect in listeners with normal hearing and self-reported hearing loss
Schlittenlacher, Josef
Brogan, Megan
ACOUSTICAL SCIENCE AND TECHNOLOGY, 2024, 45 (05) : 303 - 306
[9] Do Older Listeners With Hearing Loss Benefit From Dynamic Pitch for Speech Recognition in Noise?
Shen, Jing
Souza, Pamela E.
AMERICAN JOURNAL OF AUDIOLOGY, 2017, 26 (03) : 462 - 466
[10] Spoken Word Recognition Errors in Speech Audiometry: A Measure of Hearing Performance?
Coene, Martine
van der Lee, Anneke
Govaerts, Paul J.
BIOMED RESEARCH INTERNATIONAL, 2015, 2015

← 1 2 3 4 5 →