Effect of simulated hearing loss on automatic speech recognition for an android robot-patient

被引：0

作者：

Roehl, Jan Hendrik ^{[1
]}

Guenther, Ulf ^{[2
]}

Hein, Andreas ^{[1
,3
]}

Cauchi, Benjamin ^{[3
,4
]}

机构：

[1] Carl von Ossietzky Univ Oldenburg, Hlth Serv Res, Assistance Syst & Med Device Technol, Oldenburg, Germany

[2] Klinikum Oldenburg AoR, Oldenburg, Germany

[3] Inst Informat Technol, R&D Div Hlth, OFFIS e V, Oldenburg, Germany

[4] Bremerhaven Univ Appl Sci, Management & Informat Syst, Bremerhaven, Germany

来源：

FRONTIERS IN ROBOTICS AND AI | 2024年 / 11卷

关键词：

hearing loss simulation; automatic speech recognition; android robot-patient; simulated patient; patient simulation; INTENSIVE-CARE UNIT; LOUDNESS RECRUITMENT; THRESHOLD ELEVATION; DELIRIUM; INTELLIGIBILITY; VALIDATION; PREDICTOR; SENTENCES; IMPACT;

D O I：

10.3389/frobt.2024.1391818

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

The importance of simulating patient behavior for medical assessment training has grown in recent decades due to the increasing variety of simulation tools, including standardized/simulated patients, humanoid and android robot-patients. Yet, there is still a need for improvement of current android robot-patients to accurately simulate patient behavior, among which taking into account their hearing loss is of particular importance. This paper is the first to consider hearing loss simulation in an android robot-patient and its results provide valuable insights for future developments. For this purpose, an open-source dataset of audio data and audiograms from human listeners was used to simulate the effect of hearing loss on an automatic speech recognition (ASR) system. The performance of the system was evaluated in terms of both word error rate (WER) and word information preserved (WIP). Comparing different ASR models commonly used in robotics, it appears that the model size alone is insufficient to predict ASR performance in presence of simulated hearing loss. However, though absolute values of WER and WIP do not predict the intelligibility for human listeners, they do highly correlate with it and thus could be used, for example, to compare the performance of hearing aid algorithms.

引用

页数：11

共 50 条

[21] Effect of Face Masks on Automatic Speech Recognition Accuracy for Mandarin [J].

Li, Xiaoya ;

Ni, Ke ;

Huang, Yu .

APPLIED SCIENCES-BASEL, 2024, 14 (08)

[22] Effect of Speaker Age on Speech Recognition and Perceived Listening Effort in Older Adults With Hearing Loss [J].

McAuliffe, Megan J. ;

Wilding, Phillipa J. ;

Rickard, Natalie A. ;

O'Beirne, Greg A. .

JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2012, 55 (03) :838-847

[23] Improving hearing-aid gains based on automatic speech recognition [J].

Fontan, Lionel ;

Le Coz, Maxime ;

Azzopardi, Charlotte ;

Stone, Michael A. ;

Fuellgrabe, Christian .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2020, 148 (03) :EL227-EL233

[24] Automatic Speech Recognition Services: Deaf and Hard-of-Hearing Usability [J].

Glasser, Abraham .

CHI EA '19 EXTENDED ABSTRACTS: EXTENDED ABSTRACTS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,

[25] Effects of Various Extents of High-Frequency Hearing Loss on Speech Recognition and Gap Detection at Low Frequencies in Patients with Sensorineural Hearing Loss [J].

Li, Bei ;

Guo, Yang ;

Yang, Guang ;

Feng, Yanmei ;

Yin, Shankai .

NEURAL PLASTICITY, 2017, 2017

[26] Contribution of Consonant Landmarks to Speech Recognition in Simulated Acoustic-Electric Hearing [J].

Chen, Fei ;

Loizou, Philipos C. .

EAR AND HEARING, 2010, 31 (02) :259-267

[27] Automatic speech recognition by a Kinect sensor for a robot under ego noises [J].

Wang J. ;

Gao Y. ;

Zhang J. ;

Wei J. ;

Dang J. .

Qinghua Daxue Xuebao/Journal of Tsinghua University, 2017, 57 (09) :921-925

[28] Do Older Listeners With Hearing Loss Benefit From Dynamic Pitch for Speech Recognition in Noise? [J].

Shen, Jing ;

Souza, Pamela E. .

AMERICAN JOURNAL OF AUDIOLOGY, 2017, 26 (03) :462-466

[29] Automatic Speech Recognition for Live TV Subtitling for Hearing-Impaired People [J].

Obach, Michael ;

Lehr, Maider ;

Arruti, Andoni .

CHALLENGES FOR ASSISTIVE TECHNOLOGY, 2007, 20 :286-291

[30] Effect of Language Resources on Automatic Speech Recognition for Amharic [J].

Tachbelie, Martha Yifiru ;

Abate, Solomon Teferra .

PROCEEDINGS OF THE 2015 12TH IEEE AFRICON INTERNATIONAL CONFERENCE - GREEN INNOVATION FOR AFRICAN RENAISSANCE (AFRICON), 2015,

← 1 2 3 4 5 →