An algorithm to improve speech recognition in noise for hearing-impaired listeners

Cited by: 177
Authors
Healy, Eric W. [1 ,2 ]
Yoho, Sarah E. [1 ,2 ]
Wang, Yuxuan [2 ,3 ]
Wang, DeLiang [2 ,3 ]
Affiliations
[1] Ohio State Univ, Dept Speech & Hearing Sci, Columbus, OH 43210 USA
[2] Ohio State Univ, Ctr Cognit & Brain Sci, Columbus, OH 43210 USA
[3] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
Keywords
GAP DETECTION; BACKGROUND-NOISE; INTELLIGIBILITY; MASKING; REDUCTION; INTEGRATION; PERCEPTION; THRESHOLD;
DOI
10.1121/1.4820893
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
Despite considerable effort, monaural (single-microphone) algorithms capable of increasing the intelligibility of speech in noise have remained elusive. Successful development of such an algorithm is especially important for hearing-impaired (HI) listeners, given their particular difficulty in noisy backgrounds. In the current study, an algorithm based on binary masking was developed to separate speech from noise. Unlike the ideal binary mask, which requires prior knowledge of the premixed signals, the masks used to segregate speech from noise in the current study were estimated by training the algorithm on speech not used during testing. Sentences were mixed with speech-shaped noise and with babble at various signal-to-noise ratios (SNRs). Testing using normal-hearing and HI listeners indicated that intelligibility increased following processing in all conditions. These increases were larger for HI listeners, for the modulated background, and for the least-favorable SNRs. They were also often substantial, allowing several HI listeners to improve intelligibility from scores near zero to values above 70%. © 2013 Acoustical Society of America.
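The abstract contrasts the study's estimated masks with the ideal binary mask, which needs the premixed speech and noise. The following is only a minimal sketch of that reference concept, not the authors' trained algorithm: it mixes a stand-in signal with noise at a target SNR and marks each time-frequency unit 1 where the local SNR exceeds a criterion. The function names, the plain windowed-FFT front end, the frame sizes, and the 0 dB local criterion are all assumptions made for illustration; the abstract does not specify any of them.

```python
import numpy as np

def mix_at_snr(speech, noise, snr_db):
    """Scale the noise so the overall speech-to-noise power ratio equals snr_db, then add."""
    speech_power = np.mean(speech ** 2)
    noise_power = np.mean(noise ** 2)
    gain = np.sqrt(speech_power / (noise_power * 10 ** (snr_db / 10)))
    scaled_noise = gain * noise
    return speech + scaled_noise, scaled_noise

def tf_energy(signal, frame_len=256, hop=128):
    """Energy per (frame, frequency bin) from a Hann-windowed FFT (assumed front end)."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hanning(frame_len)
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    return np.abs(np.fft.rfft(frames, axis=1)) ** 2

def ideal_binary_mask(speech, scaled_noise, lc_db=0.0):
    """1 where the local SNR exceeds lc_db (assumed criterion), else 0.

    Requires the premixed signals, which is why it is called 'ideal'.
    """
    eps = 1e-12  # avoids log of zero in silent units
    local_snr_db = 10 * np.log10(
        (tf_energy(speech) + eps) / (tf_energy(scaled_noise) + eps)
    )
    return (local_snr_db > lc_db).astype(np.uint8)

# Stand-in signals: a 440 Hz tone in place of speech, white noise in place of
# speech-shaped noise or babble (both are placeholders, not the study's stimuli).
rng = np.random.default_rng(0)
t = np.arange(16000) / 16000.0
speech = np.sin(2 * np.pi * 440 * t)
noise = rng.standard_normal(t.size)

mixture, scaled_noise = mix_at_snr(speech, noise, snr_db=-5.0)
mask = ideal_binary_mask(speech, scaled_noise)
```

In a masking system of this general kind, the mask would be applied to the time-frequency representation of `mixture` before resynthesis; a practical system has to estimate the mask from the mixture alone, which is the problem the study addresses.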
Pages: 3029-3038
Number of pages: 10