Listening to speech in a background of other talkers: Effects of talker number and noise vocoding

Cited by: 137
Authors
Rosen, Stuart [1]
Souza, Pamela [2]
Ekelund, Caroline [1]
Majeed, Arooj A. [3]
Affiliations
[1] UCL Speech Hearing & Phonet Sci, London WC1N 1PF, England
[2] Northwestern Univ, Dept Commun Sci & Disorders, Knowles Hearing Ctr, Evanston, IL 60208 USA
[3] UCL Ear Inst, London WC1X 8EE, England
Funding
UK Medical Research Council; US National Institutes of Health
Keywords
INFORMATIONAL MASKING; COCHLEAR-IMPLANT; ENERGETIC MASKING; NORMAL-HEARING; RECOGNITION; PERCEPTION; INTELLIGIBILITY; RECEPTION; IDENTIFICATION; INTONATION;
DOI
10.1121/1.4794379
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Some of the most common interfering background sounds a listener experiences are the sounds of other talkers. In Experiment 1, recognition for natural Institute of Electrical and Electronics Engineers (IEEE) sentences was measured in normal-hearing adults at two fixed signal-to-noise ratios (SNRs) in 16 backgrounds with the same long-term spectrum: unprocessed speech babble (1, 2, 4, 8, and 16 talkers), noise-vocoded versions of the babbles (12 channels), noise modulated with the wide-band envelope of the speech babbles, and unmodulated noise. All talkers were adult males. For a given number of talkers, natural speech was always the most effective masker. The greatest changes in performance occurred as the number of talkers in the maskers increased from 1 to 2 or 4, with small changes thereafter. In Experiment 2, the same targets and maskers (1, 2, and 16 talkers) were used to measure speech reception thresholds (SRTs) adaptively. Periodicity in the target was also manipulated by noise-vocoding, which led to considerably higher SRTs. The greatest masking effect always occurred for the masker type most similar to the target, while the effects of the number of talkers were generally small. Implications are drawn with reference to glimpsing, informational vs energetic masking, overall SNR, and aspects of periodicity. (C) 2013 Acoustical Society of America.
Pages: 2431-2443
Page count: 13