Determining the energetic and informational components of speech-on-speech masking

被引：90

作者：

Kidd, Gerald, Jr. ^{[1
,2
]}

Mason, Christine R. ^{[1
,2
]}

Swaminathan, Jayaganesh ^{[1
,2
]}

Roverud, Elin ^{[1
,2
]}

Clayton, Kameron K. ^{[1
,2
,3
]}

Best, Virginia ^{[1
,2
]}

机构：

[1] Boston Univ, Dept Speech Language & Hearing Sci, 635 Commonwealth Ave, Boston, MA 02215 USA

[2] Boston Univ, Hearing Res Ctr, 635 Commonwealth Ave, Boston, MA 02215 USA

[3] Harvard Med Sch, Program Speech & Hearing Biosci & Technol, Div Med Sci, 260 Longwood Ave, Boston, MA 02115 USA

来源：

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2016年 / 140卷 / 01期

基金：

美国国家卫生研究院;

关键词：

TIME-FREQUENCY SEGREGATION; COCKTAIL-PARTY PROBLEM; SPATIAL RELEASE; MULTICOMPONENT MASKERS; SIMULTANEOUS TALKERS; RECEPTION THRESHOLD; INTERFERING-SPEECH; FLUCTUATING NOISE; COMPETING SPEECH; REVERSED SPEECH;

D O I：

10.1121/1.4954748

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Identification of target speech was studied under masked conditions consisting of two or four independent speech maskers. In the reference conditions, the maskers were colocated with the target, the masker talkers were the same sex as the target, and the masker speech was intelligible. The comparison conditions, intended to provide release from masking, included different-sex target and masker talkers, time-reversal of the masker speech, and spatial separation of the maskers from the target. Significant release from masking was found for all comparison conditions. To determine whether these reductions in masking could be attributed to differences in energetic masking, ideal time-frequency segregation (ITFS) processing was applied so that the time-frequency units where the masker energy dominated the target energy were removed. The remaining target-dominated "glimpses" were reassembled as the stimulus. Speech reception thresholds measured using these resynthesized ITFS-processed stimuli were the same for the reference and comparison conditions supporting the conclusion that the amount of energetic masking across conditions was the same. These results indicated that the large release from masking found under all comparison conditions was due primarily to a reduction in informational masking. Furthermore, the large individual differences observed generally were correlated across the three masking release conditions. (C) 2016 Author(s). All article content, except where otherwise noted, is licensed under a Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

引用

页码：132 / 144

页数：13

共 66 条

[1] Determination of the potential benefit of time-frequency gain manipulation [J].