Integrating Voice Quality Cues in the Pitch Perception of Speech and Non-speech Utterances

被引:27
作者
Kuang, Jianjing [1 ]
Liberman, Mark [1 ]
机构
[1] Univ Penn, Dept Linguist, Philadelphia, PA 19104 USA
关键词
pitch perception; voice quality; spectral cues; speech perception; cue integration; prosody; speech normalization;
D O I
10.3389/fpsyg.2018.02147
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Pitch perception plays a crucial role in speech processing. Since F0 is highly ambiguous and variable in the speech signal, effective pitch-range perception is important in perceiving the intended linguistic pitch targets. This study argues that the effectiveness of pitch-range perception can be achieved by taking advantage of other signal-internal information that co-varies with F0, such as voice quality cues. This study provides direct perceptual evidence that voice quality cues as an indicator of pitch ranges can effectively affect the pitch-height perception. A series of forced-choice pitch classification experiments with four spectral conditions were conducted to investigate the degree to which manipulating spectral slope affects pitch-height perception. Both non-speech and speech stimuli were investigated. The results suggest that the pitch classification function is significantly shifted under different spectral conditions. Listeners are likely to perceive a higher pitch when the spectrum has higher high-frequency energy (i.e., tenser phonation). The direction of the shift is consistent with the correlation between voice quality and pitch range. Moreover, cue integration is affected by the speech mode, where listeners are more sensitive to relative difference within an utterance when hearing speech stimuli. This study generally supports the hypothesis that voice quality is an important enhancement cue for pitch range.
引用
收藏
页数:11
相关论文
共 56 条
[1]   Voice register in Suai (Kuai): An analysis of perceptual and acoustic data [J].
Abramson, AS ;
L-Thongkum, TL ;
Nye, PW .
PHONETICA, 2004, 61 (2-3) :147-171
[2]   Symmetric interactions and interference between pitch and timbre [J].
Allen, Emily J. ;
Oxenham, Andrew J. .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 135 (03) :1371-1379
[3]  
Andruski J.E., 2000, Journal of the International Phonetic Association, V30, P37, DOI DOI 10.1017/S0025100300006654
[4]  
Baken R. J., 2000, Clinical Measurement of Speech and Voice
[5]   Perception of pitch location within a speaker's range: Fundamental frequency, voice quality and speaker sex [J].
Bishop, Jason ;
Keating, Patricia .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 132 (02) :1100-1112
[6]   The phonetics of register in Takhian Thong Chong [J].
DiCanio, Christian T. .
JOURNAL OF THE INTERNATIONAL PHONETIC ASSOCIATION, 2009, 39 (02) :162-188
[7]  
Epstein M.A., 2002, VOICE QUALITY PROSOD
[8]   An acoustic and electroglottographic study of White Hmong tone and phonation [J].
Esposito, Christina M. .
JOURNAL OF PHONETICS, 2012, 40 (03) :466-476
[9]   Variation in contrastive phonation in Santa Ana Del Valle Zapotec [J].
Esposito, Christina M. .
JOURNAL OF THE INTERNATIONAL PHONETIC ASSOCIATION, 2010, 40 (02) :181-198
[10]   Modeling the voice source in terms of spectral slopes [J].
Garellek, Marc ;
Samlan, Robin ;
Gerratt, Bruce R. ;
Kreiman, Jody .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 139 (03) :1404-1410