Creaky voice as a function of tonal categories and prosodic boundaries

Cited by: 6
Authors
Kuang, Jianjing [1]
Affiliation
[1] Univ Penn, Dept Linguist, Philadelphia, PA 19104 USA
Source
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION | 2017
Keywords
voice quality; creaky voice; Mandarin; tone; intonation; automatic detection; glottalization; perception; phonation; noise
DOI
10.21437/Interspeech.2017-1578
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This study examines the distribution of creaky voice in continuous Mandarin speech. A creaky voice detector was used to automatically detect occurrences of creaky voice in a large-scale Mandarin corpus (the Sinica COSPRO corpus). Because prosodic information is annotated in the corpus, we were able to examine the distribution of creaky voice as a function of the interaction between tone and prosodic structure. As expected, among the five tonal categories (four lexical tones and one neutral tone), creaky voice is most likely to occur with Tone 3 and the neutral tone, followed by Tone 2 and Tone 4. Prosodic boundaries also play an important role: the likelihood of creak increases with the size of the prosodic boundary, regardless of tonal category. It is also confirmed that the pitch range for the occurrence of creaky voice is 110 Hz for male speakers and 170 Hz for female speakers, consistent with previous small-scale studies. Finally, male speakers have a higher overall rate of creaky voice than female speakers. Altogether, this study validates the hypotheses of previous studies and provides a better understanding of voice-source variation in different prosodic conditions.
Pages: 3216-3220
Page count: 5