The Spectral Dynamics of Vowels in Mandarin Chinese

Times cited: 0
Authors
Yuan, Jiahong [1]
Affiliations
[1] University of Pennsylvania, Linguistic Data Consortium, Philadelphia, PA 19104, USA
Source
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 | 2013
Keywords
vowels; diphthongs; triphthongs; spectral dynamics; Mandarin Chinese; coarticulated vowels; Australian English; classification
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This study investigated the dynamic spectral patterns of vowels in Mandarin Chinese using a corpus of monosyllabic words spoken in isolation. Mel-frequency cepstral coefficients (MFCCs) were parameterized in different ways to test the nature of the dynamic information in vowels through automatic vowel classification. Compared to the MFCCs extracted at the vowel midpoint, using the MFCCs extracted at two or three points (vowel onset, offset, and midpoint) greatly improved classification accuracies. Legendre polynomials fitted to the MFCCs over the entire vowel duration achieved approximately 30% relative error reductions over the three-point model. Euclidean cepstral distance was employed to measure the magnitude of spectral change. A negative correlation was found between the rate of spectral change and vowel duration. Vowel-dependent spectral changes appear primarily in the first half of a vowel. There is great diversity among the diphthongs and a considerable overlap between the diphthongs and the monophthongs in terms of the spectral dynamics.
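The three parameterizations named in the abstract (three-point sampling, Legendre polynomial fitting over the whole vowel, and Euclidean cepstral distance) can be illustrated with a minimal sketch. It assumes the MFCCs of one vowel are already available as an array with one row per frame. The function names, the polynomial degree of 3, and the onset-to-offset definition of the rate of change are illustrative assumptions, not details taken from the paper.

```python
import numpy as np
from numpy.polynomial import legendre


def three_point_features(mfcc):
    """Concatenate the MFCC frames at vowel onset, midpoint, and offset."""
    mfcc = np.asarray(mfcc, dtype=float)
    mid = mfcc.shape[0] // 2
    return np.concatenate([mfcc[0], mfcc[mid], mfcc[-1]])


def legendre_features(mfcc, degree=3):
    """Least-squares fit of a Legendre series to each cepstral trajectory.

    mfcc has shape (n_frames, n_coeffs). The result has shape
    (degree + 1, n_coeffs). The degree is an assumed value.
    """
    mfcc = np.asarray(mfcc, dtype=float)
    # Map frame positions onto [-1, 1], where Legendre polynomials are orthogonal.
    t = np.linspace(-1.0, 1.0, mfcc.shape[0])
    return legendre.legfit(t, mfcc, degree)


def cepstral_distance(frame_a, frame_b):
    """Euclidean distance between two MFCC vectors."""
    diff = np.asarray(frame_a, dtype=float) - np.asarray(frame_b, dtype=float)
    return float(np.linalg.norm(diff))


def rate_of_change(mfcc, duration_s):
    """Assumed definition: onset-to-offset distance divided by duration in seconds."""
    return cepstral_distance(mfcc[0], mfcc[-1]) / duration_s
```

Because the fit is taken over every frame, the Legendre coefficients summarize the shape of the whole trajectory, whereas the three-point vector only sees the onset, midpoint, and offset frames.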
Pages: 1192-1196
Page count: 5