TONE RECOGNITION FOR CONTINUOUS ACCENTED MANDARIN CHINESE

Cited by: 0
Authors
Wu, Jiang [1]
Zahorian, Stephen A. [1]
Hu, Hongbing [1]
Affiliation
[1] SUNY Binghamton, Dept Elect & Comp Engn, Binghamton, NY 13902 USA
Source
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013
Keywords
tone recognition; continuous Mandarin Chinese; human listeners; neural networks; HMMs;
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification codes
070206; 082403;
Abstract
In this paper, the ability of human listeners to recognize tones from continuous Mandarin Chinese is evaluated and compared to the accuracy of automatic systems for tone classification and recognition. All tones used for experimentation were extracted from the RASC863 continuous Mandarin Chinese database. The human listeners are native speakers of Mandarin, and the automatic methods consist of tone classification using neural networks and tone recognition using Hidden Markov Models (HMMs). Features used for the automatic methods are a combination of spectral/temporal features, energy contours, and pitch contours. When very little context is used (i.e., vowel segments only), human and machine performance are comparable. However, as the context interval is increased, human performance is much better than the best machine performance obtained.
Pages: 7180-7183
Page count: 4