Advances in phone-based modeling for automatic accent classification

被引:47
作者
Angkititrakul, P [1 ]
Hansen, JHL
机构
[1] Univ Texas, Sch Engn & Comp Sci, Richardson, TX 75083 USA
[2] Univ Colorado, Ctr Spoken Language Res, Robust Speech Proc Grp, Boulder, CO 80302 USA
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2006年 / 14卷 / 02期
关键词
automatic accent classification; dialect modeling; open accent classification; phoneme recognition; spectral trajectory modeling; speech recognition;
D O I
10.1109/TSA.2005.851980
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
It is suggested that algorithms capable of estimating and characterizing accent knowledge would provide valuable information in the development of more effective speech systems such as speech recognition, speaker identification, audio stream tagging in spoken document retrieval, channel monitoring, or voice conversion. Accent knowledge could be used for selection of alternative pronunciations in a lexicon, engage adaptation for acoustic modeling, or provide information for biasing a language model in large vocabulary speech recognition. In this paper, we propose a text-independent automatic accent classification system using phone-based models. Algorithm formulation begins with a series of experiments focused on capturing the spectral evolution information as potential accent sensitive cues. Alternative subspace representations using principal component analysis and linear discriminant analysis with projected trajectories are considered. Finally, an experimental study is performed to compare the spectral trajectory model framework to a traditional hidden Markov model recognition framework using an accent sensitive word corpus. System evaluation is performed using a corpus representing five English speaker groups with native American English, and English spoken with Mandarin Chinese, French, Thai, and Turkish accents for both male and female speakers.
引用
收藏
页码:634 / 646
页数:13
相关论文
共 37 条