Advances in phone-based modeling for automatic accent classification

被引：48

作者：

Angkititrakul, P ^{[1
]}

Hansen, JHL

机构：

[1] Univ Texas, Sch Engn & Comp Sci, Richardson, TX 75083 USA

[2] Univ Colorado, Ctr Spoken Language Res, Robust Speech Proc Grp, Boulder, CO 80302 USA

来源：

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2006年 / 14卷 / 02期

关键词：

automatic accent classification; dialect modeling; open accent classification; phoneme recognition; spectral trajectory modeling; speech recognition;

D O I：

10.1109/TSA.2005.851980

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

It is suggested that algorithms capable of estimating and characterizing accent knowledge would provide valuable information in the development of more effective speech systems such as speech recognition, speaker identification, audio stream tagging in spoken document retrieval, channel monitoring, or voice conversion. Accent knowledge could be used for selection of alternative pronunciations in a lexicon, engage adaptation for acoustic modeling, or provide information for biasing a language model in large vocabulary speech recognition. In this paper, we propose a text-independent automatic accent classification system using phone-based models. Algorithm formulation begins with a series of experiments focused on capturing the spectral evolution information as potential accent sensitive cues. Alternative subspace representations using principal component analysis and linear discriminant analysis with projected trajectories are considered. Finally, an experimental study is performed to compare the spectral trajectory model framework to a traditional hidden Markov model recognition framework using an accent sensitive word corpus. System evaluation is performed using a corpus representing five English speaker groups with native American English, and English spoken with Mandarin Chinese, French, Thai, and Turkish accents for both male and female speakers.

引用

页码：634 / 646

页数：13

共 37 条

[1]

[Anonymous], 1997, Proceedings of the uropean Conference on Speech Communication and Technology

[2] Study of temporal features and frequency characteristics in American English foreign accent [J].

Arslan, LM ;

Hansen, JHL .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1997, 102 (01) :28-40

[3] Language accent classification in American English [J].

Arslan, LM ;

Hansen, JHL .

SPEECH COMMUNICATION, 1996, 18 (04) :353-367

[4]

BATLINER A, 2001, P EUROSPEECH AALB DE

[5] SCoPE, syllable core and periphery evaluation: Automatic syllabification and foreign accent identification [J].

Berkling, K .

SPEECH COMMUNICATION, 2001, 35 (1-2) :125-138

[6]

BLACKBURN CS, 1993, P EUROSPEECH, P1241

[7] FACTORS AFFECTING DEGREE OF PERCEIVED FOREIGN ACCENT IN ENGLISH-SENTENCES [J].

FLEGE, JE .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1988, 84 (01) :70-79

[8] THE DETECTION OF FRENCH ACCENT BY AMERICAN LISTENERS [J].

FLEGE, JE .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1984, 76 (03) :692-707

[9]

Fukada T, 1997, INT CONF ACOUST SPEE, P1403, DOI 10.1109/ICASSP.1997.596210

[10] SPEAKER-INDEPENDENT ISOLATED WORD RECOGNITION USING DYNAMIC FEATURES OF SPEECH SPECTRUM [J].

FURUI, S .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1986, 34 (01) :52-59

← 1 2 3 4 →