Golden speaker builder - An interactive tool for pronunciation training

Cited by: 36

Authors:

Ding, Shaojin [1]

Liberatore, Christopher [1]

Sonsaat, Sinem [2]

Lucic, Ivana [2]

Silpachai, Alif [2]

Zhao, Guanlong [1]

Chukharev-Hudilainen, Evgeny [2]

Levis, John [2]

Gutierrez-Osuna, Ricardo [1]

Affiliations:

[1] Texas A&M Univ, Dept Comp Sci & Engn, College Stn, TX 77843 USA

[2] Iowa State Univ, Dept English, Ames, IA USA

Source:

SPEECH COMMUNICATION | 2019 / Vol. 115

Keywords:

LINEAR TRANSFORMATION; EXPLICIT CORRECTION; FOREIGN ACCENT; LEARNER REPAIR; ERROR TYPES; FLUENCY; COMPREHENSIBILITY; RECASTS; SPEECH; NEGOTIATION;

DOI:

10.1016/j.specom.2019.10.005

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

The type of voice model used in Computer Assisted Pronunciation Instruction is a crucial factor in the quality of practice and the amount of uptake by language learners. As an example, prior research indicates that second-language learners are more likely to succeed when they imitate a speaker with a voice similar to their own, a so-called "golden speaker". This manuscript presents Golden Speaker Builder (GSB), a tool that allows learners to generate a personalized "golden-speaker" voice: one that mirrors their own voice but with a native accent. We describe the overall system design, including the web application with its user interface, and the underlying speech analysis/synthesis algorithms. Next, we present results from a series of listening tests, which show that GSB is capable of synthesizing such golden-speaker voices. Finally, we present results from a user study in a language-instruction setting, which show that practising with GSB leads to improved fluency and comprehensibility. We suggest reasons for why learners improved as they did and recommendations for the next iteration of the training.


Pages: 51-66

Page count: 16

