Golden speaker builder - An interactive tool for pronunciation training

被引:36
作者
Ding, Shaojin [1 ]
Liberatore, Christopher [1 ]
Sonsaat, Sinem [2 ]
Lucic, Ivana [2 ]
Silpachai, Alif [2 ]
Zhao, Guanlong [1 ]
Chukharev-Hudilainen, Evgeny [2 ]
Levis, John [2 ]
Gutierrez-Osuna, Ricardo [1 ]
机构
[1] Texas A&M Univ, Dept Comp Sci & Engn, College Stn, TX 77843 USA
[2] Iowa State Univ, Dept English, Ames, IA USA
关键词
LINEAR TRANSFORMATION; EXPLICIT CORRECTION; FOREIGN ACCENT; LEARNER REPAIR; ERROR TYPES; FLUENCY; COMPREHENSIBILITY; RECASTS; SPEECH; NEGOTIATION;
D O I
10.1016/j.specom.2019.10.005
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The type of voice model used in Computer Assisted Pronunciation Instruction is a crucial factor in the quality of practice and the amount of uptake by language learners. As an example, prior research indicates that second-language learners are more likely to succeed when they imitate a speaker with a voice similar to their own, a so-called "golden speaker". This manuscript presents Golden Speaker Builder (GSB), a tool that allows learners to generate a personalized "golden-speaker" voice: one that mirrors their own voice but with a native accent. We describe the overall system design, including the web application with its user interface, and the underlying speech analysis/synthesis algorithms. Next, we present results from a series of listening tests, which show that GSB is capable of synthesizing such golden-speaker voices. Finally, we present results from a user study in a language-instruction setting, which show that practising with GSB leads to improved fluency and comprehensibility. We suggest reasons for why learners improved as they did and recommendations for the next iteration of the training.
引用
收藏
页码:51 / 66
页数:16
相关论文
共 90 条
[71]   Enhancing foreign language tutors - In search of the golden speaker [J].
Probst, K ;
Ke, Y ;
Eskenazi, M .
SPEECH COMMUNICATION, 2002, 37 (3-4) :161-173
[72]   TOWARD AN UNDERSTANDING OF FLUENCY - A MICROANALYSIS OF NONNATIVE SPEAKER CONVERSATIONS [J].
RIGGENBACH, H .
DISCOURSE PROCESSES, 1991, 14 (04) :423-441
[73]  
Riggenbach H., 2000, Perspectives on Fluency
[74]  
Rypa M. E., 1999, CALICO Journal, V16, P385
[75]   Effects of Instruction on L2 Pronunciation Development: A Synthesis of 15 Quasi-Experimental Intervention Studies [J].
Saito, Kazuya .
TESOL QUARTERLY, 2012, 46 (04) :842-854
[76]  
Segalowitz N, 2007, TESOL QUART, V41, P181
[77]  
Solem A., 2016, Celery - distributed task queue
[78]  
Sundstrom A., 1998, PROC SPEECH TECHNOLO, P49
[79]   PROBLEMS IN OUTPUT AND THE COGNITIVE-PROCESSES THEY GENERATE - A STEP TOWARDS 2ND LANGUAGE-LEARNING [J].
SWAIN, M ;
LAPKIN, S .
APPLIED LINGUISTICS, 1995, 16 (03) :371-391
[80]  
Swain M., 2000, SOCIOCULTURAL THEORY