A review of tools and techniques for computer aided pronunciation training (CAPT) in English

被引:33
作者
Agarwal, Chesta [1 ]
Chakraborty, Pinaki [1 ]
机构
[1] Netaji Subhas Univ Technol, Div Comp Engn, New Delhi, India
关键词
Educational software; Computer aided pronunciation training (CAPT); English as a second language; English as a foreign language; Phonetics; LANGUAGE; RECOGNITION; DISCOVERY; SPEECH;
D O I
10.1007/s10639-019-09955-7
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Widespread use of English in the academia and in business is leading an increasing number of people to learn it as a second or a foreign language. Computer aided pronunciation training (CAPT) systems are used by non-native English speakers for improving their English pronunciation. A typical CAPT tool records the speech of a learner, detects and diagnoses mispronunciations in it, and suggests a way for correcting them. We classified the CAPT systems for English into four categories on the basis of the technology used in them and studied the salient features of each such category. We observed that visual simulation based systems are suitable for young and naive learners, game based systems are advantageous as they can be personalized as per the requirements of the learners, comparative phonetics based systems are suitable for adult learners fluent in another language, and artificial neural network based systems have the highest accuracy in mispronunciation diagnosis and are suitable for experienced and professional learners. We identified the state-of-the-art practices used in CAPT systems, and observed that CAPT systems can detect up to 86% mispronunciations in a speech and help learners to lessen mispronouncing by up to 23%. We recommend collaboration between language teachers and software developers to develop CAPT tools, their wide dissemination and integration with the curriculum at school and university levels, and further investigation on mobile and collaborative CAPT systems.
引用
收藏
页码:3731 / 3743
页数:13
相关论文
共 26 条
[1]  
Abdou SM, 2006, INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, P849
[2]   Designing and developing multilingual e-learning materials: TUFS language education pronunciation module - Introduction of a system for learning Japanese language pronunciation [J].
Abe, S ;
Nakata, S ;
Kigoshi, T ;
Mochizuki, H .
3RD IEEE INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES, PROCEEDINGS, 2003, :462-462
[3]  
Akima Y., 1992, Communications on the Move. Singapore. ICCS/ISITA '92(Cat. No.92TH0479-6), P553, DOI 10.1109/ICCS.1992.254890
[4]  
Athanasopoulos G., 2017, P INT C 3D IMM, P1
[5]   Automatic Pronunciation Scoring with Score Combination by Learning to Rank and Class-Normalized DP-Based Quantization [J].
Chen, Liang-Yu ;
Jang, Jyh-Shing Roger .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (11) :1737-1749
[6]   Design and implementation of video-enabled web-based pronunciation debugging system [J].
Chiu, Chiung-Fang ;
Lee, Greg C. ;
Yang, Ju-Hsush .
7TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES, PROCEEDINGS, 2007, :374-+
[7]   A study on the use of a voice interactive system for teaching English to Italian children [J].
Giuliani, D ;
Mich, O ;
Nardon, M .
3RD IEEE INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES, PROCEEDINGS, 2003, :376-377
[8]  
Jain DPA., 2018, Journal of Multi Disciplinary Engineering Technologies, V12, P59
[9]  
Jing X, 2014, PROCEEDINGS OF 2014 IEEE WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS (WARTIA), P546, DOI 10.1109/WARTIA.2014.6976318
[10]   Automatic recognition and understanding of spoken language - A first step toward natural human-machine communication [J].
Juang, BH ;
Furui, S .
PROCEEDINGS OF THE IEEE, 2000, 88 (08) :1142-1165