Computer-Assisted English Presentation Learning System Based on Word Stress in Pronunciation

被引:0
作者
Tagami, Yugo [1 ]
Kojima, Takako [2 ]
Yamaguchi, Saneyasu [3 ]
机构
[1] Kogakuin Univ, Grad Sch, Elect Engn & Elect, Tokyo, Japan
[2] Tokyo Med Univ, Ctr Int Educ & Res, Tokyo, Japan
[3] Kogakuin Univ, Dept Informat & Commun Engn, Tokyo, Japan
来源
2024 IEEE 48TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC 2024 | 2024年
关键词
CALL; Computer-Assisted Language Learning; English pronunciation;
D O I
10.1109/COMPSAC61105.2024.00271
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Non-English speaking students are often required to give presentations in English. Using CALL (Computer-Assisted Language Learning), students can practice English pronunciation as many times as they want. Some current CALLs make use of machine learning and these help students to learn English objectively and efficiently. In this paper, we propose and evaluate an English pronunciation training system. The proposed system focuses on improving sentence intonation by paying attention to word stress during pronunciation and aims to make the student's pronunciation of each word similar to that of the model that the student likes. For this purpose, we have developed a system that evaluates the word stress of the student's pronunciation and gives advice on how to make the student's pronunciation close to that of the model.
引用
收藏
页码:1721 / 1725
页数:5
相关论文
共 8 条
[1]  
Bain M, 2023, Arxiv, DOI [arXiv:2303.00747, 10.48550/arXiv.2303.00747]
[2]  
Goto Kenta, 2016, IPSJ AN10096193, No. 2016-CE-133, V11, P1
[3]  
huggingface, Wav2vec2(asr-base-960h)
[4]  
Kurzinger Ludwig, 2020, Speech and Computer. 22nd International Conference, SPECOM 2020. Proceedings. Lecture Notes in Artificial Intelligence Subseries of Lecture Notes in Computer Science (LNAI 12335), P267, DOI 10.1007/978-3-030-60276-5_27
[5]   Montreal Forced Aligner: trainable text-speech alignment using Kaldi [J].
McAuliffe, Michael ;
Socolof, Michaela ;
Mihuc, Sarah ;
Wagner, Michael ;
Sonderegger, Morgan .
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, :498-502
[6]  
Radford A., 2022, arXiv, DOI [DOI 10.48550/ARXIV.2212.04356, 10.48550/ARXIV.2212.04356]
[7]  
Teng Haikun, 2019, Speech Recognition Model Based on Deep Learning And Application in Pronunciation Quality Evaluation System (ICDMML 2019
[8]   Hybrid CTC/Attention Architecture for End-to-End Speech Recognition [J].
Watanabe, Shinji ;
Hori, Takaaki ;
Kim, Suyoun ;
Hershey, John R. ;
Hayashi, Tomoki .
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2017, 11 (08) :1240-1253