Automatic Speech Recognition (ASR) Systems Applied to Pronunciation Assessment of L2 Spanish for Japanese Speakers

Cited by: 10
Authors
Tejedor-Garcia, Cristian [1, 2]
Cardenoso-Payo, Valentin [2]
Escudero-Mancebo, David [2]
Affiliations
[1] Radboud Univ Nijmegen, Ctr Language & Speech Technol CLST, POB 9103, NL-6500 Nijmegen, Netherlands
[2] Univ Valladolid, Dept Comp Sci, ECA SIMM Res Grp, Valladolid 47002, Spain
Source
APPLIED SCIENCES-BASEL | 2021 / Vol. 11 / Issue 15
Keywords
automatic speech recognition (ASR); automatic assessment tools; foreign language pronunciation; pronunciation training; computer-assisted pronunciation training (CAPT); automatic pronunciation assessment; learning environments; minimal pairs; ENGLISH; ERRORS
DOI
10.3390/app11156695
Chinese Library Classification (CLC) number
O6 [Chemistry]
Discipline classification code
0703
Abstract
Featured Application: The CAPT tool, ASR technology and procedure described in this work can be successfully applied to support typical learning paces for Spanish as a foreign language for Japanese people. With small changes, the application can be tailored to a different target L2, provided that the set of minimal pairs used for the discrimination, pronunciation and mixed-mode activities is adapted to the specific L1-L2 pair.

General-purpose automatic speech recognition (ASR) systems have improved in quality and are being used for pronunciation assessment. However, the assessment of isolated short utterances, such as words in minimal pairs for segmental approaches, remains an important challenge, even more so for non-native speakers. In this work, we compare the performance of our own tailored ASR system (kASR) with that of Google ASR (gASR) for the assessment of Spanish minimal pair words produced by 33 native Japanese speakers in a computer-assisted pronunciation training (CAPT) scenario. Participants in a pre/post-test training experiment spanning four weeks were split into three groups: experimental, in-classroom, and placebo. The experimental group used the CAPT tool described in the paper, which we specially designed for autonomous pronunciation training. The experimental and in-classroom groups showed a statistically significant improvement. Correlations between gASR and kASR results were moderate, and correlations between the post-test scores of both ASR systems and the CAPT application scores at the final stages of application use were strong. These results suggest that both ASR alternatives, in their current configuration, are valid for assessing minimal pairs in CAPT tools. A discussion of possible ways to improve our system and of possibilities for future research is included.
Pages: 16