The XMUSPEECH System for Accented English Automatic Speech Recognition

Cited by: 2
Authors
Tong, Fuchuan [1 ]
Li, Tao [2 ]
Liao, Dexin [2 ]
Xia, Shipeng [2 ]
Li, Song [1 ]
Hong, Qingyang [2 ]
Li, Lin [1 ]
Affiliations
[1] Xiamen Univ, Sch Elect Sci & Engn, Xiamen 361005, Peoples R China
[2] Xiamen Univ, Sch Informat, Xiamen 361005, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2022 / Vol. 12 / Issue 03
Funding
National Natural Science Foundation of China;
Keywords
AESRC2020; i-vector; x-vector; multistream CNN;
DOI
10.3390/app12031478
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703 ;
Abstract
In this paper, we present the XMUSPEECH systems for Track 2 of the Interspeech 2020 Accented English Speech Recognition Challenge (AESRC2020). Track 2 is an Automatic Speech Recognition (ASR) task where the non-native English speakers have various accents, which reduces the accuracy of the ASR system. To solve this problem, we experimented with acoustic models and input features. Furthermore, we trained a TDNN-LSTM language model for lattice rescoring to obtain better results. Compared with our baseline system, we achieved relative word error rate (WER) improvements of 40.7% and 35.7% on the development set and evaluation set, respectively.
Pages: 8
Related papers
50 in total
  • [21] The role of acoustic similarity in listening to foreign-accented speech: Recognition of Spanish-accented English words by Japanese native listeners
    Matsui, Sanae
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2024, 45 (04) : 216 - 223
  • [22] End-to-end Accented Speech Recognition
    Viglino, Thibault
    Motlicek, Petr
    Cernak, Milos
    INTERSPEECH 2019, 2019, : 2140 - 2144
  • [23] Automatic Scoring of English Speaking Test Using Automatic Speech Recognition
    Yasuda, Keiji
    Kawashima, Hiroyuki
    Kimura, Hiroaki
    24TH INTERNATIONAL CONFERENCE ON COMPUTERS IN EDUCATION (ICCE 2016): THINK GLOBAL ACT LOCAL, 2016, : 495 - 497
  • [24] Fast accent identification and accented speech recognition
    Univ of Science and Technology, Hong Kong, Hong Kong
    ICASSP IEEE Int Conf Acoust Speech Signal Process Proc, (221-224):
  • [25] THE ACCENTED ENGLISH SPEECH RECOGNITION CHALLENGE 2020: OPEN DATASETS, TRACKS, BASELINES, RESULTS AND METHODS
    Shi, Xian
    Yu, Fan
    Lu, Yizhou
    Liang, Yuhao
    Feng, Qiangze
    Wang, Daliang
    Qian, Yanmin
    Xie, Lei
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6918 - 6922
  • [26] DOMAIN ADVERSARIAL TRAINING FOR ACCENTED SPEECH RECOGNITION
    Sun, Sining
    Yeh, Ching-Feng
    Hwang, Mei-Yuh
    Ostendorf, Mari
    Xie, Lei
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4854 - 4858
  • [27] Fast accent identification and accented speech recognition
    Kat, LW
    Fung, P
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 221 - 224
  • [28] English Speech Recognition System on Chip
    刘鸿
    钱彦旻
    刘加
    Tsinghua Science and Technology, 2011, 16 (01) : 95 - 99
  • [29] The AhoSR Automatic Speech Recognition System
    Odriozola, Igor
    Serrano, Luis
    Hernaez, Inma
    Navas, Eva
    ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, IBERSPEECH 2014, 2014, 8854 : 279 - 288
  • [30] AN AUTOMATIC SPEECH RECOGNITION SYSTEM TABARCA
    BENEDI, JM
    CASACUBERTA, F
    VIDAL, E
    REVISTA DE INFORMATICA Y AUTOMATICA, 1990, 23 (01): : 15 - 24