The XMUSPEECH System for Accented English Automatic Speech Recognition

被引：2

作者：

Tong, Fuchuan ^{[1
]}

Li, Tao ^{[2
]}

Liao, Dexin ^{[2
]}

Xia, Shipeng ^{[2
]}

Li, Song ^{[1
]}

Hong, Qingyang ^{[2
]}

Li, Lin ^{[1
]}

机构：

[1] Xiamen Univ, Sch Elect Sci & Engn, Xiamen 361005, Peoples R China

[2] Xiamen Univ, Sch Informat, Xiamen 361005, Peoples R China

来源：

APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 03期

基金：

中国国家自然科学基金;

关键词：

AESRC2020; i-vector; x-vector; multistream CNN;

D O I：

10.3390/app12031478

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

In this paper, we present the XMUSPEECH systems for Track 2 of the Interspeech 2020 Accented English Speech Recognition Challenge (AESRC2020). Track 2 is an Automatic Speech Recognition (ASR) task where the non-native English speakers have various accents, which reduces the accuracy of the ASR system. To solve this problem, we experimented with acoustic models and input features. Furthermore, we trained a TDNN-LSTM language model for lattice rescoring to obtain better results. Compared with our baseline system, we achieved relative word error rate (WER) improvements of 40.7% and 35.7% on the development set and evaluation set, respectively.

引用

页数：8

共 50 条

[21] The role of acoustic similarity in listening to foreign-accented speech: Recognition of Spanish-accented English words by Japanese native listeners
Matsui, Sanae
ACOUSTICAL SCIENCE AND TECHNOLOGY, 2024, 45 (04) : 216 - 223
[22] End-to-end Accented Speech Recognition
Viglino, Thibault
Motlicek, Petr
Cernak, Milos
INTERSPEECH 2019, 2019, : 2140 - 2144
[23] Automatic Scoring of English Speaking Test Using Automatic Speech Recognition
Yasuda, Keiji
Kawashima, Hiroyuki
Kimura, Hiroaki
24TH INTERNATIONAL CONFERENCE ON COMPUTERS IN EDUCATION (ICCE 2016): THINK GLOBAL ACT LOCAL, 2016, : 495 - 497
[24] Fast accent identification and accented speech recognition
Univ of Science and Technology, Hong Kong, Hong Kong
ICASSP IEEE Int Conf Acoust Speech Signal Process Proc, (221-224):
[25] THE ACCENTED ENGLISH SPEECH RECOGNITION CHALLENGE 2020: OPEN DATASETS, TRACKS, BASELINES, RESULTS AND METHODS
Shi, Xian
Yu, Fan
Lu, Yizhou
Liang, Yuhao
Feng, Qiangze
Wang, Daliang
Qian, Yanmin
Xie, Lei
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6918 - 6922
[26] DOMAIN ADVERSARIAL TRAINING FOR ACCENTED SPEECH RECOGNITION
Sun, Sining
Yeh, Ching-Feng
Hwang, Mei-Yuh
Ostendorf, Mari
Xie, Lei
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4854 - 4858
[27] Fast accent identification and accented speech recognition
Kat, LW
Fung, P
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 221 - 224
[28] English Speech Recognition System on Chip
刘鸿
钱彦旻
刘加
TsinghuaScienceandTechnology, 2011, 16 (01) : 95 - 99
[29] The AhoSR Automatic Speech Recognition System
Odriozola, Igor
Serrano, Luis
Hernaez, Inma
Navas, Eva
ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, IBERSPEECH 2014, 2014, 8854 : 279 - 288
[30] AN AUTOMATIC SPEECH RECOGNITION SYSTEM TABARCA
BENEDI, JM
CASACUBERTA, F
VIDAL, E
REVISTA DE INFORMATICA Y AUTOMATICA, 1990, 23 (01): : 15 - 24

← 1 2 3 4 5 →