Improving Aphasic Speech Recognition by Using Novel Semi-Supervised Learning Methods on AphasiaBank for English and Spanish

被引:21
作者
Torre, Ivan G. [1 ]
Romero, Monica [1 ]
Alvarez, Aitor [1 ]
机构
[1] Basque Res & Technol Alliance BRTA, Vicomtech Fdn, Mikeletegi 57, Donostia San Sebastian 20009, Spain
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 19期
关键词
aphasia; speech recognition; wav2vec2.0; semi-supervised learning; aphasiabank; low-resource; AUTOMATIC ASSESSMENT; STROKE;
D O I
10.3390/app11198872
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Automatic speech recognition in patients with aphasia is a challenging task for which studies have been published in a few languages. Reasonably, the systems reported in the literature within this field show significantly lower performance than those focused on transcribing non-pathological clean speech. It is mainly due to the difficulty of recognizing a more unintelligible voice, as well as due to the scarcity of annotated aphasic data. This work is mainly focused on applying novel semi-supervised learning methods to the AphasiaBank dataset in order to deal with these two major issues, reporting improvements for the English language and providing the first benchmark for the Spanish language for which less than one hour of transcribed aphasic speech was used for training. In addition, the influence of reinforcing the training and decoding processes with out-of-domain acoustic and text data is described by using different strategies and configurations to fine-tune the hyperparameters and the final recognition systems. The interesting results obtained encourage extending this technological approach to other languages and scenarios where the scarcity of annotated data to train recognition models is a challenging reality.
引用
收藏
页数:14
相关论文
共 52 条
[1]   Automatic word naming recognition for an on-line aphasia treatment system [J].
Abad, Alberto ;
Pompili, Anna ;
Costa, Angela ;
Trancoso, Isabel ;
Fonseca, Jose ;
Leal, Gabriela ;
Farrajota, Luisa ;
Martins, Isabel P. .
COMPUTER SPEECH AND LANGUAGE, 2013, 27 (06) :1235-1248
[2]  
[Anonymous], 2016, Fundamentals of Brain Network Analysis
[3]  
Ardila R., 2019, ARXIV191206670
[4]  
Baevski A., 2021, Neural Information Processing System, P15
[5]  
Baevski A, 2019, ARXIV191103912
[6]  
Baevski A, 2020, ADV NEUR IN, V33
[7]   Feasibility of Automatic Speech Recognition for Providing Feedback During Tablet-Based Treatment for Apraxia of Speech Plus Aphasia [J].
Ballard, Kirrie J. ;
Etter, Nicole M. ;
Shen, Songjia ;
Monroe, Penelope ;
Tan, Chek Tien .
AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY, 2019, 28 (02) :818-834
[8]   NUVA: A Naming Utterance Verifier for Aphasia Treatment [J].
Barbera, David S. ;
Huckvale, Mark ;
Fleming, Victoria ;
Upton, Emily ;
Coley-Fisher, Henry ;
Doogan, Catherine ;
Shaw, Ian ;
Latham, William ;
Leff, Alexander P. ;
Crinion, Jenny .
COMPUTER SPEECH AND LANGUAGE, 2021, 69
[9]   Intensity of aphasia therapy, impact on recovery [J].
Bhogal, SK ;
Teasell, R ;
Speechley, M .
STROKE, 2003, 34 (04) :987-992
[10]  
Biewald Lukas, 2020, EXPT TRACKING WEIGHT