Tools and Technologies for Computer-Aided Speech and Language Therapy

被引:72
作者
Saz, Oscar [1 ]
Yin, Shou-Chun [2 ]
Lleida, Eduardo [1 ]
Rose, Richard [2 ]
Vaquero, Carlos [1 ]
Rodriguez, William R. [1 ]
机构
[1] Univ Zaragoza, GTC, Aragon Inst Engn Res 13A, Zaragoza, Spain
[2] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ H3A 2A7, Canada
关键词
Spoken language learning; Speech disorders; Speech corpora; Automatic speech recognition; Pronunciation verification; RECOGNITION;
D O I
10.1016/j.specom.2009.04.006
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper addresses the problem of Computer-Aided Speech and Language Therapy (CASLT). The goal of the work described in the paper is to develop and evaluate a semi-automated system for providing interactive speech therapy to the increasing population of impaired individuals and help professional speech therapists. A discussion on the development and evaluation of a set of interactive therapy tools, along with the underlying speech technologies that support these tools is provided. The interactive tools are designed to facilitate the acquisition of language skills in the areas of basic phonatory skills, phonetic articulation and language understanding primarily for children with neuromuscular disorders like dysarthria. Human-machine interaction for all of these areas requires the existence of speech analysis, speech recognition, and speech verification algorithms that are robust with respect to the sources of speech variability that are characteristic of this population of speakers. The paper will present an experimental study that demonstrates the effectiveness of an interactive system for eliciting speech from a population of impaired children and young speakers ranging in age from 11 to 21 years. The performance of automatic speech recognition (ASR) systems and subword-based pronunciation verification (PV) on this domain are also presented. The results indicate that ASR and PV systems configured from speech utterances taken from the impaired speech domain can provide adequate performance, similar to the experts' agreement rate, for supporting the presented CASLT applications. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:948 / 967
页数:20
相关论文
共 47 条
  • [1] ACEROVILLAN P, 2005, TRATAMIENTO VOZ MANU
  • [2] AGUINAGA Gloria., 2004, Prueba de Lenguaje Oral de Navarra Revisada (PLON-R)
  • [3] Alarcos Emilio Llorach., 1950, Fonologia Espanola
  • [4] Albor J., 1991, ELA-Examen Logopedico de Articulacion
  • [5] [Anonymous], 2005, P 10 MACH TRANSL SUM
  • [6] Bengio S., 2004, P OD SPEAK LANG REC, P237
  • [7] COORMAN G, 2000, P INT C SPOK LANG PR, P395
  • [8] CUCCHIARINI C, 2007, P INT 2007 ANTW BELG, P2181
  • [9] ON THE USE OF HIDDEN MARKOV MODELING FOR RECOGNITION OF DYSARTHRIC SPEECH
    DELLER, JR
    HSU, D
    FERRIER, LJ
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 1991, 35 (02) : 125 - 139
  • [10] DELONG ER, 1988, J BIOMETR, V3, P837