Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech

被引:0
|
作者
Song, Kaitao [1 ]
Wan, Teng [2 ]
Wang, Bixia [2 ]
Jiang, Huiqiang [1 ]
Qiu, Luna [1 ]
Xu, Jiahang [1 ]
Jiang, Liping [2 ]
Lou, Qun [2 ]
Yang, Yuqing [1 ]
Li, Dongsheng [1 ]
Wang, Xudong [2 ]
Qiu, Lili [1 ]
机构
[1] Microsoft Res, Redmond, WA 98052 USA
[2] Shanghai Jiao Tong Univ, Dept Oral & Craniomaxillofacial Surg, Shanghai Ninth Peoples Hosp, Sch Med, Shanghai, Peoples R China
来源
INTERSPEECH 2022 | 2022年
关键词
Cleft Palate; Hypernasality; Automatic Speech Recognition; ACOUSTIC ANALYSIS; LIP; CHILDREN;
D O I
10.21437/Interspeech.2022-438
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Hypernasality is an abnormal resonance in human speech production, especially in patients with craniofacial anomalies such as cleft palate. In clinical application, hypernasality estimation is crucial in cleft palate diagnosis, as its results determine the subsequent surgery and additional speech therapy. Therefore, designing an automatic hypernasality assessment method will facilitate speech-language pathologists to make precise diagnoses. Existing methods for hypernasality estimation only conduct acoustic analysis based on low-resource cleft palate dataset, by using statistical or neural network-based features. In this paper, we propose a novel approach that uses automatic speech recognition model to improve hypernasality estimation. Specifically, we first pre-train an encoder-decoder framework in an automatic speech recognition (ASR) objective by using speech-to-text dataset, and then fine-tune ASR encoder on the cleft palate dataset for hypernasality estimation. Benefiting from such design, our model for hypernasality estimation can enjoy the advantages of ASR model: 1) compared with low-resource cleft palate dataset, the ASR task usually includes large-scale speech data in the general domain, which enables better model generalization; 2) the text annotations in ASR dataset guide model to extract better acoustic features. Experimental results on two cleft palate datasets demonstrate that our method achieves superior performance compared with previous approaches.
引用
收藏
页码:4820 / 4824
页数:5
相关论文
共 50 条
  • [31] Speech production of preschoolers with cleft palate
    Hardin-Jones, MA
    Jones, DL
    CLEFT PALATE-CRANIOFACIAL JOURNAL, 2005, 42 (01) : 7 - 13
  • [32] Analysis of oral-nasal balance after intensive speech therapy combined with speech bulb in speakers with cleft palate and hypernasality
    Ferreira, Gabriela Zuin
    Bressmann, Tim
    Rillo Dutka, Jennifer de Cassia
    Whitaker, Melina Evangelista
    de Boer, Gillian
    de Castro Marino, Viviane Cristina
    Pegoraro-Krook, Maria Ines
    JOURNAL OF COMMUNICATION DISORDERS, 2020, 85
  • [33] Speech and language development in toddlers with and without cleft palate
    Priester, G. H.
    Goorhuis-Brouwer, S. M.
    INTERNATIONAL JOURNAL OF PEDIATRIC OTORHINOLARYNGOLOGY, 2008, 72 (06) : 801 - 806
  • [34] Speech assessment in cleft palate patients: A descriptive study
    Rullo, R.
    Di Maggio, D.
    Festa, V. M.
    Mazzarella, N.
    INTERNATIONAL JOURNAL OF PEDIATRIC OTORHINOLARYNGOLOGY, 2009, 73 (05) : 641 - 644
  • [35] A Deep Learning Algorithm for Objective Assessment of Hypernasality in Children With Cleft Palate
    Mathad, Vikram C.
    Scherer, Nancy
    Chapman, Kathy
    Liss, Julie M.
    Berisha, Visar
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2021, 68 (10) : 2986 - 2996
  • [36] Pattern Recognition of Hypernasality in voice of patients with Cleft and Lip Palate
    Gomez Nieto, Roger
    Ivan Marin-Hurtado, Jorge
    Miguel Capacho-Valbuena, Luis
    Amaya Suarez, Alexander
    Belalcazar Bolanos, Elkyn Alexander
    2014 XIX SYMPOSIUM ON IMAGE, SIGNAL PROCESSING AND ARTIFICIAL VISION (STSIVA), 2014,
  • [37] Detection of Hypernasal Speech in Children with Cleft Palate
    Akafi, Ehsan
    Vali, Mansour
    Moradi, Negin
    2012 19TH IRANIAN CONFERENCE OF BIOMEDICAL ENGINEERING (ICBME), 2012, : 186 - 190
  • [38] Influence of an Intensive Speech Therapy Program on the Speech of Individuals with Cleft Lip and Palate
    Felix de Andrade, Laura Katarine
    Rillo Dutka, Jeniffer de Cassia
    Ferreira, Gabriela Zuin
    Borro Pinto, Maria Daniela
    Pegoraro-Krook, Maria Ines
    INTERNATIONAL ARCHIVES OF OTORHINOLARYNGOLOGY, 2023, 27 (01) : 3 - 9
  • [39] Cleft palate speech and velopharyngeal dysfunction: the approach of the speech therapist
    De Bodt, M.
    Van Lierde, K.
    B-ENT, 2006, : 63 - 70
  • [40] Speech after repair of isolated cleft palate and cleft lip and palate
    Timmons, MJ
    Wyatt, RA
    Murphy, T
    BRITISH JOURNAL OF PLASTIC SURGERY, 2001, 54 (05): : 377 - 384