MULTILINGUAL ANALYSIS OF INTELLIGIBILITY CLASSIFICATION USING ENGLISH, KOREAN, AND TAMIL DYSARTHRIC SPEECH DATASETS

被引:1
作者
Yeo, Eun Jung [1 ]
Kim, Sunhee [2 ]
Chung, Minhwa [1 ]
机构
[1] Seoul Natl Univ, Dept Linguist, Seoul, South Korea
[2] Seoul Natl Univ, Dept French Language Educ, Seoul, South Korea
来源
2022 25TH CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (O-COCOSDA 2022) | 2022年
关键词
dysarthria; multilingual analysis; acoustic measurements; automatic assessment; CEREBRAL-PALSY;
D O I
10.1109/O-COCOSDA202257103.2022.9997931
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper analyzes dysarthric speech datasets from three languages with different prosodic systems: English, Korean, and Tamil. We inspect 39 acoustic measurements which reflect three speech dimensions including voice quality, pronunciation, and prosody. As multilingual analysis, examination on the mean values of acoustic measurements by intelligibility levels is conducted. Further, automatic intelligibility classification is performed to scrutinize the optimal feature set by languages. Analyses suggest pronunciation features, such as Percentage of Correct Consonants, Percentage of Correct Vowels, and Percentage of Correct Phonemes to be language-independent measurements. Voice quality and prosody features, however, generally present different aspects by languages. Experimental results additionally show that different speech dimension play a greater role for different languages: prosody for English, pronunciation for Korean, both prosody and pronunciation for Tamil. This paper contributes to speech pathology in that it differentiates between language-independent and language-dependent measurements in intelligibility classification for English, Korean, and Tamil dysarthric speech.
引用
收藏
页数:6
相关论文
共 24 条
  • [1] [Anonymous], 2003, P ICPHS
  • [2] Boersma P, 2013, Praat: Doing phonetics by computer Computer software
  • [3] Celin TAM, 2016, PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), P2610, DOI 10.1109/TENCON.2016.7848510
  • [4] Choi DL, 2012, LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, P3378
  • [5] ENDERBY P, 1980, BRIT J DISORD COMMUN, V15, P165
  • [6] Enderby Pam, 2013, Handb Clin Neurol, V110, P273, DOI 10.1016/B978-0-444-52901-5.00022-8
  • [7] Intensive Voice Treatment (LSVT LOUD) for Children With Spastic Cerebral Palsy and Dysarthria
    Fox, Cynthia Marie
    Boliek, Carol Ann
    [J]. JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2012, 55 (03): : 930 - 945
  • [8] Grosman Jonatas, 2021, Fine-tuned XLSR-53 large model for speech recognition in English
  • [9] Prosody-Based Measures for Automatic Severity Assessment of Dysarthric Speech
    Hernandez, Abner
    Kim, Sunhee
    Chung, Minhwa
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (19): : 1 - 18
  • [10] Introducing Parselmouth: A Python']Python interface to Praat
    Jadoul, Yannick
    Thompson, Bill
    de Boer, Bart
    [J]. JOURNAL OF PHONETICS, 2018, 71 : 1 - 15