Language-Independent Approach for Automatic Computation of Vowel Articulation Features in Dysarthric Speech Assessment

Cited by: 12
Authors
Liu, Yuanyuan [1 ]
Penttila, Nelly [2 ]
Ihalainen, Tiina [2 ]
Lintula, Juulia [2 ]
Convey, Rachel [2 ]
Rasanen, Okko [1 ,3 ]
Affiliations
[1] Tampere Univ, Unit Comp Sci, Tampere 33720, Pirkanmaa, Finland
[2] Tampere Univ, Fac Social Sci, Tampere 33100, Pirkanmaa, Finland
[3] Aalto Univ, Dept Signal Proc, Acoust, Espoo 02150, Finland
Funding
Academy of Finland;
Keywords
Acoustics; Feature extraction; Speech processing; Manuals; Diseases; Annotations; Task analysis; Parkinson's diseases; dysarthria; vowel articulation; automatic corner vowels detection; phoneme recognition; ACOUSTIC CHARACTERISTICS; PARKINSONS-DISEASE; SPACE; INTELLIGIBILITY;
DOI
10.1109/TASLP.2021.3090973
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
Imprecise vowel articulation can be observed in people with Parkinson's disease (PD). Acoustic features measuring vowel articulation have been demonstrated to be effective indicators of PD in its assessment. The standard clinical vowel articulation features, vowel working space area (VSA), vowel articulation index (VAI) and formant centralization ratio (FCR), are derived from the first two formants of the three corner vowels /a/, /i/ and /u/. Conventionally, manual annotation of the corner vowels from speech data is required before measuring vowel articulation. This process is time-consuming. The present work aims to reduce human effort in clinical analysis of PD speech by proposing an automatic pipeline for vowel articulation assessment. The method is based on automatic corner vowel detection using a language-universal phoneme recognizer, followed by statistical analysis of the formant data. The approach removes the need for prior knowledge of the spoken content and of the language in question. Experimental results on a Finnish PD speech corpus demonstrate the efficacy and reliability of the proposed automatic method in deriving VAI, VSA, FCR and F2i/F2u (the second formant ratio for vowels /i/ and /u/). The automatically computed parameters are shown to be highly correlated with features computed with manual annotations of corner vowels. In addition, automatically and manually computed vowel articulation features have comparable correlations with experts' ratings of speech intelligibility, voice impairment and overall severity of communication disorder. Language independence of the proposed approach is further validated on a Spanish PD database, PC-GITA, as well as on the TORGO corpus of English dysarthric speech.
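The abstract names four features derived from the first two formants of the corner vowels but does not state their formulas. The sketch below uses the definitions commonly given in the literature for triangular VSA, FCR, VAI (the reciprocal of FCR) and F2i/F2u; the paper itself may use variants. The class name, function name, and the formant values in the usage note are hypothetical and only illustrate the arithmetic.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CornerFormants:
    """Mean F1/F2 in Hz for the corner vowels /a/, /i/, /u/ (hypothetical container)."""
    f1_a: float
    f2_a: float
    f1_i: float
    f2_i: float
    f1_u: float
    f2_u: float


def vowel_features(v: CornerFormants) -> dict:
    """Compute VSA, VAI, FCR and F2i/F2u from corner-vowel formants.

    Assumes the commonly cited definitions, not necessarily the paper's exact ones.
    """
    # Triangular VSA: area of the /a/-/i/-/u/ triangle in the F1-F2 plane (Hz^2).
    vsa = 0.5 * abs(
        v.f1_i * (v.f2_a - v.f2_u)
        + v.f1_a * (v.f2_u - v.f2_i)
        + v.f1_u * (v.f2_i - v.f2_a)
    )
    # FCR increases as vowels centralize; VAI is defined as its reciprocal.
    fcr = (v.f2_u + v.f2_a + v.f1_i + v.f1_u) / (v.f2_i + v.f1_a)
    vai = 1.0 / fcr
    return {"VSA": vsa, "VAI": vai, "FCR": fcr, "F2i/F2u": v.f2_i / v.f2_u}
```

For example, with illustrative formants /a/ = (700, 1200) Hz, /i/ = (300, 2300) Hz and /u/ = (300, 800) Hz, `vowel_features(CornerFormants(700, 1200, 300, 2300, 300, 800))` gives a VSA of 300000 Hz², an FCR of about 0.867, a VAI of about 1.154 and an F2i/F2u of 2.875.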
Pages: 2228-2243
Number of pages: 16