Dysarthric speech classification from coded telephone speech using glottal features

被引:32
|
作者
Narendra, N. P. [1 ]
Alku, Paavo [1 ]
机构
[1] Aalto Univ, Dept Signal Proc & Acoust, Espoo 00076, Finland
基金
芬兰科学院;
关键词
Dysarthric speech; Glottal parameters; Glottal source estimation; Glottal inverse filtering; OpenSMILE; Support vector machines; Telemonitoring; PARKINSONS-DISEASE; INTELLIGIBILITY; DATABASE; MODELS; VOICE;
D O I
10.1016/j.specom.2019.04.003
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes a new dysarthric speech classification method from coded telephone speech using glottal features. The proposed method utilizes glottal features, which are efficiently estimated from coded telephone speech using a recently proposed deep neural net-based glottal inverse filtering method. Two sets of glottal features were considered: (1) time- and frequency-domain parameters and (2) parameters based on principal component analysis (PCA). In addition, acoustic features are extracted from coded telephone speech using the openSMILE toolkit. The proposed method utilizes both acoustic and glottal features extracted from coded speech utterances and their corresponding dysarthric/healthy labels to train support vector machine classifiers. Separate classifiers are trained using both individual, and the combination of glottal and acoustic features. The coded telephone speech used in the experiments is generated using the adaptive multi-rate codec, which operates in two transmission bandwidths: narrowband (300 Hz - 3.4 kHz) and wideband (50 Hz - 7 kHz). The experiments were conducted using dysarthric and healthy speech utterances of the TORGO and universal access speech (UA-Speech) databases. Classification accuracy results indicated the effectiveness of glottal features in the identification of dysarthria from coded telephone speech. The results also showed that the glottal features in combination with the openSMILE-based acoustic features resulted in improved classification accuracies, which validate the complementary nature of glottal features. The proposed dysarthric speech classification method can potentially be employed in telemonitoring application for identifying the presence of dysarthria from coded telephone speech.
引用
收藏
页码:47 / 55
页数:9
相关论文
共 50 条
  • [31] Using speech rhythm knowledge to improve dysarthric speech recognition
    S.-A. Selouani
    H. Dahmani
    R. Amami
    H. Hamam
    International Journal of Speech Technology, 2012, 15 (1) : 57 - 64
  • [32] Classification of Fricatives Using Feature Extrapolation of Acoustic-Phonetic Features in Telephone Speech
    Lee, Jung-Won
    Choi, Jeung-Yoon
    Kang, Hong-Goo
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1268 - 1271
  • [33] Speech/music classification using speech-specific features
    Khonglah, Banriskhem K.
    Prasanna, S. R. Mahadeva
    DIGITAL SIGNAL PROCESSING, 2016, 48 : 71 - 83
  • [34] Whispered Speech Detection Using Glottal Flow-Based Features
    Phapatanaburi, Khomdet
    Pathonsuwan, Wongsathon
    Wang, Longbiao
    Anchuen, Patikorn
    Jumphoo, Talit
    Buayai, Prawit
    Uthansakul, Monthippa
    Uthansakul, Peerapong
    SYMMETRY-BASEL, 2022, 14 (04):
  • [35] Phoneme-Discriminative Features for Dysarthric Speech Conversion
    Aihara, Ryo
    Takiguchi, Tetsuya
    Ariki, Yasuo
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3374 - 3378
  • [36] Automatic Analyses of Dysarthric Speech based on Distinctive Features
    Wong, Ka Ho
    Meng, Helen Mei-Ling
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2023, 12 (03)
  • [37] Shouted / Normal Speech Classification using Speech-Specific Features
    Baghel, Shikha
    Khonglah, Banriskhem K.
    Prasanna, S. R. Mahadeva
    Guha, Prithwijit
    PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 1655 - 1659
  • [38] Call Analysis with Classification Using Speech and Non-Speech Features
    Ju, Yun-Cheng
    Wang, Ye-Yi
    Acero, Alex
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1902 - 1905
  • [39] Speech/Music Classification Using Features From Spectral Peaks
    Bhattacharjee, Mrinmoy
    Prasanna, S. R. Mahadeva
    Guha, Prithwijit
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 (28) : 1549 - 1559
  • [40] Dysarthric Speech Recognition, Detection and Classification using Raw Phase and Magnitude Spectra
    Yue, Zhengjun
    Loweimi, Erfan
    Cvetkovic, Zoran
    INTERSPEECH 2023, 2023, : 1533 - 1537