Dysarthric speech classification from coded telephone speech using glottal features

被引:32
|
作者
Narendra, N. P. [1 ]
Alku, Paavo [1 ]
机构
[1] Aalto Univ, Dept Signal Proc & Acoust, Espoo 00076, Finland
基金
芬兰科学院;
关键词
Dysarthric speech; Glottal parameters; Glottal source estimation; Glottal inverse filtering; OpenSMILE; Support vector machines; Telemonitoring; PARKINSONS-DISEASE; INTELLIGIBILITY; DATABASE; MODELS; VOICE;
D O I
10.1016/j.specom.2019.04.003
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes a new dysarthric speech classification method from coded telephone speech using glottal features. The proposed method utilizes glottal features, which are efficiently estimated from coded telephone speech using a recently proposed deep neural net-based glottal inverse filtering method. Two sets of glottal features were considered: (1) time- and frequency-domain parameters and (2) parameters based on principal component analysis (PCA). In addition, acoustic features are extracted from coded telephone speech using the openSMILE toolkit. The proposed method utilizes both acoustic and glottal features extracted from coded speech utterances and their corresponding dysarthric/healthy labels to train support vector machine classifiers. Separate classifiers are trained using both individual, and the combination of glottal and acoustic features. The coded telephone speech used in the experiments is generated using the adaptive multi-rate codec, which operates in two transmission bandwidths: narrowband (300 Hz - 3.4 kHz) and wideband (50 Hz - 7 kHz). The experiments were conducted using dysarthric and healthy speech utterances of the TORGO and universal access speech (UA-Speech) databases. Classification accuracy results indicated the effectiveness of glottal features in the identification of dysarthria from coded telephone speech. The results also showed that the glottal features in combination with the openSMILE-based acoustic features resulted in improved classification accuracies, which validate the complementary nature of glottal features. The proposed dysarthric speech classification method can potentially be employed in telemonitoring application for identifying the presence of dysarthria from coded telephone speech.
引用
收藏
页码:47 / 55
页数:9
相关论文
共 50 条
  • [1] Automatic assessment of intelligibility in speakers with dysarthria from coded telephone speech using glottal features
    Narendra, N. P.
    Alku, Paavo
    COMPUTER SPEECH AND LANGUAGE, 2021, 65
  • [2] Dysarthric speech classification using glottal features computed from non-words, words and sentences
    Narendra, N. P.
    Alku, Paavo
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3403 - 3407
  • [3] Estimation of the glottal source from coded telephone speech using deep neural networks
    Narendra, N. P.
    Airaksinen, Manu
    Story, Brad
    Alku, Paavo
    SPEECH COMMUNICATION, 2019, 106 : 95 - 104
  • [4] Glottal source estimation from coded telephone speech using a deep neural network
    Narendra, N. P.
    Airaksinen, Manu
    Alku, Paavo
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3931 - 3935
  • [5] Dysarthric speech detection from telephone quality speech using epoch-based pitch perturbation features
    Madhu Keerthana Y.
    Sreenivasa Rao K.
    Mitra P.
    International Journal of Speech Technology, 2022, 25 (04) : 967 - 973
  • [6] PHONOLOGICAL FEATURES IN DISCRIMINATIVE CLASSIFICATION OF DYSARTHRIC SPEECH
    Rudzicz, Frank
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4605 - 4608
  • [7] Automatic intelligibility assessment of dysarthric speech using glottal parameters
    Narendra, N. P.
    Alku, Paavo
    SPEECH COMMUNICATION, 2020, 123 : 1 - 9
  • [8] A study of glottal waveform features for deceptive speech classification
    Torres, Juan F.
    Moore, Elliot, II
    Bryant, Ernest
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4489 - 4492
  • [9] AFM signal model for dysarthric speech classification using speech biomarkers
    Shabber, Shaik Mulla
    Sumesh, Eratt Parameswaran
    FRONTIERS IN HUMAN NEUROSCIENCE, 2024, 18
  • [10] Dysarthric Speech Classification Using Hierarchical Multilayer Perceptrons and Posterior Rhythmic Features
    Selouani, Sid-Ahmed
    Dahmani, Habiba
    Amami, Riadh
    Hamam, Habib
    SOFT COMPUTING MODELS IN INDUSTRIAL AND ENVIRONMENTAL APPLICATIONS, 6TH INTERNATIONAL CONFERENCE SOCO 2011, 2011, 87 : 437 - 444