Dysarthric speech classification using glottal features computed from non-words, words and sentences

被引:35
|
作者
Narendra, N. P. [1 ]
Alku, Paavo [1 ]
机构
[1] Aalto Univ, Dept Signal Proc & Acoust, Espoo, Finland
基金
芬兰科学院;
关键词
Dysarthric speech; glottal source; glottal parameters; openSMILE; support vector machines; INTELLIGIBILITY; DATABASE; QUALITY;
D O I
10.21437/Interspeech.2018-1059
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dysarthria is a neuro-motor disorder resulting from the disruption of normal activity in speech production leading to slow, slurred and imprecise (low intelligible) speech. Automatic classification of dysarthria from speech can be used as a potential clinical tool in medical treatment. This paper examines the effectiveness of glottal source parameters in dysarthric speech classification from three categories of speech signals, namely non-words, words and sentences. In addition to the glottal parameters, two sets of acoustic parameters extracted by the openSMILE toolkit are used as baseline features. A dysarthric speech classification system is proposed by training support vector machines (SVMs) using features extracted from speech utterances and their labels indicating dysarthria/healthy. Classification accuracy results indicate that the glottal parameters contain discriminating information required for the identification of dysarthria. Additionally, the complementary nature of the glottal parameters is demonstrated when these parameters, in combination with the openSMILE-based acoustic features, result in improved classification accuracy. Analysis of classification accuracies of the glottal and openSMILE features for non-words, words and sentences is carried out. Results indicate that in terms of classification accuracy the word level is best suited in identifying the presence of dysarthria.
引用
收藏
页码:3403 / 3407
页数:5
相关论文
共 50 条
  • [31] Automated classification of facial expressions using bag of visual words and texture-based features
    Harrati, Nouzha
    Bouchrika, Imed
    Tari, Abdelkamel
    Ladjailia, Ammar
    2015 16TH INTERNATIONAL CONFERENCE ON SCIENCES AND TECHNIQUES OF AUTOMATIC CONTROL AND COMPUTER ENGINEERING (STA), 2015, : 363 - 367
  • [32] Automatic assessment of intelligibility in speakers with dysarthria from coded telephone speech using glottal features
    Narendra, N. P.
    Alku, Paavo
    COMPUTER SPEECH AND LANGUAGE, 2021, 65
  • [33] Classification of Non-Conventional Ships Using a Neural Bag-Of-Words Mechanism
    Polap, Dawid
    Wlodarczyk-Sielicka, Marta
    SENSORS, 2020, 20 (06)
  • [34] Speech/non-speech classification using multiple features for robust endpoint detection
    Shin, WH
    Lee, BS
    Lee, YK
    Lee, JS
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1399 - 1402
  • [35] HEp-2 image classification using intensity order pooling based features and bag of words
    Shen, Linlin
    Lin, Jiaming
    Wu, Shengyin
    Yu, Shiqi
    PATTERN RECOGNITION, 2014, 47 (07) : 2419 - 2427
  • [36] Speech/Music Classification Using Features From Spectral Peaks
    Bhattacharjee, Mrinmoy
    Prasanna, S. R. Mahadeva
    Guha, Prithwijit
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 (28) : 1549 - 1559
  • [37] Disaster damage assessment from the tweets using the combination of statistical features and informative words
    Madichetty, Sreenivasulu
    Sridevi, M.
    SOCIAL NETWORK ANALYSIS AND MINING, 2019, 9 (01)
  • [38] Disaster damage assessment from the tweets using the combination of statistical features and informative words
    Sreenivasulu Madichetty
    M. Sridevi
    Social Network Analysis and Mining, 2019, 9
  • [39] Emotion Recognition from Speech Using the Bag-of-Visual Words on Audio Segment Spectrograms
    Spyrou, Evaggelos
    Nikopoulou, Rozalia
    Vernikos, Ioannis
    Mylonas, Phivos
    TECHNOLOGIES, 2019, 7 (01)
  • [40] Silent EEG-Speech Recognition Using Convolutional and Recurrent Neural Network with 85% Accuracy of 9 Words Classification
    Vorontsova, Darya
    Menshikov, Ivan
    Zubov, Aleksandr
    Orlov, Kirill
    Rikunov, Peter
    Zvereva, Ekaterina
    Flitman, Lev
    Lanikin, Anton
    Sokolova, Anna
    Markov, Sergey
    Bernadotte, Alexandra
    SENSORS, 2021, 21 (20)