Dysarthric speech classification using glottal features computed from non-words, words and sentences

被引:35
|
作者
Narendra, N. P. [1 ]
Alku, Paavo [1 ]
机构
[1] Aalto Univ, Dept Signal Proc & Acoust, Espoo, Finland
基金
芬兰科学院;
关键词
Dysarthric speech; glottal source; glottal parameters; openSMILE; support vector machines; INTELLIGIBILITY; DATABASE; QUALITY;
D O I
10.21437/Interspeech.2018-1059
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dysarthria is a neuro-motor disorder resulting from the disruption of normal activity in speech production leading to slow, slurred and imprecise (low intelligible) speech. Automatic classification of dysarthria from speech can be used as a potential clinical tool in medical treatment. This paper examines the effectiveness of glottal source parameters in dysarthric speech classification from three categories of speech signals, namely non-words, words and sentences. In addition to the glottal parameters, two sets of acoustic parameters extracted by the openSMILE toolkit are used as baseline features. A dysarthric speech classification system is proposed by training support vector machines (SVMs) using features extracted from speech utterances and their labels indicating dysarthria/healthy. Classification accuracy results indicate that the glottal parameters contain discriminating information required for the identification of dysarthria. Additionally, the complementary nature of the glottal parameters is demonstrated when these parameters, in combination with the openSMILE-based acoustic features, result in improved classification accuracy. Analysis of classification accuracies of the glottal and openSMILE features for non-words, words and sentences is carried out. Results indicate that in terms of classification accuracy the word level is best suited in identifying the presence of dysarthria.
引用
收藏
页码:3403 / 3407
页数:5
相关论文
共 50 条
  • [1] Dysarthric speech classification from coded telephone speech using glottal features
    Narendra, N. P.
    Alku, Paavo
    SPEECH COMMUNICATION, 2019, 110 : 47 - 55
  • [2] Speech production:: Phonetic encoding of real and non-words
    Klecková, J
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2003, 2807 : 281 - 286
  • [3] Decision Tree Technique for Arabic Sentences Classification with Preprocessing of NLP by Using of Words Features
    Al-Rufaye, Faiez Musa Lahmood
    Mhaibes, Hakeem Imad
    2022 SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL, COMPUTING, COMMUNICATION AND SUSTAINABLE TECHNOLOGIES (ICAECT), 2022,
  • [4] PERCEPTION OF SENTENCES, WORDS, AND SPEECH FEATURES BY PROFOUNDLY HEARING-IMPAIRED CHILDREN USING A MULTICHANNEL ELECTROTACTILE SPEECH PROCESSOR
    COWAN, RSC
    BLAMEY, PJ
    GALVIN, KL
    SARANT, JZ
    ALCANTARA, JI
    CLARK, GM
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1990, 88 (03): : 1374 - 1384
  • [5] Acoustic analysis and perception of emotions in hindi speech using words and sentences
    Bansal S.
    Agrawal S.S.
    Kumar A.
    International Journal of Information Technology, 2019, 11 (4) : 807 - 812
  • [6] SPEECH DISCRIMINATION TASK USING MULTIPLE-CHOICE KEY WORDS IN SENTENCES
    BERGER, KW
    JOURNAL OF AUDITORY RESEARCH, 1969, 9 (03): : 247 - 262
  • [7] A Novel Dysarthric Speech Synthesis system using Tacotron2 for specific and OOV words
    Bharti, Komal
    Haque, Samiul
    Das, Pradip K.
    2024 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, SPCOM 2024, 2024,
  • [8] Imagined Speech Classification Using Six Phonetically Distributed Words
    Varshney, Yash V.
    Khan, Azizuddin
    FRONTIERS IN SIGNAL PROCESSING, 2022, 2
  • [9] Dysarthric Speech Classification Using Hierarchical Multilayer Perceptrons and Posterior Rhythmic Features
    Selouani, Sid-Ahmed
    Dahmani, Habiba
    Amami, Riadh
    Hamam, Habib
    SOFT COMPUTING MODELS IN INDUSTRIAL AND ENVIRONMENTAL APPLICATIONS, 6TH INTERNATIONAL CONFERENCE SOCO 2011, 2011, 87 : 437 - 444
  • [10] ADHD Classification Using Bag of Words Approach on Network Features
    Solmaz, Berkan
    Dey, Soumyabrata
    Rao, A. Ravishankar
    Shah, Mubarak
    MEDICAL IMAGING 2012: IMAGE PROCESSING, 2012, 8314