COMBINING MULTIPLE KERNEL MODELS FOR AUTOMATIC INTELLIGIBILITY DETECTION OF PATHOLOGICAL SPEECH

被引:0
作者
Huang, Dong-Yan [1 ]
Dong, Minghui [1 ]
Li, Haizhou [1 ]
机构
[1] ASTAR, Inst Infocomm Res, Human Language Technol Dept, 21-01 Fusionopolis Way,Connexis South Tower, Singapore 138632, Singapore
来源
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS | 2016年
关键词
pathological speech; intelligibility; correlation structure feature; multiple kernel models;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Automatic detection of pathological voice is a challenging task in speech processing. Appropriate acoustic cues of voice can be used to differentiate between normal voices and pathological voices. We propose a method to represent each speech utterance using three types of speech signal representations (i.e.,cross-correlation matrix, Gaussian distribution and linear subspace) respectively. Various kernels were applied to these representations for measuring resemblance and difference. Four classifiers, i.e., KNN, kernel partial least squares, kernel SVM, and logistic regression, are studied for comparing their performance of classification. Finally, a simple fusion of learning classifiers from different acoustic representations was carried out at the score decision level for enhancing the performance. The different classifiers were evaluated on the Interspeech 2012 challenge development data set and test data set. Their effects in a fusion scheme are studied. The accuracy of the fusion system attained 78.0 % on test set, with an improved gain of 9.1 % over the challenge baseline 68.9 %.
引用
收藏
页码:6485 / 6489
页数:5
相关论文
共 44 条
  • [1] INTELLIGIBILITY DETECTION OF PATHOLOGICAL SPEECH USING ASYMMETRIC SPARSE KERNEL PARTIAL LEAST SQUARES CLASSIFIER
    Huang, Dong-Yan
    Dong, Minghui
    Li, Haizhou
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [2] Automatic intelligibility classification of sentence-level pathological speech
    Kim, Jangwon
    Kumar, Naveen
    Tsiartas, Andreas
    Li, Ming
    Narayanan, Shrikanth S.
    COMPUTER SPEECH AND LANGUAGE, 2015, 29 (01) : 132 - 144
  • [3] Intelligibility Classification of Pathological Speech Using Fusion of Multiple Subsystems
    Kim, Jangwon
    Kumar, Naveen
    Tsiartas, Andreas
    Li, Ming
    Narayanan, Shrikanth S.
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 534 - 537
  • [4] Impact of Speech Mode in Automatic Pathological Speech Detection
    Sheikh, Shakeel A.
    Kodrasi, Ina
    32ND EUROPEAN SIGNAL PROCESSING CONFERENCE, EUSIPCO 2024, 2024, : 81 - 85
  • [5] Feature analysis for automatic detection of pathological speech
    Dibazar, AA
    Narayanan, S
    Berger, TW
    SECOND JOINT EMBS-BMES CONFERENCE 2002, VOLS 1-3, CONFERENCE PROCEEDINGS: BIOENGINEERING - INTEGRATIVE METHODOLOGIES, NEW TECHNOLOGIES, 2002, : 182 - 183
  • [6] Combining phonological and acoustic ASR-free features for pathological speech intelligibility assessment
    Middag, Catherine
    Bocklet, Tobias
    Martens, Jean-Pierre
    Noeth, Elmar
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 3016 - +
  • [7] A MIXTURE OF EXPERTS APPROACH TOWARDS INTELLIGIBILITY CLASSIFICATION OF PATHOLOGICAL SPEECH
    Gupta, Rahul
    Audhkhasi, Kartik
    Narayanan, Shrikanth
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 1986 - 1990
  • [8] An automatic measure for speech intelligibility in dysarthrias-validation across multiple languages and neurological disorders
    Troeger, Johannes
    Doerr, Felix
    Schwed, Louisa
    Linz, Nicklas
    Koenig, Alexandra
    Thies, Tabea
    Orozco-Arroyave, Juan Rafael
    Rusz, Jan
    FRONTIERS IN DIGITAL HEALTH, 2024, 6
  • [9] Automated Intelligibility Assessment of Pathological Speech Using Phonological Features
    Middag, Catherine
    Martens, Jean-Pierre
    Van Nuffelen, Gwen
    De Bodt, Marc
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2009,
  • [10] Automated Intelligibility Assessment of Pathological Speech Using Phonological Features
    Catherine Middag
    Jean-Pierre Martens
    Gwen Van Nuffelen
    Marc De Bodt
    EURASIP Journal on Advances in Signal Processing, 2009