USING ACOUSTIC DEEP NEURAL NETWORK EMBEDDINGS TO DETECT MULTIPLE SCLEROSIS FROM SPEECH

被引:8
作者
Gosztolya, Gabor [1 ,2 ]
Toth, Laszlo [1 ]
Svindt, Veronika [3 ]
Bona, Judit [4 ]
Hoffmann, Ildiko [3 ,5 ]
机构
[1] Univ Szeged, Inst Informat, Szeged, Hungary
[2] ELRN SZTE Res Grp Artificial Intelligence, Szeged, Hungary
[3] ELRN, Res Ctr Linguist, Budapest, Hungary
[4] Eotvos Lorand Univ, Dept Appl Linguist & Phonet, Budapest, Hungary
[5] Univ Szeged, Dept Linguist, Szeged, Hungary
来源
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022年
关键词
Multiple Sclerosis; medical speech processing; Deep Neural Networks; embeddings; x-vectors; DYSARTHRIA; LANGUAGE;
D O I
10.1109/ICASSP43922.2022.9746856
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Multiple sclerosis (MS) is a chronic inflammatory disease of the central nervous system. It affects cognitive and motor functions, and the limitation of executive functions can also manifest itself in speech production. Due to this, automatic speech analysis might serve as an effective technique for assessing MS, or for monitoring the status of the patient. However, choosing the features to be extracted from the recordings is not straightforward. In the past few years, general feature extractors such as i-vectors, d-vectors and x-vectors have found their way into automatic speech analysis. In this study we show that there is no need to employ a special neural network architecture such as x-vectors to calculate effective features, but (even more) indicative features can be derived on the basis of a standard Deep Neural Network acoustic model. From our results, these features could effectively be used to distinguish MS subjects from healthy controls, as we measured AUC scores up to 0.935. We found that classification performance depended only slightly on the choice of the hidden layer used to extract our features, but the speech task performed by the subject turned out to be an important factor.
引用
收藏
页码:6927 / 6931
页数:5
相关论文
共 21 条
[21]   Vocal symptoms and acoustic changes in relation to the expanded disability status scale, duration and stage of disease in patients with multiple sclerosis [J].
Yamout, Bassem ;
Fuleihan, Nabil ;
Hajj, Taghrid ;
Sibai, Abla ;
Sabra, Omar ;
Rifai, Hani ;
Hamdan, Abdul-Latif .
EUROPEAN ARCHIVES OF OTO-RHINO-LARYNGOLOGY, 2009, 266 (11) :1759-1765