Automatic Early Detection of Amyotrophic Lateral Sclerosis from Intelligible Speech Using Convolutional Neural Networks

被引:43
作者
An, KwangHoon [1 ]
Kim, Myungjong [1 ]
Teplansky, Kristin [1 ,2 ]
Green, Jordan R. [3 ]
Campbell, Thomas F. [2 ]
Yunusova, Yana [4 ]
Heitzman, Daragh [5 ]
Wang, Jun [1 ,2 ]
机构
[1] Univ Texas Dallas, Speech Disorders & Technol Lab, Dept Bioengn, Richardson, TX 75083 USA
[2] Univ Texas Dallas, Callier Ctr Commun Disorders, Richardson, TX 75083 USA
[3] MGH Inst Hlth Profess, Dept Commun Sci & Disorders, Boston, MA USA
[4] Univ Toronto, Dept Speech Language Pathol, Toronto, ON, Canada
[5] Texas Neurol, MDA ALS Ctr, Dallas, TX USA
来源
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES | 2018年
基金
美国国家卫生研究院;
关键词
amyotrophic lateral sclerosis; human-computer interaction; computational paralinguistics; BULBAR;
D O I
10.21437/Interspeech.2018-2496
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Amyotrophic lateral sclerosis (ALS) is a rapidly progressive neurodegenerative disease of the motor system that leads to the impairment of speech and swallowing functions. The lack of a biomarker typically causes a diagnostic delay. To advance the current diagnostic process, we explored the feasibility of automatic detection of patients with ALS at an early stage from highly intelligible speech. A speech dataset was collected from thirteen newly diagnosed patients with ALS and thirteen age and gender-matched healthy controls. Convolutional Neural Networks (CNNs), including time-domain CNN and frequency-domain CNN, were used to classify the intelligible speech produced by patients with ALS and those by healthy individuals. Experimental results indicated both time- and frequency-CNN outperformed standard neural network. The best sample-level sensitivity and specificity were obtained by time-CNN (71.6% and 80.9%, respectively). When multiple samples were used to vote to estimate a person-level performance, the best result was obtained by frequency-CNN (76.9% sensitivity and 92.3% specificity). Results demonstrated the possibility of early detection of ALS from intelligible speech signals.
引用
收藏
页码:1913 / 1917
页数:5
相关论文
共 32 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]   Convolutional Neural Networks for Speech Recognition [J].
Abdel-Hamid, Ossama ;
Mohamed, Abdel-Rahman ;
Jiang, Hui ;
Deng, Li ;
Penn, Gerald ;
Yu, Dong .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (10) :1533-1545
[3]   The diagnostic utility of patient-report and speech-language pathologists' ratings for detecting the early onset of bulbar symptoms due to ALS [J].
Allison, Kristen M. ;
Yunusova, Yana ;
Campbell, Thomas F. ;
Wang, Jun ;
Berry, James D. ;
Green, Jordan R. .
AMYOTROPHIC LATERAL SCLEROSIS AND FRONTOTEMPORAL DEGENERATION, 2017, 18 (5-6) :358-366
[4]  
[Anonymous], 2013, Proceedings of the 21st ACM International Conference on Multimedia, DOI DOI 10.1145/2502081.2502224
[5]   Learned vs. Hand-Crafted Features for Pedestrian Gender Recognition [J].
Antipov, Grigory ;
Berrani, Sid-Ahmed ;
Ruchaud, Natacha ;
Dugelay, Jean-Luc .
MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, :1263-1266
[6]   El Escorial revisited: Revised criteria for the diagnosis of amyotrophic lateral sclerosis [J].
Brooks, BR ;
Miller, RG ;
Swash, M ;
Munsat, TL .
AMYOTROPHIC LATERAL SCLEROSIS AND OTHER MOTOR NEURON DISORDERS, 2000, 1 (05) :293-299
[7]  
Brown RH, 2017, NEW ENGL J MED, V377, P1602, DOI [10.1056/NEJMra1603471, 10.1056/NEJMc1710379, 10.1016/S0140-6736(10)61156-7, 10.1038/nrdp.2017.85, 10.1016/S0140-6736(17)31287-4]
[8]   The ALSFRS-R: a revised ALS functional rating scale that incorporates assessments of respiratory function [J].
Cedarbaum, JM ;
Stambler, N ;
Malta, E ;
Fuller, C ;
Hilt, D ;
Thurmond, B ;
Nakanishi, A .
JOURNAL OF THE NEUROLOGICAL SCIENCES, 1999, 169 (1-2) :13-21
[9]   A review of depression and suicide risk assessment using speech analysis [J].
Cummins, Nicholas ;
Scherer, Stefan ;
Krajewski, Jarek ;
Schnieder, Sebastian ;
Epps, Julien ;
Quatieri, Thomas F. .
SPEECH COMMUNICATION, 2015, 71 :10-49
[10]  
Falcone M, 2013, INT CONF ACOUST SPEE, P7577, DOI 10.1109/ICASSP.2013.6639136