Automatic Early Detection of Amyotrophic Lateral Sclerosis from Intelligible Speech Using Convolutional Neural Networks

Cited by: 43

Authors:

An, KwangHoon [1]

Kim, Myungjong [1]

Teplansky, Kristin [1,2]

Green, Jordan R. [3]

Campbell, Thomas F. [2]

Yunusova, Yana [4]

Heitzman, Daragh [5]

Wang, Jun [1,2]

Affiliations:

[1] Univ Texas Dallas, Speech Disorders & Technol Lab, Dept Bioengn, Richardson, TX 75083 USA

[2] Univ Texas Dallas, Callier Ctr Commun Disorders, Richardson, TX 75083 USA

[3] MGH Inst Hlth Profess, Dept Commun Sci & Disorders, Boston, MA USA

[4] Univ Toronto, Dept Speech Language Pathol, Toronto, ON, Canada

[5] Texas Neurol, MDA ALS Ctr, Dallas, TX USA

Source:

19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES | 2018

Funding:

National Institutes of Health (NIH), USA

Keywords:

amyotrophic lateral sclerosis; human-computer interaction; computational paralinguistics; BULBAR;

DOI:

10.21437/Interspeech.2018-2496

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Amyotrophic lateral sclerosis (ALS) is a rapidly progressive neurodegenerative disease of the motor system that leads to the impairment of speech and swallowing functions. The lack of a biomarker typically causes a diagnostic delay. To advance the current diagnostic process, we explored the feasibility of automatically detecting patients with ALS at an early stage from highly intelligible speech. A speech dataset was collected from thirteen newly diagnosed patients with ALS and thirteen age- and gender-matched healthy controls. Convolutional Neural Networks (CNNs), including a time-domain CNN and a frequency-domain CNN, were used to classify the intelligible speech produced by patients with ALS and that produced by healthy individuals. Experimental results indicated that both the time-CNN and the frequency-CNN outperformed a standard neural network. The best sample-level sensitivity and specificity were obtained by the time-CNN (71.6% and 80.9%, respectively). When multiple samples were used to vote to estimate person-level performance, the best result was obtained by the frequency-CNN (76.9% sensitivity and 92.3% specificity). The results demonstrate the possibility of early detection of ALS from intelligible speech signals.
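
The distinction the abstract draws between sample-level and person-level performance can be illustrated with a minimal sketch. It assumes a simple majority vote over each speaker's sample predictions, with ties resolved as "control"; the abstract only says that multiple samples were used to vote, so the exact rule, the function names, and the toy speakers and predictions below are all hypothetical and are not the study's data. With thirteen speakers per group, the reported person-level figures are consistent with 10 of 13 patients and 12 of 13 controls being classified correctly (10/13 ≈ 76.9%, 12/13 ≈ 92.3%); that is an inference from the percentages, not a count stated in the abstract.

```python
from collections import defaultdict


def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels: 1 = ALS, 0 = control."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    return tp / (tp + fn), tn / (tn + fp)


def person_level_vote(speaker_ids, sample_preds):
    """Collapse per-sample predictions into one prediction per speaker.

    Assumption: strict majority vote; a tie is labelled as control (0).
    """
    votes = defaultdict(list)
    for spk, pred in zip(speaker_ids, sample_preds):
        votes[spk].append(pred)
    return {spk: int(2 * sum(v) > len(v)) for spk, v in votes.items()}


# Hypothetical toy data: two ALS speakers and two controls, three samples each.
speakers = ["als1"] * 3 + ["als2"] * 3 + ["hc1"] * 3 + ["hc2"] * 3
true_label = {"als1": 1, "als2": 1, "hc1": 0, "hc2": 0}
sample_true = [true_label[s] for s in speakers]
sample_pred = [1, 1, 0,  0, 0, 1,  0, 0, 0,  1, 0, 0]

# Sample level: every utterance counts as one decision.
sample_sens, sample_spec = sensitivity_specificity(sample_true, sample_pred)

# Person level: one voted decision per speaker.
person_pred = person_level_vote(speakers, sample_pred)
ids = sorted(person_pred)
person_sens, person_spec = sensitivity_specificity(
    [true_label[s] for s in ids], [person_pred[s] for s in ids]
)

print(f"sample-level: sensitivity={sample_sens:.3f}, specificity={sample_spec:.3f}")
print(f"person-level: sensitivity={person_sens:.3f}, specificity={person_spec:.3f}")
```

In this toy case the single false alarm for `hc2` is outvoted, so person-level specificity rises above the sample-level value. This shows how voting can smooth isolated sample errors, while a speaker whose samples are mostly misclassified (here `als2`) is still missed.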

Pages: 1913-1917

Page count: 5

