Classification of Parkinson's disease from smartphone recording data using time-frequency analysis and convolutional neural network

被引:1
作者
Worasawate, Denchai [1 ]
Asawaponwiput, Warisara [1 ]
Yoshimura, Natsue [2 ]
Intarapanich, Apichart [3 ]
Surangsrirat, Decho [4 ]
机构
[1] Kasetsart Univ, Fac Engn, Dept Elect Engn, Bangkok, Thailand
[2] Tokyo Inst Technol, Inst Innovat Res, Yokohama, Kanagawa, Japan
[3] Natl Elect & Comp Technol Ctr, Educ Technol Team, Pathum Thani, Thailand
[4] Natl Sci & Technol Dev Agcy, Assist Technol & Med Devices Res Ctr, Pathum Thani, Thailand
关键词
PD voice; audio classification; convolutional neural network; mPower study; AUTOMATIC CLASSIFICATION;
D O I
10.3233/THC-220386
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
BACKGROUND: Parkinson's disease (PD) is a long-term neurodegenerative disease of the central nervous system. The current diagnosis is dependent on clinical observation and the abilities and experience of a trained specialist. One of the symptoms that affects most patients is voice impairment. OBJECTIVE: Voice samples are non-invasive data that can be collected remotely for diagnosis and disease progression monitoring. In this study, we analyzed voice recording data from a smartphone as a possible medical self-diagnosis tool by using only one-second voice recording. The data from one of the largest mobile PD studies, the mPower study, was used. METHODS: A total of 29,798 ten-second voice recordings on smartphone from 4,051 participants were used for the analysis. The voice recordings were from sustained phonation by participants saying /aa/ for ten seconds into an iPhone microphone. A dataset comprising 385,143 short one-second audio samples was generated from the original ten-second voice recordings. The samples were converted to a spectrogram using a short-time Fourier transform. CNN models were then applied to classify the samples. RESULTS: Classification accuracies of the proposed method with LeNet-5, ResNet-50, and VGGNet-16 are 97.7 +/- 0.1%, 98.6 +/- 0.2%, and 99.3 +/- 0.1%, respectively. CONCLUSIONS: We achieve a respectable classification performance using a generalized approach on a dataset with a large number of samples. The result emphasizes that an analysis based on one-second clip recorded on a smartphone could be a promising non-invasive and remotely available PD biomarker.
引用
收藏
页码:705 / 718
页数:14
相关论文
共 42 条
  • [1] Abujrida H, 2017, 2017 IEEE-NIH HEALTHCARE INNOVATIONS AND POINT OF CARE TECHNOLOGIES (HI-POCT), P208, DOI 10.1109/HIC.2017.8227621
  • [2] Developing a large scale population screening tool for the assessment of Parkinson's disease using telephone-quality voice
    Arora, Siddharth
    Baghai-Ravary, Ladan
    Tsanas, Athanasios
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 145 (05) : 2871 - 2884
  • [3] An Expert Diagnostic System to Automatically Identify Asthma and Chronic Obstructive Pulmonary Disease in Clinical Settings
    Badnjevic, Almir
    Gurbeta, Lejla
    Custovic, Eddie
    [J]. SCIENTIFIC REPORTS, 2018, 8
  • [4] The mPower study, Parkinson disease mobile data collected using ResearchKit
    Bot, Brian M.
    Suver, Christine
    Neto, Elias Chaibub
    Kellen, Michael
    Klein, Arno
    Bare, Christopher
    Doerr, Megan
    Pratap, Abhishek
    Wilbanks, John
    Dorsey, E. Ray
    Friend, Stephen H.
    Trister, Andrew D.
    [J]. SCIENTIFIC DATA, 2016, 3
  • [5] Application of Neural Networks for classification of Patau, Edwards, Down, Turner and Klinefelter Syndrome based on first trimester maternal serum screening data, ultrasonographic findings and patient demographics
    Catic, Aida
    Gurbeta, Lejla
    Kurtovic-Kozaric, Amina
    Mehmedbasic, Senad
    Badnjevic, Almir
    [J]. BMC MEDICAL GENOMICS, 2018, 11
  • [6] Feature selection and extraction for class prediction in dysphonia measures analysis: A case study on Parkinson's disease speech rehabilitation
    El Moudden, Ismail
    Ouzir, Mounir
    ElBernoussi, Souad
    [J]. TECHNOLOGY AND HEALTH CARE, 2017, 25 (04) : 693 - 708
  • [7] Gao Y., 2020, Frontiers in Neurology, V11
  • [8] Giuliano M, 2019, SYMP IMAG SIG PROC A
  • [9] Grinstein E, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P586, DOI 10.1109/ICASSP.2018.8461711
  • [10] Automatic Classification on Multi-Modal MRI Data for Diagnosis of the Postural Instability and Gait Difficulty Subtype of Parkinson's Disease
    Gu, Quanquan
    Zhang, Huan
    Xuan, Min
    Luo, Wei
    Huang, Peiyu
    Xia, Shunren
    Zhang, Minming
    [J]. JOURNAL OF PARKINSONS DISEASE, 2016, 6 (03) : 545 - 556