Speech signal-based accurate neurological disorders detection using convolutional neural network and recurrent neural network based deep network

被引:2
作者
Soylu, Emel [1 ]
Guel, Sema [2 ]
Koca, Kuebra Aslan [1 ]
Tuerkoglu, Muammer [1 ]
Terzi, Murat [1 ,3 ]
Senguer, Abdulkadir [4 ]
机构
[1] Samsun Univ, Dept Software Engn, Fac Engn & Nat Sci, Samsun, Turkiye
[2] Ondokuz Mayis Univ, Grad Inst, Dept Neurosci, Samsun, Turkiye
[3] Ondokuz Mayis Univ, Fac Med, Dept Neurol, Samsun, Turkiye
[4] Firat Univ, Fac Technol, Dept Elect & Elect Engn, Elazig, Turkiye
关键词
Deep learning; Audio classification; Neurological diseases; Recurrent neural network; Gated recurrent unit; PARKINSONS-DISEASE; MULTIPLE-SCLEROSIS; EARLY-DIAGNOSIS; ALZHEIMERS-DISEASE; ACOUSTIC ANALYSIS; VOICE DISORDERS; SOUND; CLASSIFICATION; HEART; IDENTIFICATION;
D O I
10.1016/j.engappai.2025.110558
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Neurological diseases often manifest in subtle alterations to the human voice due to damage in the sound-related regions of the brain. Leveraging advancements in artificial intelligence (AI) technologies, computers can discern minute variations in sound imperceptible to the human ear, enabling rapid and precise diagnostic support. This paper presents a novel approach to neurological disease classification utilizing voice recordings of individuals diagnosed with various neurological conditions alongside healthy controls. By employing AI techniques, particularly a hybrid deep network framework integrating Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN), we aimed to classify one-sentence audio inputs of Multiple Sclerosis (MS) patients, healthy individuals, and other neurological diseases. In our dataset, we have compiled audio recordings from 95 healthy individuals, 99 individuals diagnosed with multiple sclerosis (MS), and 96 individuals with other neurological disorders. Of these, 20 % of the data was reserved for testing. Our proposed architecture achieved remarkable performance metrics in experimental evaluations, exhibiting 96.55 % accuracy, 98.25 % specificity, 96.49 % sensitivity, 96.97 % precision, and 96.56 % F1-Score. The results obtained are more successful compared to the methods of AlexNet from scratch, fine-tuned AlexNet, Long Short-Term Memory (LSTM) based CNN, and Gated Recurrent Unit (GRU) based CNN. The results of our study highlight the potential of this framework to be integrated into clinical diagnostic workflows, providing clinicians with an effective tool for early and precise detection of neurological diseases, ultimately improving patient outcomes through timely intervention and personalized treatment strategies.
引用
收藏
页数:15
相关论文
共 122 条
[1]   Delay in the diagnosis of multiple sclerosis in Croatia [J].
Adamec, Ivan ;
Barun, Barbara ;
Gabelic, Tereza ;
Zadro, Ivana ;
Habek, Mario .
CLINICAL NEUROLOGY AND NEUROSURGERY, 2013, 115 :S70-S72
[2]   A comprehensive review of synthetic data generation in smart farming by using variational autoencoder and generative adversarial network [J].
Akkem, Yaganteeswarudu ;
Biswas, Saroj Kumar ;
Varanasi, Aruna .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 131
[3]   Smart farming using artificial intelligence: A review [J].
Akkem, Yaganteeswarudu ;
Biswas, Saroj Kumar ;
Varanasi, Aruna .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 120
[4]   RETRACTED: Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning (Retracted Article) [J].
Alghamdi, Norah Saleh ;
Zakariah, Mohammed ;
Hoang, Vinh Truong ;
Elahi, Mohammad Mamun .
COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2022, 2022
[5]   Detecting Parkinson's disease with sustained phonation and speech signals using machine learning techniques [J].
Almeida, Jefferson S. ;
Reboucas Filho, Pedro R. ;
Carneiro, Tiago ;
Wei, Wei ;
Damasevicius, Robertas ;
Maskeliunas, Rytis ;
de Albuquerque, Victor Hugo C. .
PATTERN RECOGNITION LETTERS, 2019, 125 :55-62
[6]   Effect of Combined Respiratory Muscle Training (cRMT) on Dysphonia following Single CVA: A Retrospective Pilot Study [J].
Arnold, Robert J. ;
Gaskill, Christopher S. ;
Bausek, Nina .
JOURNAL OF VOICE, 2023, 37 (04) :529-538
[7]  
Asfiati S., 2020, J. Phys. Conf., V12063
[8]   Voice Quality Evaluation in Patients With COVID-19: An Acoustic Analysis [J].
Asiaee, Maral ;
Vahedian-azimi, Amir ;
Atashi, Seyed Shahab ;
Keramatfar, Abdalsamad ;
Nourbakhsh, Mandana .
JOURNAL OF VOICE, 2022, 36 (06) :879.e13-879.e19
[9]  
Ball LJ, 2002, J MED SPEECH-LANG PA, V10, P231
[10]  
Becker G., 2002, J NEUROL, V249, P40, DOI DOI 10.1007/S00415-002-1309-9