Vision Transformer for Parkinson's Disease Classification using Multilingual Sustained Vowel Recordings

被引:5
|
作者
Hemmerling, Daria [1 ]
Wodzinski, Marek [1 ,2 ]
Orozco-Arroyave, Juan Rafael [3 ,4 ]
Sztaho, David [5 ]
Daniol, Mateusz [1 ]
Jemiolo, Pawel [1 ]
Wojcik-Pedziwiatr, Magdalena [6 ]
机构
[1] AGH Univ Sci & Technol, Fac Elect Engn Automat Comp Sci & Biomed Engn, Krakow, Poland
[2] Univ Appl Sci Western Switzerland, Inst Informat Syst, HES SO Valais, Sierre, Switzerland
[3] Univ Antioquia, Medellin, Colombia
[4] Univ Erlangen Nurnberg, Pattern Recognit Lab, Erlangen, Germany
[5] Budapest Univ Technol & Econ, Dept Telecommun & Media Informat, Budapest, Hungary
[6] Krakow Univ, Dept Neurol, Krakow, Poland
来源
2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC | 2023年
关键词
Deep Learning; Vision Transformer; Voice Processing; Neurodegenerative Diseases; Hypokinetic Dysarthria;
D O I
10.1109/EMBC40787.2023.10340478
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Parkinson's disease (PD) is the 2(nd) most prevalent neurodegenerative disease in the world. Thus, the early detection of PD has recently been the subject of several scientific and commercial studies. In this paper, we propose a pipeline using Vision Transformer applied to mel-spectrograms for PD classification using multilingual sustained vowel recordings. Furthermore, our proposed transformed-based model shows a great potential to use voice as a single modality biomarker for automatic PD detection without language restrictions, a wide range of vowels, with an F1-score equal to 0.78. The results of our study fall within the range of the estimated prevalence of voice and speech disorders in Parkinson's disease, which ranges from 70-90%. Our study demonstrates a high potential for adaptation in clinical decision-making, allowing for increasingly systematic and fast diagnosis of PD with the potential for use in telemedicine.
引用
收藏
页数:4
相关论文
共 50 条
  • [21] Automated Ischemic Stroke Classification from MRI Scans: Using a Vision Transformer Approach
    Abbaoui, Wafae
    Retal, Sara
    Ziti, Soumia
    El Bhiri, Brahim
    JOURNAL OF CLINICAL MEDICINE, 2024, 13 (08)
  • [22] Waste classification using vision transformer based on multilayer hybrid convolution neural network
    Alrayes, Fatma S.
    Asiri, Mashael M.
    Maashi, Mashael S.
    Nour, Mohamed K.
    Rizwanullah, Mohammed
    Osman, Azza Elneil
    Drar, Suhanda
    Zamani, Abu Sarwar
    URBAN CLIMATE, 2023, 49
  • [23] A hybrid Framework for plant leaf disease detection and classification using convolutional neural networks and vision transformer
    Aboelenin, Sherihan
    Elbasheer, Foriaa Ahmed
    Eltoukhy, Mohamed Meselhy
    El-Hady, Walaa M.
    Hosny, Khalid M.
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (02)
  • [24] Multilingual Analysis of Speech and Voice Disorders in Patients with Parkinson's Disease
    Kovac, Daniel
    Mekyska, Jiri
    Galaz, Zoltan
    Brabenec, Lubos
    Kostalova, Milena
    Rapcsak, Steven Z.
    Rektorova, Irena
    2021 44TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2021, : 273 - 277
  • [25] Satellite Images Analysis and Classification using Deep Learning-based Vision Transformer Model
    Adegun, Adekanmi Adeyinka
    Viriri, Serestina
    2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 1275 - 1279
  • [26] Automated classification of remote sensing satellite images using deep learning based vision transformer
    Adegun, Adekanmi
    Viriri, Serestina
    Tapamo, Jules-Raymond
    APPLIED INTELLIGENCE, 2024, 54 (24) : 13018 - 13037
  • [27] Classification of Mobile-Based Oral Cancer Images Using the Vision Transformer and the Swin Transformer
    Song, Bofan
    Raj, Dharma K. C.
    Yang, Rubin Yuchan
    Li, Shaobai
    Zhang, Chicheng
    Liang, Rongguang
    CANCERS, 2024, 16 (05)
  • [28] Lightweight vision image transformer (LViT) model for skin cancer disease classification
    Dwivedi, Tanay
    Chaurasia, Brijesh Kumar
    Shukla, Man Mohan
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2024, 15 (10) : 5030 - 5055
  • [29] A Multitask Learning-Based Vision Transformer for Plant Disease Localization and Classification
    Hemalatha, S.
    Jayachandran, Jai Jaganath Babu
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2024, 17 (01)
  • [30] Language Generalization Using Active Learning in the Context of Parkinson's Disease Classification
    Moreno-Acevedo, S. A.
    Rios-Urrego, C. D.
    Vasquez-Correa, J. C.
    Rusz, J.
    Noeth, E.
    Orozco-Arroyave, J. R.
    TEXT, SPEECH, AND DIALOGUE, TSD 2023, 2023, 14102 : 349 - 359