A Deep Learning-Based Multimodal Architecture to predict Signs of Dementia

被引:4
|
作者
Ortiz-Perez, David [1 ]
Ruiz-Ponce, Pablo [1 ]
Tomas, David [2 ]
Garcia-Rodriguez, Jose [1 ]
Vizcaya-Moreno, M. Flores [3 ]
Leo, Marco [4 ]
机构
[1] Univ Alicante, Dept Comp Sci & Technol, Carretera San Vicente Raspeig, Alicante 03690, Spain
[2] Univ Alicante, Dept Software & Comp Syst, Carretera San Vicente Raspeig, Alicante 03690, Spain
[3] Univ Alicante, Fac Hlth Sci, Unit Clin Nursing Res, Carretera San Vicente Raspeig, Alicante 03690, Spain
[4] Natl Res Council Italy, Inst Appl Sci & Intelligent Syst, I-73100 Lecce, Italy
关键词
Multimodal; Deep learning; Transformers; Dementia prediction;
D O I
10.1016/j.neucom.2023.126413
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a multimodal deep learning architecture combining text and audio information to predict dementia, a disease which affects around 55 million people all over the world and makes them in some cases dependent people. The system was evaluated on the DementiaBank Pitt Corpus dataset, which includes audio recordings as well as their transcriptions for healthy people and people with dementia. Different models have been used and tested, including Convolutional Neural Networks (CNN) for audio classification, Transformers for text classification, and a combination of both in a multimodal ensemble. These models have been evaluated on a test set, obtaining the best results by using the text modality, achieving 90.36% accuracy on the task of detecting dementia. Additionally, an analysis of the corpus has been conducted for the sake of explainability, aiming to obtain more information about how the models generate their predictions and identify patterns in the data. & COPY; 2023 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页数:10
相关论文
共 50 条
  • [41] DEEP LEARNING-BASED ELECTROCARDIOGRAM ANALYSIS TO PREDICT MORTALITY IN REPAIRED TETRALOGY OF FALLOT
    Van Boxtel, Juul
    Mayourian, Joshua
    Sleeper, Lynn
    Diwanji, Vedang
    Geva, Alon
    O'Leary, Edward
    Triedman, John K.
    Ghelani, Sunil J.
    Valente, Anne Marie
    Geva, Tal
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2024, 83 (13) : 1581 - 1581
  • [42] A Deep Learning-based Model to Predict Limb Amputation in Peripheral Vascular Trauma
    Kania, Thomas A.
    Patel, Hardik
    Introna, Leonard
    Kimyaghalam, Ali
    Arjmand, Shadi
    Joutovsky, Boris
    Younan, Duraid
    Singh, Kuldeep
    JOURNAL OF VASCULAR SURGERY, 2023, 77 (06) : E225 - E225
  • [43] A SELF-ADAPTIVE DEEP LEARNING-BASED MODEL TO PREDICT CLOUD WORKLOAD
    Borna, K.
    Ghanbari, R.
    NEURAL NETWORK WORLD, 2023, 33 (03) : 161 - 169
  • [44] DeepSP: Deep learning-based spatial properties to predict monoclonal antibody stability
    Kalejaye, Lateefat
    Wu, I-En
    Terry, Taylor
    Lai, Pin-Kuang
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2024, 23 : 2220 - 2229
  • [45] A review of deep learning-based information fusion techniques for multimodal medical image classification
    Li Y.
    El Habib Daho M.
    Conze P.-H.
    Zeghlache R.
    Le Boité H.
    Tadayoni R.
    Cochener B.
    Lamard M.
    Quellec G.
    Computers in Biology and Medicine, 2024, 177
  • [46] Deep learning-based 3D brain multimodal medical image registration
    Liwei Deng
    Qi Lan
    Qiang Zhi
    Sijuan Huang
    Jing Wang
    Xin Yang
    Medical & Biological Engineering & Computing, 2024, 62 : 505 - 519
  • [47] Multimodal Deep Learning-Based Prediction of Immune Checkpoint Inhibitor Efficacy in Brain Metastases
    Bodenmann, Tobias R.
    Gil, Nelson
    Dorfner, Felix J.
    Cleveland, Mason C.
    Patel, Jay B.
    Brahmavar, Shreyas Bhat
    Guelen, Melisa S.
    Pulido-Arias, Dagoberto
    Kalpathy-Cramer, Jayashree
    Thiran, Jean-Philippe
    Rosen, Bruce R.
    Gerstner, Elizabeth
    Kim, Albert E.
    Bridge, Christopher P.
    CANCER PREVENTION, DETECTION, AND INTERVENTION, CAPTION 2024, 2025, 15199 : 37 - 47
  • [48] Deep learning-based late fusion of multimodal information for emotion classification of music video
    Pandeya, Yagya Raj
    Lee, Joonwhoan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (02) : 2887 - 2905
  • [49] Multimodal Deep Learning-based Feature Fusion for Object Detection in Remote Sensing Images
    Yin, Shoulin
    Wang, Qunming
    Wang, Liguo
    Ivanovic, Mirjana
    Li, Hang
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2025, 22 (01) : 327 - 344
  • [50] Deep learning-based multimodal integration of histology and genomics improves cancer origin prediction
    Shaban, Muhammad
    Lu, Ming Y.
    Williamson, Drew F. K.
    Chen, Richard J.
    Lipkova, Jana
    Chen, Tiffany Y.
    Mahmood, Faisal
    CANCER RESEARCH, 2023, 83 (02)