A multi-modal deep learning solution for precise pneumonia diagnosis: the PneumoFusion-Net model

被引:0
作者
Wang, Yujie [1 ,2 ]
Liu, Can [1 ]
Fan, Yinghan [1 ]
Niu, Chenyue [1 ]
Huang, Wanyun [1 ]
Pan, Yixuan [3 ]
Li, Jingze [1 ,2 ]
Wang, Yilin [1 ]
Li, Jun [1 ,4 ,5 ]
机构
[1] Sichuan Agr Univ, Coll Informat Engn, Yaan, Peoples R China
[2] Sichuan Agr Univ, Deep Vis Agr Lab, Yaan, Peoples R China
[3] Sichuan Agr Univ, Coll Sci, Yaan, Peoples R China
[4] Agr Informat Engn Higher Inst Key Lab Sichuan Prov, Yaan, Peoples R China
[5] Yaan Digital Agr Engn Technol Res Ctr, Yaan, Peoples R China
关键词
pneumonia classification; deep learning; multimodal framework; clinical data integration; PneumoFuison-Net; TRANSFORMER; FUSION; PREDICTION; PROTEIN;
D O I
10.3389/fphys.2025.1512835
中图分类号
Q4 [生理学];
学科分类号
071003 ;
摘要
Background Pneumonia is considered one of the most important causes of morbidity and mortality in the world. Bacterial and viral pneumonia share many similar clinical features, thus making diagnosis a challenging task. Traditional diagnostic method developments mainly rely on radiological imaging and require a certain degree of consulting clinical experience, which can be inefficient and inconsistent. Deep learning for the classification of pneumonia in multiple modalities, especially integrating multiple data, has not been well explored.Methods The study introduce the PneumoFusion-Net, a deep learning-based multimodal framework that incorporates CT images, clinical text, numerical lab test results, and radiology reports for improved diagnosis. In the experiments, a dataset of 10,095 pneumonia CT images was used-including associated clinical data-most of which was used for training and validation while keeping part of it for validation on a held-out test set. Five-fold cross-validation was considered in order to evaluate this model, calculating different metrics including accuracy and F1-Score.Results PneumoFusion-Net, which achieved 98.96% classification accuracy with a 98% F1-score on the held-out test set, is highly effective in distinguishing bacterial from viral types of pneumonia. This has been highly beneficial for diagnosis, reducing misdiagnosis and further improving homogeneity across various data sets from multiple patients.Conclusion PneumoFusion-Net offers an effective and efficient approach to pneumonia classification by integrating diverse data sources, resulting in high diagnostic accuracy. Its potential for clinical integration could significantly reduce the burden of pneumonia diagnosis by providing radiologists and clinicians with a robust, automated diagnostic tool.
引用
收藏
页数:23
相关论文
共 67 条
  • [1] Efficient Skip Connections-Based Residual Network (ESRNet) for Brain Tumor Classification
    Ashwini, B.
    Kaur, Manjit
    Singh, Dilbag
    Roy, Satyabrata
    Amoon, Mohammed
    [J]. DIAGNOSTICS, 2023, 13 (20)
  • [2] Multimodal Machine Learning: A Survey and Taxonomy
    Baltrusaitis, Tadas
    Ahuja, Chaitanya
    Morency, Louis-Philippe
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (02) : 423 - 443
  • [3] Automatic COVID-19 Detection Using Exemplar Hybrid Deep Features with X-ray Images
    Barua, Prabal Datta
    Gowdh, Nadia Fareeda Muhammad
    Rahmat, Kartini
    Ramli, Norlisah
    Ng, Wei Lin
    Chan, Wai Yee
    Kuluozturk, Mutlu
    Dogan, Sengul
    Baygin, Mehmet
    Yaman, Orhan
    Tuncer, Turker
    Wen, Tao
    Cheong, Kang Hao
    Acharya, U. Rajendra
    [J]. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (15)
  • [4] Multi-layer perceptron classification & quantification of neuronal survival in hypoxic-ischemic brain image slices using a novel gradient direction, grey level co-occurrence matrix image training
    Bhattacharya, Saheli
    Bennet, Laura
    Davidson, Joanne O.
    Unsworth, Charles P.
    [J]. PLOS ONE, 2022, 17 (12):
  • [5] Multimodal Data Fusion Based on Mutual Information
    Bramon, Roger
    Boada, Imma
    Bardera, Anton
    Rodriguez, Joaquim
    Feixas, Miquel
    Puig, Josep
    Sbert, Mateu
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2012, 18 (09) : 1574 - 1587
  • [6] A General Survey on Attention Mechanisms in Deep Learning
    Brauwers, Gianni
    Frasincar, Flavius
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (04) : 3279 - 3298
  • [7] MLP (muscle LIM protein) as a stress sensor in the heart
    Buyandelger, Byambajav
    Ng, Keat-Eng
    Miocic, Snjezana
    Piotrowska, Izabela
    Gunkel, Sylvia
    Ku, Ching-Hsin
    Knoell, Ralph
    [J]. PFLUGERS ARCHIV-EUROPEAN JOURNAL OF PHYSIOLOGY, 2011, 462 (01): : 135 - 142
  • [8] DO-Conv: Depthwise Over-Parameterized Convolutional Layer
    Cao, Jinming
    Li, Yangyan
    Sun, Mingchao
    Chen, Ying
    Lischinski, Dani
    Cohen-Or, Daniel
    Chen, Baoquan
    Tu, Changhe
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 3726 - 3736
  • [9] Castaneda Christian, 2015, J Clin Bioinforma, V5, P4, DOI 10.1186/s13336-015-0019-3
  • [10] Viral and bacterial co-infection in pneumonia: do we know enough to improve clinical care?
    Cawcutt, Kelly A.
    Kalil, Andre C.
    [J]. CRITICAL CARE, 2017, 21