MultiFusionNet: multilayer multimodal fusion of deep neural networks for chest X-ray image classification

被引:0
作者
Agarwal, Saurabh [1 ]
Arya, K.V. [1 ]
Meena, Yogesh Kumar [2 ]
机构
[1] Multimedia and Information Security Research Group, ABV-IIITM, Gwalior
[2] Human-AI Interaction (HAIx) Lab, IIT Gandhinagar
关键词
Chest X-ray image; Convolutional neural network (CNN); Disease classifications; Medical image processing; Multilayer fusion model; Multimodal fusion model;
D O I
10.1007/s00500-024-09901-x
中图分类号
学科分类号
摘要
Chest X-ray imaging is a critical diagnostic tool for identifying pulmonary diseases. However, manual interpretation of these images is time-consuming and error-prone. Automated systems utilizing convolutional neural networks (CNNs) have shown promise in improving the accuracy and efficiency of chest X-ray image classification. While previous work has mainly focused on using feature maps from the final convolution layer, there is a need to explore the benefits of leveraging additional layers for improved disease classification. Extracting robust features from limited medical image datasets remains a critical challenge. In this paper, we propose a novel deep learning-based multilayer multimodal fusion model that emphasizes extracting features from different layers and fusing them. Our disease detection model considers the discriminatory information captured by each layer. Furthermore, we propose the fusion of different-sized feature maps (FDSFM) module to effectively merge feature maps from diverse layers. The proposed model achieves a significantly higher accuracy of 97.21% and 99.60% for both three-class and two-class classifications, respectively. The proposed multilayer multimodal fusion model, along with the FDSFM module, holds promise for accurate disease classification and can also be extended to other disease classifications in chest X-ray images. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024.
引用
收藏
页码:11535 / 11551
页数:16
相关论文
共 62 条
  • [31] LeCun Y., Boser B., Denker J.S., Henderson D., Howard R.E., Hubbard W., Jackel L.D., Backpropagation applied to handwritten zip code recognition, Neural Comput, 1, 4, pp. 541-551, (1989)
  • [32] Li Z., Xu X., Cao X., Liu W., Zhang Y., Chen D., Dai H., Integrated cnn and federated learning for covid-19 detection on chest X-ray images, IEEE/ACM Trans Comput Biol Bioinform, (2022)
  • [33] Meena Y.K., Arya K., Multimodal interaction and iot applications, Multimed Tools Appl, 82, 4, pp. 4781-4785, (2023)
  • [34] Mohagheghi S., Alizadeh M., Safavi S.M., Foruzan A.H., Chen Y.-W., Integration of cnn, cbmir, and visualization techniques for diagnosis and quantification of covid-19 disease, IEEE J Biomed Health Inform, 25, 6, pp. 1873-1880, (2021)
  • [35] Monteral J.C., Covid-Chestxray Database, (2020)
  • [36] Mooney P., Chest X-Ray Images (Pneumonia, (2018)
  • [37] Muhammad G., Hossain M.S., Covid-19 and non-covid-19 classification using multi-layers fusion from lung ultrasound images, Inform Fus, 72, pp. 80-88, (2021)
  • [38] Narin A., Kaya C., Pamuk Z., Automatic detection of coronavirus disease (covid-19) using x-ray images and deep convolutional neural networks, Pattern Anal Appl, 24, 3, pp. 1207-1220, (2021)
  • [39] Ozturk T., Talo M., Yildirim E.A., Baloglu U.B., Yildirim O., Acharya U.R., Automated detection of covid-19 cases using deep neural networks with x-ray images, Comput Biol Med, 121, (2020)
  • [40] Pan S.J., Yang Q., A survey on transfer learning, IEEE Trans Knowl Data Eng, 22, 10, pp. 1345-1359, (2009)