Focal Modulation Based End-to-End Multi-Label Classification for Chest X-ray Image Classification

被引:2
作者
Ozturk, Saban [1 ,2 ,3 ]
Cukur, Tolga [1 ]
机构
[1] Bilkent Univ, Elekt & Elekt Muhendisligi Bolumu, Ankara, Turkiye
[2] Amasya Univ, Elekt & Elekt Muhendisligi Bolumu, Amasya, Turkiye
[3] Bilkent Univ, Ulusal Manyet Rezonans Arastirma Merkezi UMRAM, Ankara, Turkiye
来源
2023 31ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU | 2023年
关键词
chest x-ray; deep learning; focal modulation networks; multi-label classification; CANCER STATISTICS;
D O I
10.1109/SIU59756.2023.10223975
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Chest X-ray imaging is of critical importance in order to effectively diagnose chest diseases, which are increasing today due to various environmental and hereditary factors. Although chest X-ray is the most commonly used device for detecting pathological abnormalities, it can be quite challenging for specialists due to misleading locations and sizes of pathological abnormalities, visual similarities, and complex backgrounds. Traditional deep learning (DL) architectures fall short due to relatively small areas of pathological abnormalities and similarities between diseased and healthy areas. In addition, DL structures with standard classification approaches are not ideal for dealing with problems involving multiple diseases. In order to overcome the aforementioned problems, firstly, background-independent feature maps were created using a conventional convolutional neural network (CNN). Then, the relationships between objects in the feature maps are made suitable for multi-label classification tasks using the focal modulation network (FMA), an innovative attention module that is more effective than the self-attention approach. Experiments using a Chest x-ray dataset containing both single and multiple labels for a total of 14 different diseases show that the proposed approach can provide superior performance for multi-label datasets.
引用
收藏
页数:4
相关论文
共 23 条
  • [1] Bingzhi Chen, 2019, Pattern Recognition and Computer Vision. Second Chinese Conference, PRCV 2019. Proceedings. Lecture Notes in Computer Science (LNCS 11858), P682, DOI 10.1007/978-3-030-31723-2_58
  • [2] Lesion Location Attention Guided Network for Multi-Label Thoracic Disease Classification in Chest X-Rays
    Chen, Bingzhi
    Li, Jinxing
    Lu, Guangming
    Zhang, David
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (07) : 2016 - 2027
  • [3] Thorax Disease Classification Based on Pyramidal Convolution Shuffle Attention Neural Network
    Chen, Kai
    Wang, Xuqi
    Zhang, Shanwen
    [J]. IEEE ACCESS, 2022, 10 : 85571 - 85581
  • [4] ResViT: Residual Vision Transformers for Multimodal Medical Image Synthesis
    Dalmaz, Onat
    Yurt, Mahmut
    Cukur, Tolga
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (10) : 2598 - 2614
  • [5] Dar SU, 2023, Arxiv, DOI arXiv:2301.02613
  • [6] Federated Learning of Generative Image Priors for MRI Reconstruction
    Elmas, Gokberk
    Dar, Salman U. H.
    Korkmaz, Yilmaz
    Ceyani, Emir
    Susam, Burak
    Ozbey, Muzaffer
    Avestimehr, Salman
    Cukur, Tolga
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (07) : 1996 - 2009
  • [7] Güngor A, 2023, Arxiv, DOI [arXiv:2207.05876, DOI 10.48550/ARXIV.2207.05876]
  • [8] Weakly supervised thoracic disease localization via disease masks
    Jung, Hong-Gyu
    Nam, Woo-Jeoung
    Kim, Hyun-Woo
    Lee, Seong-Whan
    [J]. NEUROCOMPUTING, 2023, 517 (34-43) : 34 - 43
  • [9] Unsupervised MRI Reconstruction via Zero-Shot Learned Adversarial Transformers
    Korkmaz, Yilmaz
    Dar, Salman U. H.
    Yurt, Mahmut
    Ozbey, Muzaffer
    Cukur, Tolga
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (07) : 1747 - 1763
  • [10] Kruk ME, 2018, LANCET, V392, P2203, DOI 10.1016/s0140-6736(18)31668-4