A deep learning based dual encoder-decoder framework for anatomical structure segmentation in chest X-ray images

被引:22
作者
Ullah, Ihsan [1 ]
Ali, Farman [2 ,6 ]
Shah, Babar [3 ]
El-Sappagh, Shaker [4 ,5 ]
Abuhmed, Tamer
Park, Sang Hyun [1 ]
机构
[1] Daegu Gyeonbuk Inst Sci & Engn DGIST, Dept Robot & Mechatron Engn, Daegu 42988, South Korea
[2] Sungkyunkwan Univ, Sch Convergence, Dept Comp Sci & Engn, Coll Comp & Informat, Seoul 03063, South Korea
[3] Zayed Univ, Coll Technol Innovat, Dubai 19282, U Arab Emirates
[4] Galala Univ, Fac Comp Sci & Engn, Suez 435611, Egypt
[5] Benha Univ, Fac Comp & Artificial Intelligence, Informat Syst Dept, Banha 13518, Egypt
[6] Sungkyunkwan Univ, Dept Comp Sci & Engn, Coll Comp & Informat, Suwon 16419, South Korea
基金
新加坡国家研究基金会;
关键词
COMPUTER-AIDED DIAGNOSIS; CONVOLUTIONAL NEURAL-NETWORKS; LUNG SEGMENTATION; AUTOMATED SEGMENTATION; RADIOGRAPHS; REGIONS; FIELD; SHAPE; IDENTIFICATION;
D O I
10.1038/s41598-023-27815-w
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Automated multi-organ segmentation plays an essential part in the computer-aided diagnostic (CAD) of chest X-ray fluoroscopy. However, developing a CAD system for the anatomical structure segmentation remains challenging due to several indistinct structures, variations in the anatomical structure shape among different individuals, the presence of medical tools, such as pacemakers and catheters, and various artifacts in the chest radiographic images. In this paper, we propose a robust deep learning segmentation framework for the anatomical structure in chest radiographs that utilizes a dual encoder-decoder convolutional neural network (CNN). The first network in the dual encoder-decoder structure effectively utilizes a pre- trained VGG19 as an encoder for the segmentation task. The pre-trained encoder output is fed into the squeeze-and-excitation (SE) to boost the network's representation power, which enables it to perform dynamic channel-wise feature calibrations. The calibrated features are efficiently passed into the first decoder to generate the mask. We integrated the generated mask with the input image and passed it through a second encoder-decoder network with the recurrent residual blocks and an attention the gate module to capture the additional contextual features and improve the segmentation of the smaller regions. Three public chest X-ray datasets are used to evaluate the proposed method for multi-organs segmentation, such as the heart, lungs, and clavicles, and single-organ segmentation, which include only lungs. The results from the experiment show that our proposed technique outperformed the existing multi-class and single-class segmentation methods.
引用
收藏
页数:14
相关论文
共 65 条
  • [61] Instance Segmentation of Anatomical Structures in Chest Radiographs
    Wang, Jie
    Li, Zhigang
    Jiang, Rui
    Xie, Zhen
    [J]. 2019 IEEE 32ND INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2019, : 441 - 446
  • [62] Image feature analysis for computer-aided diagnosis: Detection of right and left hemidiaphragm edges and delineation of lung field in chest radiographs
    Xu, HW
    Doi, K
    [J]. MEDICAL PHYSICS, 1996, 23 (09) : 1613 - 1624
  • [63] Lung Field Segmentation in Chest Radiographs From Boundary Maps by a Structured Edge Detector
    Yang, Wei
    Liu, Yunbi
    Lin, Liyan
    Yun, Zhaoqiang
    Lu, Zhentai
    Feng, Qianjin
    Chen, Wufan
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2018, 22 (03) : 842 - 851
  • [64] Pyramid Scene Parsing Network
    Zhao, Hengshuang
    Shi, Jianping
    Qi, Xiaojuan
    Wang, Xiaogang
    Jia, Jiaya
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6230 - 6239
  • [65] Zhenghao Shi, 2009, Proceedings of the 2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2009), P428, DOI 10.1109/FSKD.2009.811