MCIF-Transformer Mask RCNN: Multi-Branch Cross-Scale Interactive Feature Fusion Transformer Model for PET/CT Lung Tumor Instance Segmentation

被引:0
作者
Lu, Huiling [1 ]
Zhou, Tao [2 ,3 ]
机构
[1] Ningxia Med Univ, Sch Med Informat & Engn, Yinchuan 750004, Peoples R China
[2] North Minzu Univ, Sch Comp Sci & Engn, Yinchuan 750021, Peoples R China
[3] North Minzu Univ, State Ethn Affairs Commiss, Key Lab Image & Graph Intelligent Proc, Yinchuan 750021, Peoples R China
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2024年 / 79卷 / 03期
基金
中国国家自然科学基金;
关键词
PET/CT images; instance segmentation; mask RCNN; interactive fusion; transformer;
D O I
10.32604/cmc.2024.047827
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The precise detection and segmentation of tumor lesions are very important for lung cancer computer-aided diagnosis. However, in PET/CT (Positron Emission Tomography/Computed Tomography) lung images, the lesion shapes are complex, the edges are blurred, and the sample numbers are unbalanced. To solve these problems, this paper proposes a Multi-branch Cross-scale Interactive Feature fusion Transformer model (MCIF-Transformer Mask RCNN) for PET/CT lung tumor instance segmentation, The main innovative works of this paper are as follows: Firstly, the ResNet-Transformer backbone network is used to extract global feature and local feature in lung images. The pixel dependence relationship is established in local and non-local fields to improve the model perception ability. Secondly, the Cross-scale Interactive Feature Enhancement auxiliary network is designed to provide the shallow features to the deep features, and the cross-scale interactive feature enhancement module (CIFEM) is used to enhance the attention ability of the fine-grained features. Thirdly, the Cross-scale Interactive Feature fusion FPN network (CIF-FPN) is constructed to realize bidirectional interactive fusion between deep features and shallow features, and the low-level features are enhanced in deep semantic features. Finally, 4 ablation experiments, 3 comparison experiments of detection, 3 comparison experiments of segmentation and 6 comparison experiments with two-stage and single-stage instance segmentation networks are done on PET/CT lung medical image datasets. The results showed that APdet, APseg, ARdet and ARseg indexes are improved by 5.5%, 5.15%, 3.11% and 6.79% compared with Mask RCNN (resnet50). Based on the above research, the precise detection and segmentation of the lesion region are realized in this paper. This method has positive significance for the detection of lung tumors.
引用
收藏
页码:4371 / 4393
页数:23
相关论文
共 26 条
  • [1] Bai Shuhua, 2022, Cognitive Robotics, V2, P193
  • [2] YOLACT Real-time Instance Segmentation
    Bolya, Daniel
    Zhou, Chong
    Xiao, Fanyi
    Lee, Yong Jae
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9156 - 9165
  • [3] Cascade R-CNN: High Quality Object Detection and Instance Segmentation
    Cai, Zhaowei
    Vasconcelos, Nuno
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) : 1483 - 1498
  • [4] A survey on instance segmentation: state of the art
    Hafiz, Abdul Mueed
    Bhat, Ghulam Mohiuddin
    [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2020, 9 (03) : 171 - 189
  • [5] An adaptive morphology based segmentation technique for lung nodule detection in thoracic CT image
    Halder, Amitava
    Chatterjee, Saptarshi
    Dey, Debangshu
    Kole, Surajit
    Munshi, Sugata
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2020, 197
  • [6] Unsupervised supervoxel-based lung tumor segmentation across patient scans in hybrid PET/MRI
    Hansen, Stine
    Kuttner, Samuel
    Kampffmeyer, Michael
    Markussen, Tom-Vegard
    Sundset, Rune
    Oen, Silje Kjaernes
    Eikenes, Live
    Jenssen, Robert
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 167
  • [7] Haoyu Zhou, 2019, 2019 IEEE 4th International Conference on Image, Vision and Computing (ICIVC), P55, DOI 10.1109/ICIVC47709.2019.8980993
  • [8] He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
  • [9] Homayounfar Namdar, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12368), P555, DOI 10.1007/978-3-030-58592-1_33
  • [10] Mask Scoring R-CNN
    Huang, Zhaojin
    Huang, Lichao
    Gong, Yongchao
    Huang, Chang
    Wang, Xinggang
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 6402 - 6411