TACT: Text attention based CNN-Transformer network for polyp segmentation

Cited by: 1
Authors
Zhao, Yiyang [1]
Li, Jinjiang [1,3]
Hua, Zhen [2]
Affiliations
[1] Shandong Technol & Business Univ, Sch Comp Sci & Technol, Yantai, Peoples R China
[2] Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai, Peoples R China
[3] Shandong Technol & Business Univ, Yantai 264005, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
CNN-Transformer; colonoscopy; medical image segmentation; polyp segmentation;
DOI
10.1002/ima.22997
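Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic and communication technology];
Discipline classification code
0808; 0809;
Abstract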
Colorectal cancer (CRC) has for many years ranked among the top three diseases worldwide by incidence. Preventing and treating CRC has therefore become a growing concern, and colonoscopy is the most effective method for detecting polyps. According to studies, 90% of CRC cases arise from adenomatous polyps of the large intestine. In clinical practice, the diversity of polyp size, number, and shape, together with the unclear boundary between polyps and colon folds, can reduce the operator's segmentation accuracy and lead to a higher rate of missed diagnoses. To address the inaccurate segmentation and high miss rate caused by these factors, we propose TACT, a text attention-based CNN-Transformer network for polyp segmentation, which processes images in a way that minimizes operator subjectivity and miss rate. The network builds on a CNN-Transformer structure, to which a fully attention progressive sampling module is added to delineate the polyp boundary more accurately. In addition, an auxiliary text classification task focuses on polyp size and number features in the form of text attention, which helps the network cope with polyps of different sizes and numbers. In comparisons with multiple state-of-the-art segmentation methods on four challenging datasets, TACT improves segmentation accuracy for polyps of different sizes across datasets.
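The abstract pairs a segmentation objective with an auxiliary text classification task but does not state how the two are combined. As a minimal sketch of one common way to do this, the snippet below adds a weighted classification loss to a soft Dice segmentation loss. The function names, the choice of Dice and cross-entropy, the `aux_weight` value, and the three size classes are all assumptions for illustration and are not taken from the paper.

```python
import math

def dice_loss(pred, target, eps=1e-6):
    """Soft Dice loss over flattened per-pixel probabilities and binary labels."""
    inter = sum(p * t for p, t in zip(pred, target))
    denom = sum(pred) + sum(target)
    return 1.0 - (2.0 * inter + eps) / (denom + eps)

def cross_entropy(logits, label):
    """Cross-entropy for one sample, computed from raw class logits
    with the log-sum-exp trick for numerical stability."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]

def multitask_loss(mask_pred, mask_true, text_logits, text_label, aux_weight=0.5):
    """Segmentation loss plus a weighted auxiliary classification loss.
    aux_weight is a hypothetical hyperparameter, not a value from the paper."""
    seg = dice_loss(mask_pred, mask_true)
    aux = cross_entropy(text_logits, text_label)
    return seg + aux_weight * aux

# Toy 2x2 mask, flattened to four pixels.
mask_pred = [0.9, 0.8, 0.1, 0.2]
mask_true = [1, 1, 0, 0]
# Hypothetical auxiliary classes: small / medium / large polyp.
text_logits = [2.0, 0.5, -1.0]
loss = multitask_loss(mask_pred, mask_true, text_logits, text_label=0)
print(round(loss, 4))
```

Setting `aux_weight` to 0 recovers the plain segmentation loss, which makes it easy to check how much the auxiliary task contributes in such a setup.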
Pages: 16