PSTNet: Enhanced Polyp Segmentation With Multi-Scale Alignment and Frequency Domain Integration

被引:3
|
作者
Xu, Wenhao [1 ]
Xu, Rongtao [2 ,3 ]
Wang, Changwei [4 ,5 ,6 ]
Li, Xiuli [7 ]
Xu, Shibiao [1 ]
Guo, Li [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[2] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing, Peoples R China
[3] Mohamed Bin Zayed Univ Artificial Intelligence, Abu Dhabi, U Arab Emirates
[4] Qilu Univ Technol, Shandong Acad Sci, Key Lab Comp Power Network & Informat Secur, Minist Educ,Shandong Comp Sci Ctr,Natl Supercomp C, Jinan 250013, Peoples R China
[5] Shandong Fundamental Res Ctr Comp Sci, Shandong Prov Key Lab Comp Networks, Jinan 250013, Peoples R China
[6] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing 100876, Peoples R China
[7] Deepwise Healthcare, AI Lab, Beijing 100080, Peoples R China
基金
中国国家自然科学基金;
关键词
Image segmentation; Feature extraction; Transformers; Accuracy; Frequency-domain analysis; Location awareness; Colonoscopy; Polyp segmentation; shunted transformer; multi-scale fusion; VALIDATION;
D O I
10.1109/JBHI.2024.3421550
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accurate segmentation of colorectal polyps in colonoscopy images is crucial for effective diagnosis and management of colorectal cancer (CRC). However, current deep learning-based methods primarily rely on fusing RGB information across multiple scales, leading to limitations in accurately identifying polyps due to restricted RGB domain information and challenges in feature misalignment during multi-scale aggregation. To address these limitations, we propose the Polyp Segmentation Network with Shunted Transformer (PSTNet), a novel approach that integrates both RGB and frequency domain cues present in the images. PSTNet comprises three key modules: the Frequency Characterization Attention Module (FCAM) for extracting frequency cues and capturing polyp characteristics, the Feature Supplementary Alignment Module (FSAM) for aligning semantic information and reducing misalignment noise, and the Cross Perception localization Module (CPM) for synergizing frequency cues with high-level semantics to achieve efficient polyp segmentation. Extensive experiments on challenging datasets demonstrate PSTNet's significant improvement in polyp segmentation accuracy across various metrics, consistently outperforming state-of-the-art methods. The integration of frequency domain cues and the novel architectural design of PSTNet contribute to advancing computer-assisted polyp segmentation, facilitating more accurate diagnosis and management of CRC.
引用
收藏
页码:6042 / 6053
页数:12
相关论文
共 50 条
  • [41] META-Unet: Multi-Scale Efficient Transformer Attention Unet for Fast and High-Accuracy Polyp Segmentation
    Wu, Huisi
    Zhao, Zebin
    Wang, Zhaoze
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (03) : 4117 - 4128
  • [42] Lung segmentation based on random forest and multi-scale edge detection
    Liu, Caixia
    Zhao, Ruibin
    Pang, Mingyong
    IET IMAGE PROCESSING, 2019, 13 (10) : 1745 - 1754
  • [43] MMW-YOLOv5: A Multi-Scale Enhanced Traffic Sign Detection Algorithm
    Wang, Tong
    Zhang, Juwei
    Ren, Bingyi
    Liu, Bo
    IEEE ACCESS, 2024, 12 : 148880 - 148892
  • [44] An Attention Guided Multi-scale Network with Channel-Enhanced Transformer for Coronary Arteries Segmentation
    Yang, Jinzhong
    Hong, Peng
    Xu, Bu
    Chen, Yaojun
    Xu, Lisheng
    Peng, Chengbao
    Sun, Yu
    Yang, Benqiang
    12TH ASIAN-PACIFIC CONFERENCE ON MEDICAL AND BIOLOGICAL ENGINEERING, VOL 1, APCMBE 2023, 2024, 103 : 157 - 167
  • [45] GMAlignNet: multi-scale lightweight brain tumor image segmentation with enhanced semantic information consistency
    Song, Jianli
    Lu, Xiaoqi
    Gu, Yu
    PHYSICS IN MEDICINE AND BIOLOGY, 2024, 69 (11)
  • [46] Unsupervised segmentation of noisy image in a multi-scale framework
    Zhang, YB
    Ma, S
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 905 - 909
  • [47] MicroSeg: Multi-scale fusion learning for microaneurysms segmentation
    Wu, Yun
    Jiao, Ge
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 37
  • [48] Contour segmentation algorithm of multi-scale GVF snake
    Jin, Li
    Xu Junhong
    Wang, Cong
    Zhou Lulu
    Hong, Yu
    Hong, Liang
    IEEE ICMA 2006: PROCEEDING OF THE 2006 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS 1-3, PROCEEDINGS, 2006, : 537 - +
  • [49] LMDF-Net: Layered Multi-Scale Dual-Branch Dual-Temporal Fusion Network for Medical Image Segmentation
    Ye, Siyuan
    Wei, Yan
    IEEE ACCESS, 2024, 12 : 183078 - 183088
  • [50] Multi-Scale Contourlet Knowledge Guide Learning Segmentation
    Liu, Mengkun
    Jiao, Licheng
    Liu, Xu
    Li, Lingling
    Liu, Fang
    Yang, Shuyuan
    Wang, Shuang
    Hou, Biao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 4831 - 4845