PSTNet: Enhanced Polyp Segmentation With Multi-Scale Alignment and Frequency Domain Integration

被引:3
|
作者
Xu, Wenhao [1 ]
Xu, Rongtao [2 ,3 ]
Wang, Changwei [4 ,5 ,6 ]
Li, Xiuli [7 ]
Xu, Shibiao [1 ]
Guo, Li [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[2] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing, Peoples R China
[3] Mohamed Bin Zayed Univ Artificial Intelligence, Abu Dhabi, U Arab Emirates
[4] Qilu Univ Technol, Shandong Acad Sci, Key Lab Comp Power Network & Informat Secur, Minist Educ,Shandong Comp Sci Ctr,Natl Supercomp C, Jinan 250013, Peoples R China
[5] Shandong Fundamental Res Ctr Comp Sci, Shandong Prov Key Lab Comp Networks, Jinan 250013, Peoples R China
[6] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing 100876, Peoples R China
[7] Deepwise Healthcare, AI Lab, Beijing 100080, Peoples R China
基金
中国国家自然科学基金;
关键词
Image segmentation; Feature extraction; Transformers; Accuracy; Frequency-domain analysis; Location awareness; Colonoscopy; Polyp segmentation; shunted transformer; multi-scale fusion; VALIDATION;
D O I
10.1109/JBHI.2024.3421550
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accurate segmentation of colorectal polyps in colonoscopy images is crucial for effective diagnosis and management of colorectal cancer (CRC). However, current deep learning-based methods primarily rely on fusing RGB information across multiple scales, leading to limitations in accurately identifying polyps due to restricted RGB domain information and challenges in feature misalignment during multi-scale aggregation. To address these limitations, we propose the Polyp Segmentation Network with Shunted Transformer (PSTNet), a novel approach that integrates both RGB and frequency domain cues present in the images. PSTNet comprises three key modules: the Frequency Characterization Attention Module (FCAM) for extracting frequency cues and capturing polyp characteristics, the Feature Supplementary Alignment Module (FSAM) for aligning semantic information and reducing misalignment noise, and the Cross Perception localization Module (CPM) for synergizing frequency cues with high-level semantics to achieve efficient polyp segmentation. Extensive experiments on challenging datasets demonstrate PSTNet's significant improvement in polyp segmentation accuracy across various metrics, consistently outperforming state-of-the-art methods. The integration of frequency domain cues and the novel architectural design of PSTNet contribute to advancing computer-assisted polyp segmentation, facilitating more accurate diagnosis and management of CRC.
引用
收藏
页码:6042 / 6053
页数:12
相关论文
共 50 条
  • [21] MCSF-Net: a multi-scale channel spatial fusion network for real-time polyp segmentation
    Liu, Weikang
    Li, Zhigang
    Xia, Jiaao
    Li, Chunyang
    PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (17)
  • [22] MSRF-Net: A Multi-Scale Residual Fusion Network for Biomedical Image Segmentation
    Srivastava, Abhishek
    Jha, Debesh
    Chanda, Sukalpa
    Pal, Umapada
    Johansen, Havard
    Johansen, Dag
    Riegler, Michael
    Ali, Sharib
    Halvorsen, Pal
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (05) : 2252 - 2263
  • [23] Multi-Scale Alignment Domain Adaptation for Ship Classification in Multi-Resolution SAR Images
    Liu, Zhunga
    Li, Kun
    Wang, Longfei
    Zhang, Zuowei
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, : 4051 - 4062
  • [24] CafeNet : A Novel Multi-Scale Context Aggregation and Multi-Level Foreground Enhancement Network for Polyp Segmentation
    Ji, Zhanlin
    Li, Xiaoyu
    Wang, Zhiwu
    Zhang, Haiyang
    Yuan, Na
    Zhang, Xueji
    Ganchev, Ivan
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (05)
  • [25] MSSINet: Real-Time Segmentation Based on Multi-Scale Strip Integration
    Wang, Lin
    Zhu, Fenghua
    Zhang, Hui
    Xiong, Gang
    Huang, Yunhu
    Chen, Dewang
    IEEE JOURNAL OF RADIO FREQUENCY IDENTIFICATION, 2024, 8 : 241 - 251
  • [26] Multi-scale fusion dehazing network for high-frequency information alignment 
    Li, Peng-ze
    Li, Wan
    Zhang, Xuan-de
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2023, 38 (02) : 216 - 224
  • [27] Salient region detection using multi-scale analysis in the frequency domain
    Zhang Q.-R.
    Gu G.-C.
    Liu H.-B.
    Xiao H.-M.
    Harbin Gongcheng Daxue Xuebao/Journal of Harbin Engineering University, 2010, 31 (03): : 361 - 365
  • [28] 2MGAS-Net: multi-level multi-scale gated attentional squeezed network for polyp segmentation
    Bakkouri, Ibtissam
    Bakkouri, Siham
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (6-7) : 5377 - 5386
  • [29] HMDA: A Hybrid Model With Multi-Scale Deformable Attention for Medical Image Segmentation
    Wu, Mengmeng
    Liu, Tiantian
    Dai, Xin
    Ye, Chuyang
    Wu, Jinglong
    Funahashi, Shintaro
    Yan, Tianyi
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2025, 29 (02) : 1243 - 1255
  • [30] MSPMformer: The Fusion of Transformers and Multi-Scale Perception Modules Skin Lesion Segmentation Algorithm
    Yang, Guoliang
    Geng, Zhen
    Wang, Qianchen
    IEEE ACCESS, 2024, 12 : 128602 - 128617