HSNet: A hybrid semantic network for polyp segmentation

Cited by: 78
Authors
Zhang, Wenchao [1]
Fu, Chong [1, 2, 3]
Zheng, Yu [4]
Zhang, Fangyuan [5]
Zhao, Yanli [6]
Sham, Chiu-Wing [7]
Affiliations
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110819, Peoples R China
[2] Minist Educ, Engn Res Ctr Secur Technol Complex Network Syst, Shenyang, Peoples R China
[3] Northeastern Univ, Key Lab Intelligent Comp Med Image, Minist Educ, Shenyang 110819, Peoples R China
[4] Chinese Univ Hong Kong, Dept Informat Engn, Sha Tin, Hong Kong, Peoples R China
[5] China Med Univ, Dept Gen Surg, Shengjing Hosp, Shenyang, Peoples R China
[6] Ningxia Inst Sci & Technol, Sch Elect Informat Engn, Shizuishan 753000, Peoples R China
[7] Univ Auckland, Sch Comp Sci, Auckland, New Zealand
Keywords
Polyp segmentation; Hybrid semantic; Dual-branch; Long-range dependencies; Local details
DOI
10.1016/j.compbiomed.2022.106173
Chinese Library Classification number
Q [Biological sciences]
Subject classification codes
07; 0710; 09
Abstract
Automatic polyp segmentation helps physicians locate polyps (i.e., regions of interest) in clinical practice by screening colonoscopy images with the assistance of neural networks (NN). However, two significant bottlenecks limit its effectiveness and fall short of physicians' expectations. (1) Polyps vary in scale, orientation, and illumination, which makes accurate segmentation difficult. (2) Current works built on the dominant encoder-decoder network tend to overlook appearance details (e.g., textures) of tiny polyps, which degrades the accuracy of differentiating polyps. To alleviate these bottlenecks, we investigate a hybrid semantic network (HSNet) that combines the advantages of Transformers and convolutional neural networks (CNN) to improve polyp segmentation. HSNet contains a cross-semantic attention module (CSA), a hybrid semantic complementary module (HSC), and a multi-scale prediction module (MSP). Unlike previous works on polyp segmentation, we introduce the CSA module, which bridges the gap between low-level and high-level features via an interactive mechanism that exchanges two types of semantics from different NN attentions. Using a dual-branch structure of Transformer and CNN, we design the HSC module to capture both long-range dependencies and local appearance details. In addition, the MSP module learns weights for fusing the stage-level prediction masks of the decoder. Experimentally, we compare our work with 10 state-of-the-art works, including both recent and classical ones, and show improved accuracy (on 7 evaluation metrics) over 5 benchmark datasets: HSNet achieves 0.926/0.877 mDice/mIoU on Kvasir-SEG, 0.948/0.905 on ClinicDB, 0.810/0.735 on ColonDB, 0.808/0.74 on ETIS, and 0.903/0.839 on Endoscene. The proposed model is available at https://github.com/baiboat/HSNet.
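The abstract reports results as mean Dice and mean IoU scores. As a reading aid, the following is a minimal sketch of how these two overlap metrics are commonly computed for binary masks. It is not taken from the HSNet repository: the function names `dice_and_iou` and `mean_scores`, the nested-list mask format, the smoothing term `eps`, and the per-image averaging are all assumptions made for illustration, and the paper's own evaluation protocol may differ.

```python
def dice_and_iou(pred, target, eps=1e-8):
    """Return (Dice, IoU) for two equally sized binary masks.

    Masks are nested lists of 0/1 values (an assumed format for this sketch).
    eps is a small smoothing term so that two empty masks score 1.0.
    """
    flat_pred = [p for row in pred for p in row]
    flat_target = [t for row in target for t in row]
    if len(flat_pred) != len(flat_target):
        raise ValueError("masks must have the same number of pixels")

    # Count pixels that are foreground in both masks, and in each mask alone.
    intersection = sum(1 for p, t in zip(flat_pred, flat_target) if p and t)
    pred_area = sum(1 for p in flat_pred if p)
    target_area = sum(1 for t in flat_target if t)
    union = pred_area + target_area - intersection

    dice = (2 * intersection + eps) / (pred_area + target_area + eps)
    iou = (intersection + eps) / (union + eps)
    return dice, iou


def mean_scores(pairs):
    """Average Dice and IoU over (pred, target) pairs, one pair per image."""
    scores = [dice_and_iou(pred, target) for pred, target in pairs]
    n = len(scores)
    mean_dice = sum(d for d, _ in scores) / n
    mean_iou = sum(i for _, i in scores) / n
    return mean_dice, mean_iou


if __name__ == "__main__":
    # Toy 2x3 masks: 2 overlapping pixels, 3 foreground pixels in each mask.
    pred = [[1, 1, 0], [0, 1, 0]]
    target = [[1, 0, 0], [0, 1, 1]]
    print(dice_and_iou(pred, target))
```

For the toy masks, the intersection is 2 pixels and each mask has 3 foreground pixels, so Dice is 2·2/(3+3) ≈ 0.667 and IoU is 2/4 = 0.5. Dice is never lower than IoU for the same pair of masks, which matches the pattern in the scores reported in the abstract.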
Pages: 10