Few-shot medical image segmentation with high-fidelity prototypes

被引:6
作者
Tang, Song [1 ,2 ]
Yan, Shaxu [1 ]
Qi, Xiaozhi [3 ]
Gao, Jianxin [1 ]
Ye, Mao [4 ]
Zhang, Jianwei [2 ]
Zhu, Xiatian [5 ,6 ]
机构
[1] Univ Shanghai Sci & Technol, Sch Hlth Sci & Engn, IMI Grp, Shanghai, Peoples R China
[2] Univ Hamburg, Dept Informat, TAMS Grp, Hamburg, Germany
[3] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China
[4] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu, Peoples R China
[5] Univ Surrey, Surrey Inst People Centred Artificial Intelligence, Guildford, England
[6] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford, England
基金
中国国家自然科学基金;
关键词
Few-shot semantic segmentation; Medical image; High-fidelity prototype; Detail self-refining;
D O I
10.1016/j.media.2024.103412
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot Semantic Segmentation (FSS) aims to adapt a pretrained model to new classes with as few as a single labeled training sample per class. Despite the prototype based approaches have achieved substantial success, existing models are limited to the imaging scenarios with considerably distinct objects and not highly complex background, e.g., natural images. This makes such models suboptimal for medical imaging with both conditions invalid. To address this problem, we propose a novel D etail S elf-refined P rototype Net work ( DSPNet ) to construct high-fidelity prototypes representing the object foreground and the background more comprehensively. Specifically, to construct global semantics while maintaining the captured detail semantics, we learn the foreground prototypes by modeling the multimodal structures with clustering and then fusing each in a channel-wise manner. Considering that the background often has no apparent semantic relation in the spatial dimensions, we integrate channel-specific structural information under sparse channel-aware regulation. Extensive experiments on three challenging medical image benchmarks show the superiority of DSPNet over previous state-of-the-art methods. The code and data are available at https://github.com/tntek/DSPNet.
引用
收藏
页数:12
相关论文
共 50 条
[1]   SLIC Superpixels Compared to State-of-the-Art Superpixel Methods [J].
Achanta, Radhakrishna ;
Shaji, Appu ;
Smith, Kevin ;
Lucchi, Aurelien ;
Fua, Pascal ;
Suesstrunk, Sabine .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) :2274-2281
[2]   UniverSeg: Universal Medical Image Segmentation [J].
Butoi, Victor Ion ;
Ortiz, Jose Javier Gonzalez ;
Ma, Tianyu ;
Sabuncu, Mert R. ;
Guttag, John ;
Dalca, Adrian V. .
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, :21381-21394
[3]   Few-Shot Medical Image Segmentation via Generating Multiple Representative Descriptors [J].
Cheng, Ziming ;
Wang, Shidong ;
Xin, Tong ;
Zhou, Tao ;
Zhang, Haofeng ;
Shao, Ling .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2024, 43 (06) :2202-2214
[4]  
Child R, 2019, Arxiv, DOI [arXiv:1904.10509, 10.48550/arXiv.1904.10509]
[5]   Few-shot Medical Image Segmentation with Cycle-resemblance Attention [J].
Ding, Hao ;
Sun, Changchang ;
Tang, Hao ;
Cai, Dawen ;
Yan, Yan .
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, :2487-2496
[6]   Self-regularized prototypical network for few-shot semantic segmentation [J].
Ding, Henghui ;
Zhang, Hui ;
Jiang, Xudong .
PATTERN RECOGNITION, 2023, 133
[7]   Self-support Few-Shot Semantic Segmentation [J].
Fan, Qi ;
Pei, Wenjie ;
Tai, Yu-Wing ;
Tang, Chi-Keung .
COMPUTER VISION, ECCV 2022, PT XIX, 2022, 13679 :701-719
[8]   Efficient graph-based image segmentation [J].
Felzenszwalb, PF ;
Huttenlocher, DP .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 59 (02) :167-181
[9]   Interactive Few-Shot Learning: Limited Supervision, Better Medical Image Segmentation [J].
Feng, Ruiwei ;
Zheng, Xiangshang ;
Gao, Tianxiang ;
Chen, Jintai ;
Wang, Wenzhe ;
Chen, Danny Z. ;
Wu, Jian .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (10) :2575-2588
[10]   Dual Attention Network for Scene Segmentation [J].
Fu, Jun ;
Liu, Jing ;
Tian, Haijie ;
Li, Yong ;
Bao, Yongjun ;
Fang, Zhiwei ;
Lu, Hanqing .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3141-3149