Volumetric memory network for interactive medical image segmentation

被引:57
作者
Zhou, Tianfei [1 ]
Li, Liulei [2 ]
Bredell, Gustav [1 ]
Li, Jianwu [2 ]
Unkelbach, Jan [3 ]
Konukoglu, Ender [1 ]
机构
[1] Swiss Fed Inst Technol, Comp Vis Lab, Zurich, Switzerland
[2] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing, Peoples R China
[3] Univ Hosp Zurich, Dept Radiat Oncol, Zurich, Switzerland
关键词
Interactive image segmentation; Memory-augmented network; Attention; fully convolutional network; Deep learning; RANDOM-WALKS;
D O I
10.1016/j.media.2022.102599
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite recent progress of automatic medical image segmentation techniques, fully automatic results usually fail to meet clinically acceptable accuracy, thus typically require further refinement. To this end, we propose a novel Volumetric Memory Network, dubbed as VMN, to enable segmentation of 3D medical images in an interactive manner. Provided by user hints on an arbitrary slice, a 2D interaction network is firstly employed to produce an initial 2D segmentation for the chosen slice. Then, the VMN propagates the initial segmentation mask bidirectionally to all slices of the entire volume. Subsequent refinement based on additional user guidance on other slices can be incorporated in the same manner. To facilitate smooth human-in-the-loop segmentation, a quality assessment module is introduced to suggest the next slice for interaction based on the segmentation quality of each slice produced in the previous round. Our VMN demonstrates two distinctive features: First, the memory-augmented network design offers our model the ability to quickly encode past segmentation information, which will be retrieved later for the segmentation of other slices; Second, the quality assessment module enables the model to directly estimate the quality of each segmentation prediction, which allows for an active learning paradigm where users preferentially label the lowest-quality slice for multi-round refinement. The proposed network leads to a robust interactive segmentation engine, which can generalize well to various types of user annotations (e.g., scribble, bounding box, extreme clicking). Extensive experiments have been conducted on three public medical image segmentation datasets (i.e., MSD, KiTS19, CVC-ClinicDB), and the results clearly confirm the superiority of our approach in comparison with state-of-the-art segmentation models. The code is made publicly available at https://github.com/0liliulei/Mem3D.
引用
收藏
页数:11
相关论文
共 71 条
  • [1] Interactive Full Image Segmentation by Considering All Regions Jointly
    Agustsson, Eirikur
    Uijlings, Jasper R. R.
    Ferrari, Vittorio
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11614 - 11623
  • [2] PHiSeg: Capturing Uncertainty in Medical Image Segmentation
    Baumgartner, Christian F.
    Tezcan, Kerem C.
    Chaitanya, Krishna
    Hotker, Andreas M.
    Muehlematter, Urs J.
    Schawkat, Khoschy
    Becker, Anton S.
    Donati, Olivio
    Konukoglu, Ender
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT II, 2019, 11765 : 119 - 127
  • [3] Baxter J.S., 2016, J MED IMAGING, V3
  • [4] WM-DOVA maps for accurate polyp highlighting in colonoscopy: Validation vs. saliency maps from physicians
    Bernal, Jorge
    Javier Sanchez, F.
    Fernandez-Esparrach, Gloria
    Gil, Debora
    Rodriguez, Cristina
    Vilarino, Fernando
    [J]. COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2015, 43 : 99 - 111
  • [5] Graph cuts and efficient N-D image segmentation
    Boykov, Yuri
    Funka-Lea, Gareth
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2006, 70 (02) : 109 - 131
  • [6] Boykov YY, 2001, EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL I, PROCEEDINGS, P105, DOI 10.1109/ICCV.2001.937505
  • [7] Iterative Interaction Training for Segmentation Editing Networks
    Bredell, Gustav
    Tanner, Christine
    Konukoglu, Ender
    [J]. MACHINE LEARNING IN MEDICAL IMAGING: 9TH INTERNATIONAL WORKSHOP, MLMI 2018, 2018, 11046 : 363 - 370
  • [8] Cao H., 2021, arXiv, DOI DOI 10.48550/ARXIV.2105.05537
  • [9] Annotating Object Instances with a Polygon-RNN
    Castrejon, Lluis
    Kundu, Kaustav
    Urtasun, Raquel
    Fidler, Sanja
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4485 - 4493
  • [10] Cascaded Pyramid Network for Multi-Person Pose Estimation
    Chen, Yilun
    Wang, Zhicheng
    Peng, Yuxiang
    Zhang, Zhiqiang
    Yu, Gang
    Sun, Jian
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7103 - 7112