Enhanced Spatial Awareness for Deep Interactive Image Segmentation

被引:1
作者
Li, Haochen [1 ]
Ni, Jinlong [1 ]
Li, Zhicheng [1 ]
Qian, Yuxiang [1 ]
Wang, Tao [1 ,2 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Nanjing Univ Sci & Technol, Jiangsu Key Lab Spectral Imaging & Intelligent Se, Nanjing 210094, Peoples R China
来源
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2022, PT IV | 2022年 / 13537卷
基金
中国国家自然科学基金;
关键词
Interactive image segmentation; Spatial awareness; Multi task learning; POINTS;
D O I
10.1007/978-3-031-18916-6_40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing deep interactive segmentation approaches can extract the desired object for the user based on simple click interaction. However, the first click provided by the user on the full image space domain is generally too local to capture the global target object, which causes them to rely on a large number of subsequent click corrections for satisfactory results. This paper explores how to strengthen the spatial awareness of user interaction especially after the first click input and increase the stability during the continuous iterative correction process. We first design an interactive cascaded localization strategy to determine the spatial range of the potential target, and then integrate this space-aware prior into a dual-stream network structure as a soft constraint for the segmentation. The above operation can increase the network's attention to the target of interest under very limited user interaction. A new training and inference strategy is also developed to completely adapt the benefit from the space-aware guidance. Furthermore, an object shape related loss is designed to better supervise the network based on user-provided prior guidance. Explicit subject, controllable correction and flexible interaction can help to significantly boost the interactive segmentation performance. The proposed method achieves state-of-the-art performance on several popular benchmarks.
引用
收藏
页码:490 / 505
页数:16
相关论文
共 40 条
[1]   Error-tolerant Scribbles Based Interactive Image Segmentation [J].
Bai, Junjie ;
Wu, Xiaodong .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :392-399
[2]   Large-scale interactive object segmentation with human annotators [J].
Benenson, Rodrigo ;
Popov, Stefan ;
Ferrari, Vittorio .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11692-11701
[3]  
Boykov YY, 2001, EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL I, PROCEEDINGS, P105, DOI 10.1109/ICCV.2001.937505
[4]  
Chen LC, 2016, Arxiv, DOI arXiv:1412.7062
[5]  
Chen LC, 2017, Arxiv, DOI [arXiv:1706.05587, DOI 10.48550/ARXIV.1706.05587]
[6]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[7]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[8]   Conditional Diffusion for Interactive Segmentation [J].
Chen, Xi ;
Zhao, Zhiyan ;
Yu, Feiwu ;
Zhang, Yilei ;
Duan, Manni .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :7325-7334
[9]   UCP-Net: Unstructured Contour Points for Instance Segmentation [J].
Dupont, Camille ;
Ouakrim, Yanis ;
Pham, Quoc Cuong .
2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, :3373-3379
[10]  
Forte M, 2020, Arxiv, DOI arXiv:2003.07932