Attention-based adaptive context network for anchor-free instance segmentation

被引:0
作者
Tong Zhang
Guoshan Zhang
Min Yan
Yueming Zhang
机构
[1] Tianjin University,School of Electrical and Information Engineering
来源
International Journal of Machine Learning and Cybernetics | 2023年 / 14卷
关键词
Instance segmentation; MACG-mask branch; Weighted FPN; ContextMask;
D O I
暂无
中图分类号
学科分类号
摘要
It is crucial to obtain accurate and efficient instance segmentation masks in many modern applications such as automatic pilot and robotic manipulation. In this paper, we propose a straightforward and flexible two-stage framework for instance segmentation, which simultaneously generates box-level localization information in an image and instance-level segmentation information for each instance. We name this framework as Attention-based Adaptive Context Network for anchor-free Instance Segmentation (ContextMask), which extends the object detector FCOS (Fully Convolutional One-stage Object Detection) by adding a novel multi-scale adaptive context-guided mask (MACG-Mask) branch containing an adaptive context network and a MaskIoU branch. The adaptive context network is to combine the global context in predicted bounding boxes and the MaskIoU branch is to evaluate the quality of the predicted masks. With the development of deep convolutional neural networks, the network continues to deepen so that it is difficult to balance spatial information and semantic information well. To address the issue, we design a weighted FPN, which obtains feature maps with balance-well spatial and semantic information by concatenating and weighting feature maps of different resolutions. Besides, we also propose an attention-based head, which adds spatial attention and channel attention module to make each pixel have a unique weight to solve the problem of large-scale variant of objects. We verify ContextMask’s effectiveness on the fine-annotations Cityscapes and COCO dataset. ContextMask outperforms state-of-the-art methods and achieves 38.4%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$38.4\%$$\end{document}AP on the Cityscapes dataset and 39.0%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\%$$\end{document}AP on the COCO dataset.
引用
收藏
页码:537 / 549
页数:12
相关论文
共 48 条
[1]  
Bolya D(2020)Yolact++: better real-time instance segmentation IEEE Trans Pattern Anal Mach Intell PP 1-3166
[2]  
Zhou C(2019)Enhance the recognition ability to occlusions and small objects with robust faster r-cnn Int J Mach Learn Cybern 9 3155-848
[3]  
Xiao FY(2018)Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs IEEE Trans Pattern Anal Mach Intell 40 834-16
[4]  
Lee Y(2018)An end-to-end differential network learning method for semantic segmentation Int J Mach Learn Cybern 10 1-397
[5]  
Zhou T(2019)See, feel, act: hierarchical learning for complex manipulation skills with multisensory fusion Sci Robot 4 eaav3123-1149
[6]  
Li Z(2017)Mask r-cnn IEEE Trans Pattern Anal Mach Intell 42 386-2023
[7]  
Zhang C(2017)Faster r-cnn: towards real-time object detection with region proposal networks IEEE Trans Pattern Anal Mach Intell 39 1137-1642
[8]  
Chen LC(2017)Squeeze-and-excitation networks IEEE Trans Pattern Anal Mach Intell 42 2011-3007
[9]  
Papandreou G(2019)Weight-sharing multi-stage multi-scale ensemble convolutional neural network Int J Mach Learn Cybern 10 1631-455
[10]  
Kokkinos I(2017)Focal loss for dense object detection IEEE Trans Pattern Anal Mach Intell 99 2999-undefined