Single-stage Instance Segmentation

Cited by: 3
Authors
Lin, Feng [1]
Li, Bin [2]
Zhou, Wengang [1]
Li, Houqiang [1]
Lu, Yan [2]
Affiliations
[1] University of Science and Technology of China, 96 JinZhai Rd, Hefei, People's Republic of China
[2] Microsoft Research Asia, 5 Dan Ling St, Beijing, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
Instance segmentation; neural networks; single stage; graph merge
DOI
10.1145/3387926
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Although the highest object detection accuracy is generally achieved by multi-stage detectors such as R-CNN and its extensions, single-stage object detectors also achieve remarkable performance with faster execution and higher scalability. Inspired by this, we propose a single-stage framework for the instance segmentation task. Building on an existing single-stage object detection network, our model simultaneously outputs the detected bounding box of each instance, the semantic segmentation result, and the pixel affinity. We then generate the final instance masks from these three outputs with a fast post-processing method. To the best of our knowledge, this is the first attempt to segment instances in a single-stage pipeline on challenging datasets. Extensive experiments demonstrate the efficiency of our post-processing method, and the proposed framework obtains competitive results as a single-stage instance segmentation method. We achieve 32.5 box AP and 26.0 mask AP on the COCO validation set with a 500-pixel input scale, and 22.9 mask AP on the Cityscapes test set.
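The abstract says that bounding boxes, a semantic segmentation map, and pixel affinities are combined by a post-processing step, but it does not spell out the procedure. The following is only a minimal sketch of one way such outputs could be merged, and it is not the authors' algorithm. Every name here (`merge_instances`, `aff_right`, `aff_down`), the 4-neighbor affinity layout, the 0.5 threshold, and the largest-component rule per box are assumptions made for illustration. Neighboring pixels of the same class are joined with union-find when their affinity is high, and each box then keeps the merged region that covers most of its pixels.

```python
# Hypothetical sketch, NOT the paper's method: names, the affinity layout,
# and the threshold are assumptions made for illustration only.
from collections import Counter


def find(parent, i):
    # Union-find lookup with path halving.
    while parent[i] != i:
        parent[i] = parent[parent[i]]
        i = parent[i]
    return i


def union(parent, a, b):
    ra, rb = find(parent, a), find(parent, b)
    if ra != rb:
        parent[rb] = ra


def merge_instances(semantic, aff_right, aff_down, boxes, threshold=0.5):
    """Return one set of (y, x) pixels per box.

    semantic:  H x W class ids, with 0 meaning background.
    aff_right: H x W scores linking (y, x) to (y, x + 1).
    aff_down:  H x W scores linking (y, x) to (y + 1, x).
    boxes:     list of (x0, y0, x1, y1, class_id), inclusive corners.
    """
    h, w = len(semantic), len(semantic[0])
    parent = list(range(h * w))

    # Step 1: merge same-class neighbors whose affinity passes the threshold.
    for y in range(h):
        for x in range(w):
            c = semantic[y][x]
            if c == 0:
                continue
            if x + 1 < w and semantic[y][x + 1] == c and aff_right[y][x] >= threshold:
                union(parent, y * w + x, y * w + x + 1)
            if y + 1 < h and semantic[y + 1][x] == c and aff_down[y][x] >= threshold:
                union(parent, y * w + x, (y + 1) * w + x)

    # Step 2: for each box, keep the merged region with the most pixels
    # of the box's class inside the box.
    masks = []
    for x0, y0, x1, y1, cls in boxes:
        votes = Counter()
        for y in range(y0, y1 + 1):
            for x in range(x0, x1 + 1):
                if semantic[y][x] == cls:
                    votes[find(parent, y * w + x)] += 1
        if not votes:
            masks.append(set())
            continue
        root = votes.most_common(1)[0][0]
        masks.append({
            (y, x)
            for y in range(y0, y1 + 1)
            for x in range(x0, x1 + 1)
            if semantic[y][x] == cls and find(parent, y * w + x) == root
        })
    return masks
```

In this sketch the affinity map is what separates two touching objects of the same class: a low score between adjacent pixels leaves them in different regions even when the semantic labels agree, and the box only selects which region to keep.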
Pages: 19