MaskLab: Instance Segmentation by Refining Object Detection with Semantic and Direction Features

Cited by: 257

Authors:

Chen, Liang-Chieh [1]

Hermans, Alexander [1, 2]

Papandreou, George [1]

Schroff, Florian [1]

Wang, Peng [1, 3]

Adam, Hartwig [1]

Affiliations:

[1] Google Inc, Menlo Pk, CA 94025 USA

[2] Rhein Westfal TH Aachen, Aachen, Germany

[3] Univ Calif Los Angeles, Los Angeles, CA 90024 USA

Source:

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018

DOI:

10.1109/CVPR.2018.00422

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

In this work, we tackle the problem of instance segmentation, the task of simultaneously solving object detection and semantic segmentation. Towards this goal, we present a model, called MaskLab, which produces three outputs: box detection, semantic segmentation, and direction prediction. Because MaskLab builds on top of the Faster-RCNN object detector, its predicted boxes provide accurate localization of object instances. Within each region of interest, MaskLab performs foreground/background segmentation by combining semantic and direction prediction. Semantic segmentation assists the model in distinguishing between objects of different semantic classes including background, while the direction prediction, estimating each pixel's direction towards its corresponding center, allows separating instances of the same semantic class. Moreover, we explore the effect of incorporating recent successful methods from both segmentation and detection (e.g., atrous convolution and hypercolumn). Our proposed model is evaluated on the COCO instance segmentation benchmark and shows comparable performance with other state-of-the-art models.
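The direction prediction described in the abstract can be illustrated with a minimal sketch of how per-pixel direction labels might be derived from a single instance mask. This is not the authors' implementation: the function name `direction_targets`, the use of the mask centroid as the instance center, and the choice of 8 direction bins are all assumptions made for illustration.

```python
import numpy as np


def direction_targets(instance_mask, num_bins=8):
    """Quantize each foreground pixel's direction toward the instance center.

    Hypothetical helper (not from the paper). Returns an integer array with
    the same shape as the mask: -1 for background pixels, otherwise a bin
    index in [0, num_bins).
    """
    mask = np.asarray(instance_mask, dtype=bool)
    targets = np.full(mask.shape, -1, dtype=np.int64)
    ys, xs = np.nonzero(mask)
    if ys.size == 0:
        return targets  # empty mask: everything is background

    # Assumption: the instance center is the centroid of the mask.
    cy, cx = ys.mean(), xs.mean()

    # Angle of the vector pointing from each pixel to the center, mapped to
    # [0, 2*pi). Angles are measured in image coordinates (y grows downward).
    angles = np.arctan2(cy - ys, cx - xs) % (2 * np.pi)

    # Split the full circle into num_bins equal sectors.
    bin_width = 2 * np.pi / num_bins
    bins = np.floor(angles / bin_width).astype(np.int64) % num_bins

    targets[ys, xs] = bins
    return targets


if __name__ == "__main__":
    mask = np.ones((5, 5), dtype=bool)
    mask[0, 0] = False
    print(direction_targets(mask))
```

Two touching instances of the same class receive different direction labels along their shared boundary, because their pixels point toward different centers. That is the property the abstract relies on to separate instances of the same semantic class.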

Citation

Pages: 4013-4022

Page count: 10

References: 81 in total
