Panoptic Feature Pyramid Networks

被引：1045

作者：

Kirillov, Alexander ^{[1
]}

Girshick, Ross ^{[1
]}

He, Kaiming ^{[1
]}

Dollar, Piotr ^{[1
]}

机构：

[1] Facebook AI Res FAIR, Paris, France

来源：

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年

关键词：

D O I：

10.1109/CVPR.2019.00656

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The recently introduced panoptic segmentation task has renewed our community's interest in unifying the tasks of instance segmentation (for thing classes) and semantic segmentation (for stuff classes). However, current state-of-the-art methods for this joint task use separate and dissimilar networks for instance and semantic segmentation, without performing any shared computation. In this work, we aim to unify these methods at the architectural level, designing a single network for both tasks. Our approach is to endow Mask R-CNN, a popular instance segmentation method, with a semantic segmentation branch using a shared Feature Pyramid Network (FPN) backbone. Surprisingly, this simple baseline not only remains effective for instance segmentation, but also yields a lightweight, top-performing method for semantic segmentation. In this work, we perform a detailed study of this minimally extended version of Mask R-CNN with FPN, which we refer to as Panoptic FPN, and show it is a robust and accurate baseline for both tasks. Given its effectiveness and conceptual simplicity, we hope our method can serve as a strong baseline and aid future research in panoptic segmentation.

引用

页码：6392 / 6401

页数：10

共 60 条

[41] Seeing through the Human Reporting Bias: Visual Classifiers from Noisy Human-Centric Labels [J].

Misra, Ishan ;

Zitnick, C. Lawrence ;

Mitchell, Margaret ;

Girshick, Ross .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2930-2939

[42] Stacked Hourglass Networks for Human Pose Estimation [J].

Newell, Alejandro ;

Yang, Kaiyu ;

Deng, Jia .

COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 :483-499

[43]

Pham V.-Q., 2017, P IEEE INT C FUZZ SY

[44] Learning to Refine Object Segments [J].

Pinheiro, Pedro O. ;

Lin, Tsung-Yi ;

Collobert, Ronan ;

Dollar, Piotr .

COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :75-91

[45] Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks [J].

Ren, Shaoqing ;

He, Kaiming ;

Girshick, Ross ;

Sun, Jian .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) :1137-1149

[46] U-Net: Convolutional Networks for Biomedical Image Segmentation [J].

Ronneberger, Olaf ;

Fischer, Philipp ;

Brox, Thomas .

MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 :234-241

[47] ImageNet Large Scale Visual Recognition Challenge [J].

Russakovsky, Olga ;

Deng, Jia ;

Su, Hao ;

Krause, Jonathan ;

Satheesh, Sanjeev ;

Ma, Sean ;

Huang, Zhiheng ;

Karpathy, Andrej ;

Khosla, Aditya ;

Bernstein, Michael ;

Berg, Alexander C. ;

Fei-Fei, Li .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 115 (03) :211-252

[48] Scene Parsing with Object Instances and Occlusion Ordering [J].

Tighe, Joseph ;

Niethammer, Marc ;

Lazebnik, Svetlana .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3748-3755

[49] Image parsing: Unifying segmentation, detection, and recognition [J].

Tu, ZW ;

Chen, XG ;

Yuille, AL ;

Zhu, SC .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2005, 63 (02) :113-140

[50]

Vijay B., 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence, V39, P2481, DOI DOI 10.1109/TPAMI.2016.2644615

← 1 2 3 4 5 6 →