Boundary-Preserving Mask R-CNN

被引：180

作者：

Cheng, Tianheng ^{[1
]}

Wang, Xinggang ^{[1
]}

Huang, Lichao ^{[2
]}

Liu, Wenyu ^{[1
]}

机构：

[1] Huazhong Univ Sci & Technol, Wuhan, Peoples R China

[2] Horizon Robot Inc, Beijing, Peoples R China

来源：

COMPUTER VISION - ECCV 2020, PT XIV | 2020年 / 12359卷

关键词：

Instance segmentation; Object detection; Boundary-preserving; Boundary detection;

D O I：

10.1007/978-3-030-58568-6_39

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Tremendous efforts have been made to improve mask localization accuracy in instance segmentation. Modern instance segmentation methods relying on fully convolutional networks perform pixel-wise classification, which ignores object boundaries and shap, leading coarse and indistinct mask prediction results and imprecise localization. To remedy these problems, we propose a conceptually simple yet effective Boundary-preserving Mask R-CNN (BMask R-CNN) to leverage object boundary information to improve mask localization accuracy. BMask R-CNN contains a boundary-preserving mask head in which object boundary and mask are mutually learned via feature fusion blocks. As a result, the predicted masks are better aligned with object boundaries. Without bells and whistles, BMask R-CNN outperforms Mask R-CNN by a considerable margin on the COCO dataset; in the Cityscapes dataset, there are more accurate boundary groundtruths available, so that BMask R-CNN obtains remarkable improvements over Mask R-CNN. Besides, it is not surprising to observe that BMask R-CNN obtains more obvious improvement when the evaluation criterion requires better localization (e.g.., AP75) as shown in Fig. 1. Code and models are available at https://github.com/hustvl/BMaskR-CNN.

引用

页码：660 / 676

页数：17

共 55 条

[1] Devil is in the Edges: Learning Semantic Boundaries from Noisy Annotations [J].

Acuna, David ;

Kar, Amlan ;

Fidler, Sanja .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11067-11075

[2] Pixelwise Instance Segmentation with a Dynamically Instantiated Network [J].

Arnab, Anurag ;

Torr, Philip H. S. .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :879-888

[3] Semantic Segmentation with Boundary Neural Fields [J].

Bertasius, Gedas ;

Shi, Jianbo ;

Torresani, Lorenzo .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3602-3610

[4]

Brabandere B.D., 2017, CoRR abs/1708.02551

[5] Cascade R-CNN: High Quality Object Detection and Instance Segmentation [J].

Cai, Zhaowei ;

Vasconcelos, Nuno .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) :1483-1498

[6] BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation [J].

Chen, Hao ;

Sun, Kunyang ;

Tian, Zhi ;

Shen, Chunhua ;

Huang, Yongming ;

Yan, Youliang .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :8570-8578

[7] Hybrid Task Cascade for Instance Segmentation [J].

Chen, Kai ;

Pang, Jiangmiao ;

Wang, Jiaqi ;

Xiong, Yu ;

Li, Xiaoxiao ;

Sun, Shuyang ;

Feng, Wansen ;

Liu, Ziwei ;

Shi, Jianping ;

Ouyang, Wanli ;

Loy, Chen Change ;

Lin, Dahua .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4969-4978

[8] MaskLab: Instance Segmentation by Refining Object Detection with Semantic and Direction Features [J].

Chen, Liang-Chieh ;

Hermans, Alexander ;

Papandreou, George ;

Schroff, Florian ;

Wang, Peng ;

Adam, Hartwig .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4013-4022

[9] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

[10] Semantic Image Segmentation with Task-Specific Edge Detection Using CNNs and a Discriminatively Trained Domain Transform [J].

Chen, Liang-Chieh ;

Barron, Jonathan T. ;

Papandreou, George ;

Murphy, Kevin ;

Yuille, Alan L. .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :4545-4554

← 1 2 3 4 5 6 →