Meta R-CNN : Towards General Solver for Instance-level Low-shot Learning

被引：350

作者：

Yan, Xiaopeng ^{[1
]}

Chen, Ziliang ^{[1
]}

Xu, Anni ^{[1
]}

Wang, Xiaoxi ^{[1
]}

Liang, Xiaodan ^{[1
,2
]}

Lin, Liang ^{[1
,2
]}

机构：

[1] Sun Yat Sen Univ, Guangzhou, Peoples R China

[2] DarkMatter AI Res, Abu Dhabi, U Arab Emirates

来源：

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年

基金：

中国国家自然科学基金;

关键词：

D O I：

10.1109/ICCV.2019.00967

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Resembling the rapid learning capability of human, low-shot learning empowers vision systems to understand new concepts by training with few samples. Leading approaches derived from meta-learning on images with a single visual object. Obfuscated by a complex background and multiple objects in one image, they are hard to promote the research of low-shot object detection/segmentation. In this work, we present a flexible and general methodology to achieve these tasks. Our work extends Faster/Mask R-CNN by proposing meta-learning over RoI (Region-of-Interest) features instead of a full image feature. This simple spirit disentangles multi-object information merged with the background, without bells and whistles, enabling Faster/Mask R-CNN turn into a meta-learner to achieve the tasks. Specifically, we introduce a Predictor-head Remodeling Network (PRN) that shares its main backbone with Faster/Mask R-CNN. PRN receives images containing low-shot objects with their bounding boxes or masks to infer their class attentive vectors. The vectors take channel-wise soft-attention on RoI features, remodeling those R-CNN predictor heads to detect or segment the objects consistent with the classes these vectors represent. In our experiments, Meta R-CNN yields the new state of the art in low-shot object detection and improves low-shot object segmentation by Mask R-CNN. Code: https://yanxp.github.io/metarcnn.html.

引用

页码：9576 / 9585

页数：10

共 48 条

[1] Amit R., 2017, ARXIV171101244
[2] [Anonymous], 2017, ADV NEURAL INFORM PR
[3] [Anonymous], LEARNING SEGMENT EVE
[4] [Anonymous], 2016, ARXIV
[5] Arnab A., 2016, ARXIV160902583, DOI 10.5244/C.30.19
[6] Bertinetto L., 2016, ADV NEURAL INFORM PR, P523, DOI DOI 10.48550/ARXIV.1606.05233
[7] Effects of overexpression of jasmonic acid biosynthesis genes on nicotine accumulation in tobacco
Chen, Hongxia
Wang, Bingwu
Geng, Sisi
Arellano, Consuelo
Chen, Sixue
Qu, Rongda
[J]. PLANT DIRECT, 2018, 2 (01)
[8] Halide-free synthesis of Au nanoplates and monitoring the shape evolution process through a marker experiment
Chen, Lei
Hu, Huicheng
Liu, Qipeng
Ji, Fei
Chen, Suli
Xu, Yong
Zhang, Qiao
[J]. JOURNAL OF MATERIALS CHEMISTRY C, 2016, 4 (27) : 6457 - 6460
[9] MaskLab: Instance Segmentation by Refining Object Detection with Semantic and Direction Features
Chen, Liang-Chieh
Hermans, Alexander
Papandreou, George
Schroff, Florian
Wang, Peng
Adam, Hartwig
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4013 - 4022
[10] Dai J, 2016, PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), P1796, DOI 10.1109/ICIT.2016.7475036

← 1 2 3 4 5 →