Meta R-CNN : Towards General Solver for Instance-level Low-shot Learning

被引:350
作者
Yan, Xiaopeng [1 ]
Chen, Ziliang [1 ]
Xu, Anni [1 ]
Wang, Xiaoxi [1 ]
Liang, Xiaodan [1 ,2 ]
Lin, Liang [1 ,2 ]
机构
[1] Sun Yat Sen Univ, Guangzhou, Peoples R China
[2] DarkMatter AI Res, Abu Dhabi, U Arab Emirates
来源
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICCV.2019.00967
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Resembling the rapid learning capability of human, low-shot learning empowers vision systems to understand new concepts by training with few samples. Leading approaches derived from meta-learning on images with a single visual object. Obfuscated by a complex background and multiple objects in one image, they are hard to promote the research of low-shot object detection/segmentation. In this work, we present a flexible and general methodology to achieve these tasks. Our work extends Faster/Mask R-CNN by proposing meta-learning over RoI (Region-of-Interest) features instead of a full image feature. This simple spirit disentangles multi-object information merged with the background, without bells and whistles, enabling Faster/Mask R-CNN turn into a meta-learner to achieve the tasks. Specifically, we introduce a Predictor-head Remodeling Network (PRN) that shares its main backbone with Faster/Mask R-CNN. PRN receives images containing low-shot objects with their bounding boxes or masks to infer their class attentive vectors. The vectors take channel-wise soft-attention on RoI features, remodeling those R-CNN predictor heads to detect or segment the objects consistent with the classes these vectors represent. In our experiments, Meta R-CNN yields the new state of the art in low-shot object detection and improves low-shot object segmentation by Mask R-CNN. Code: https://yanxp.github.io/metarcnn.html.
引用
收藏
页码:9576 / 9585
页数:10
相关论文
共 48 条
  • [1] Amit R., 2017, ARXIV171101244
  • [2] [Anonymous], 2017, ADV NEURAL INFORM PR
  • [3] [Anonymous], LEARNING SEGMENT EVE
  • [4] [Anonymous], 2016, ARXIV
  • [5] Arnab A., 2016, ARXIV160902583, DOI 10.5244/C.30.19
  • [6] Bertinetto L., 2016, ADV NEURAL INFORM PR, P523, DOI DOI 10.48550/ARXIV.1606.05233
  • [7] Effects of overexpression of jasmonic acid biosynthesis genes on nicotine accumulation in tobacco
    Chen, Hongxia
    Wang, Bingwu
    Geng, Sisi
    Arellano, Consuelo
    Chen, Sixue
    Qu, Rongda
    [J]. PLANT DIRECT, 2018, 2 (01)
  • [8] Halide-free synthesis of Au nanoplates and monitoring the shape evolution process through a marker experiment
    Chen, Lei
    Hu, Huicheng
    Liu, Qipeng
    Ji, Fei
    Chen, Suli
    Xu, Yong
    Zhang, Qiao
    [J]. JOURNAL OF MATERIALS CHEMISTRY C, 2016, 4 (27) : 6457 - 6460
  • [9] MaskLab: Instance Segmentation by Refining Object Detection with Semantic and Direction Features
    Chen, Liang-Chieh
    Hermans, Alexander
    Papandreou, George
    Schroff, Florian
    Wang, Peng
    Adam, Hartwig
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4013 - 4022
  • [10] Dai J, 2016, PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), P1796, DOI 10.1109/ICIT.2016.7475036