Few-Shot Object Detection and Viewpoint Estimation for Objects in the Wild

被引:168
作者
Xiao, Yang [1 ]
Marlet, Renaud [1 ,2 ]
机构
[1] Univ Gustave Eiffel, CNRS, LIGM, Ecole Ponts, Marne La Vallee, France
[2] Valeo Ai, Paris, France
来源
COMPUTER VISION - ECCV 2020, PT XVII | 2020年 / 12362卷
关键词
Few-shot learning; Meta learning; Object detection; Viewpoint estimation;
D O I
10.1007/978-3-030-58520-4_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting objects and estimating their viewpoint in images are key tasks of 3D scene understanding. Recent approaches have achieved excellent results on very large benchmarks for object detection and viewpoint estimation. However, performances are still lagging behind for novel object categories with few samples. In this paper, we tackle the problems of few-shot object detection and few-shot viewpoint estimation. We propose a meta-learning framework that can be applied to both tasks, possibly including 3D data. Our models improve the results on objects of novel classes by leveraging on rich feature information originating from base classes with many samples. A simple joint feature embedding module is proposed to make the most of this feature sharing. Despite its simplicity, our method outperforms state-of-the-art methods by a large margin on a range of datasets, including PASCAL VOC and MS COCO for few-shot object detection, and Pascal3D+ and ObjectNet3D for few-shot viewpoint estimation. And for the first time, we tackle the combination of both few-shot tasks, on ObjectNet3D, showing promising results.
引用
收藏
页码:192 / 210
页数:19
相关论文
共 66 条
[1]  
Ammirato P, 2019, Arxiv, DOI arXiv:1803.04610
[2]  
Andrychowicz M, 2016, ADV NEUR IN, V29
[3]  
Bansal A., ECCV
[4]  
Bertinetto L, 2016, ADV NEUR IN, V29
[5]   Weakly Supervised Deep Detection Networks [J].
Bilen, Hakan ;
Vedaldi, Andrea .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2846-2854
[6]   Weakly Supervised Cascaded Convolutional Networks [J].
Diba, Ali ;
Sharma, Vivek ;
Pazandeh, Ali ;
Pirsiavash, Hamed ;
Van Gool, Luc .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5131-5139
[7]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[8]   The PASCAL Visual Object Classes Challenge: A Retrospective [J].
Everingham, Mark ;
Eslami, S. M. Ali ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) :98-136
[9]  
Finn C, 2017, PR MACH LEARN RES, V70
[10]   Dynamic Few-Shot Visual Learning without Forgetting [J].
Gidaris, Spyros ;
Komodakis, Nikos .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4367-4375