Few-Shot Object Detection: A Survey

被引:59
作者
Antonelli, Simone [1 ]
Avola, Danilo [1 ]
Cinque, Luigi [1 ]
Crisostomi, Donato [1 ]
Foresti, Gian Luca [2 ]
Galasso, Fabio [1 ]
Marini, Marco Raoul [1 ]
Mecca, Alessio [1 ]
Pannone, Daniele [1 ]
机构
[1] Sapienza Univ Roma, Rome, Italy
[2] Univ Udine, Udine, Italy
关键词
Deep learning for few-shot object detection; dataset for object detection; benchmarks and metrics for object detection;
D O I
10.1145/3519022
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Deep learning approaches have recently raised the bar in many fields, from Natural Language Processing to Computer Vision, by leveraging large amounts of data. However, they could fail when the retrieved information is not enough to fit the vast number of parameters, frequently resulting in overfitting and therefore in poor generalizability. Few-Shot Learning aims at designing models that can effectively operate in a scarce data regime, yielding learning strategies that only need few supervised examples to be trained. These procedures are of both practical and theoretical importance, as they are crucial for many real-life scenarios in which data is either costly or even impossible to retrieve. Moreover, they bridge the distance between current data-hungry models and human-like generalization capability. Computer vision offers various tasks that can be few-shot inherent, such as person re-identification. This survey, which to the best of our knowledge is the first tackling this problem, is focused on Few-Shot Object Detection, which has received far less attention compared to Few-Shot Classification due to the intrinsic challenge level. In this regard, this review presents an extensive description of the approaches that have been tested in the current literature, discussing their pros and cons, and classifying them according to a rigorous taxonomy.
引用
收藏
页数:37
相关论文
共 67 条
[11]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[12]  
Everingham M., 2012, Results
[13]   Few-Shot Object Detection with Attention-RPN and Multi-Relation Detector [J].
Fan, Qi ;
Zhuo, Wei ;
Tang, Chi-Keung ;
Tai, Yu-Wing .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :4012-4021
[14]  
Ferrari V, 2009, PROC CVPR IEEE, P1, DOI 10.1109/CVPRW.2009.5206495
[15]  
Finn C, 2017, PR MACH LEARN RES, V70
[16]  
George M, 2014, LECT NOTES COMPUT SC, V8690, P440, DOI 10.1007/978-3-319-10605-2_29
[17]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[18]   LVIS: A Dataset for Large Vocabulary Instance Segmentation [J].
Gupta, Agrim ;
Dollar, Piotr ;
Girshick, Ross .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5351-5359
[19]  
Han G., 2021, PROC IEEECVF INT C C, P3263
[20]  
Hsieh T.-I., 2019, P 33 INT C NEURAL IN, V32, P1