RepMet: Representative-based metric learning for classification and few-shot object detection

被引：273

作者：

Karlinsky, Leonid ^{[1
]}

Shtok, Joseph ^{[1
]}

Harary, Sivan ^{[1
]}

Schwartz, Eli ^{[1
]}

Aides, Amit ^{[1
]}

Feris, Rogerio ^{[1
]}

Giryes, Raja ^{[2
]}

Bronstein, Alex M. ^{[3
]}

机构：

[1] IBM Res AI, Yorktown Hts, NY 10598 USA

[2] Tel Aviv Univ, Tel Aviv, Israel

[3] Technion, Haifa, Israel

来源：

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年

基金：

欧盟地平线“2020”;

关键词：

D O I：

10.1109/CVPR.2019.00534

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Distance metric learning (DML) has been successfully applied to object classification,both in the standard regime of rich training data and in the few-shot scenario, where each category is represented by only a few examples. In this work, we propose a new method for DML that simultaneously learns the backbone network parameters,the embedding space, and the multi-modal distribution of each of the training categories in that space, in a single end-to-end training process. Our approach outperforms state-of-the-art methods for DML-based object classification on a variety of standard fine-grained datasets. Furthermore, we demonstrate the effectiveness of our approach on the problem of few-shot object detection, by incorporating the proposed DML architecture as a classification head into a standard object detection model. We achieve the best results on the ImageNet-LOC dataset compared to strong baselines, when only a few training examples are available. We also offer the community a new episodic benchmark based on the ImageNet dataset for the few-shot object detection task.

引用

页码：5192 / 5201

页数：10

共 45 条

[1]

[Anonymous], ARXIV180405298V2

[2]

[Anonymous], 2016, CoRR abs/1512.00567, DOI DOI 10.1109/CVPR.2016.308

[3]

[Anonymous], ARXIV180404340

[4]

[Anonymous], 2017, ARSENIC EXPOSURE INT

[5]

Bendale A, 2015, PROC CVPR IEEE, P1893, DOI 10.1109/CVPR.2015.7298799

[6] Soft-NMS - Improving Object Detection With One Line of Code [J].

Bodla, Navaneeth ;

Singh, Bharat ;

Chellappa, Rama ;

Davis, Larry S. .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5562-5570

[7]

Chen H, 2018, AAAI CONF ARTIF INTE, P2836

[8] Deformable Convolutional Networks [J].

Dai, Jifeng ;

Qi, Haozhi ;

Xiong, Yuwen ;

Li, Yi ;

Zhang, Guodong ;

Hu, Han ;

Wei, Yichen .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773

[9]

Finn C, 2017, PR MACH LEARN RES, V70

[10]

Girshick Ross, 2015, WEAKLY SUPERVISED ON, P1

← 1 2 3 4 5 →