Divide and Conquer the Embedding Space for Metric Learning

Cited by: 85

Authors:

Sanakoyeu, Artsiom [1]

Tschernezki, Vadim [1]

Buechler, Uta [1]

Ommer, Bjoern [1]

Affiliations:

[1] Heidelberg Univ, IWR, Heidelberg Collaboratory Image Proc, Heidelberg, Germany

Source:

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019

Keywords:

DOI:

10.1109/CVPR.2019.00056

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Learning the embedding space, where semantically similar objects are located close together and dissimilar objects far apart, is a cornerstone of many computer vision applications. Existing approaches usually learn a single metric in the embedding space for all available data points, which may have a very complex non-uniform distribution with different notions of similarity between objects, e.g., appearance, shape, color or semantic meaning. Approaches for learning a single distance metric often struggle to encode all different types of relationships and do not generalize well. In this work, we propose a novel easy-to-implement divide and conquer approach for deep metric learning, which significantly improves the state-of-the-art performance of metric learning. Our approach utilizes the embedding space more efficiently by jointly splitting the embedding space and data into K smaller sub-problems. It divides both the data and the embedding space into K subsets and learns K separate distance metrics in the non-overlapping subspaces of the embedding space, defined by groups of neurons in the embedding layer of the neural network. The proposed approach increases the convergence speed and improves generalization since the complexity of each sub-problem is reduced compared to the original one. We show that our approach outperforms the state-of-the-art by a large margin in retrieval, clustering and re-identification tasks on CUB200-2011, CARS196, Stanford Online Products, Inshop Clothes and PKU VehicleID datasets.
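The abstract describes K distance metrics living in non-overlapping subspaces of the embedding, each defined by a group of neurons in the embedding layer. The following is a minimal sketch of that subspace split only, under assumptions the abstract does not state: the groups are taken to be contiguous and near-equal in size, and plain Euclidean distance stands in for a learned metric. The helper names `subspace_slices` and `subspace_distance` are hypothetical, and the sketch does not cover how the data is divided into K subsets or how the network is trained.

```python
import math


def subspace_slices(dim, k):
    """Split indices 0..dim-1 into k contiguous, non-overlapping slices.

    Assumption: contiguous groups of near-equal size (illustrative only).
    """
    if k <= 0 or k > dim:
        raise ValueError("k must be between 1 and dim")
    base, extra = divmod(dim, k)
    slices, start = [], 0
    for i in range(k):
        size = base + (1 if i < extra else 0)  # spread any remainder
        slices.append(slice(start, start + size))
        start += size
    return slices


def subspace_distance(x, y, sl):
    """Euclidean distance using only the coordinates selected by slice sl."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x[sl], y[sl])))


# Toy example: a 6-dimensional embedding split into K = 3 subspaces.
slices = subspace_slices(6, 3)
x = [1.0, 0.0, 2.0, 2.0, 0.0, 0.0]
y = [0.0, 0.0, 2.0, 5.0, 4.0, 0.0]
dists = [subspace_distance(x, y, s) for s in slices]
print(dists)
```

Because the slices do not overlap and together cover every dimension, the squared per-subspace distances in this toy example add up to the squared distance over the full embedding.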

Citation

Pages: 471-480

Page count: 10

References (53 in total):

[21] Mining on Manifolds: Metric Learning without Labels [J].

Iscen, Ahmet; Tolias, Giorgos; Avrithis, Yannis; Chum, Ondrej.

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7642-7651

[22] Product Quantization for Nearest Neighbor Search [J].

Jegou, Herve; Douze, Matthijs; Schmid, Cordelia.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (01) :117-128

[23] Billion-Scale Similarity Search with GPUs [J].

Johnson, Jeff; Douze, Matthijs; Jegou, Herve.

IEEE TRANSACTIONS ON BIG DATA, 2021, 7 (03) :535-547

[24] Kingma DP, 2014, ARXIV

[25] 3D Object Representations for Fine-Grained Categorization [J].

Krause, Jonathan; Stark, Michael; Deng, Jia; Li, Fei-Fei.

2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, :554-561

[26] LI Z, 2018, IEEE T PATTERN ANAL

[27] Weakly Supervised Deep Metric Learning for Community-Contributed Image Retrieval [J].

Li, Zechao; Tang, Jinhui.

IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (11) :1989-1999

[28] Deep Variational Metric Learning [J].

Lin, Xudong; Duan, Yueqi; Dong, Qiyuan; Lu, Jiwen; Zhou, Jie.

COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 :714-729

[29] Liu H, 2007, INT J PRECIS ENG MAN, V8, P8

[30] Liu Siqi, 2016, P IEEE C COMP VIS PA
