Scalable Large-Margin Distance Metric Learning Using Stochastic Gradient Descent

被引：19

作者：

Bac Nguyen ^{[1
]}

Morell, Carlos ^{[2
]}

De Baets, Bernard ^{[1
]}

机构：

[1] Univ Ghent, Dept Data Anal & Math Modeling, B-9000 Ghent, Belgium

[2] Univ Cent Marta Abreu Las Villas, Comp Sci Dept, Santa Clara 54830, Cuba

来源：

IEEE TRANSACTIONS ON CYBERNETICS | 2020年 / 50卷 / 03期

关键词：

Large-margin nearest neighbor; metric learning; positive semidefinite (PSD) matrix; stochastic gradient descent (SGD); EIGENVALUE; SIMILARITY;

D O I：

10.1109/TCYB.2018.2881417

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The key to success of many machine learning and pattern recognition algorithms is the way of computing distances between the input data. In this paper, we propose a large-margin-based approach, called the large-margin distance metric learning (LMDML), for learning a Mahalanobis distance metric. LMDML employs the principle of margin maximization to learn the distance metric with the goal of improving ${k}$ -nearest-neighbor classification. The main challenge of distance metric learning is the positive semidefiniteness constraint on the Mahalanobis matrix. Semidefinite programming is commonly used to enforce this constraint, but it becomes computationally intractable on large-scale data sets. To overcome this limitation, we develop an efficient algorithm based on a stochastic gradient descent. Our algorithm can avoid the computations of the full gradient and ensure that the learned matrix remains within the positive semidefinite cone after each iteration. Extensive experiments show that the proposed algorithm is scalable to large data sets and outperforms other state-of-the-art distance metric learning approaches regarding classification accuracy and training time.

引用

页码：1072 / 1083

页数：12

共 64 条

[1]

[Anonymous], 2006, P NIPS

[2]

[Anonymous], 1990, P 3 DARPA SPEECH NAT

[3]

[Anonymous], 2014, Convex Optimiza- tion

[4]

[Anonymous], 2004, ADV NEURAL INFORM PR, DOI DOI 10.1109/TCSVT.2013.2242640

[5]

[Anonymous], LOW RANK SPARSE MODE

[6]

[Anonymous], 2000, Pattern Classification, DOI DOI 10.1007/978-3-319-57027-3_4

[7]

[Anonymous], P 18 INT C ART INT S

[8]

[Anonymous], 2002, P ADV NEURAL INF PRO

[9]

[Anonymous], 2010, ICML

[10]

[Anonymous], 2011, Acm T. Intel. Syst. Tec., DOI DOI 10.1145/1961189.1961199

← 1 2 3 4 5 6 7 →