Safe Triplet Screening for Distance Metric Learning

被引:5
作者
Yoshida, Tomoki [1 ]
Takeuchi, Ichiro [1 ,2 ,3 ]
Karasuyama, Masayuki [1 ,2 ,4 ]
机构
[1] Nagoya Inst Technol, Showa Ku, Gokiso Cho, Nagoya, Aichi 4668555, Japan
[2] Natl Inst Mat Sci, Sengen, Tsukuba, Ibaraki 3050047, Japan
[3] RIKEN Ctr Adv Intelligence Project, Chuo Ku, Tokyo 1030012, Japan
[4] Japan Sci & Technol Agcy, Honcho, Kawaguchi, Saitama 3320012, Japan
基金
日本科学技术振兴机构;
关键词
DUAL APPROACH;
D O I
10.1162/neco_a_01240
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Distance metric learning has been widely used to obtain the optimal distance function based on the given training data. We focus on a triplet-based loss function, which imposes a penalty such that a pair of instances in the same class is closer than a pair in different classes. However, the number of possible triplets can be quite large even for a small data set, and this considerably increases the computational cost for metric optimization. In this letter, we propose safe triplet screening that identifies triplets that can be safely removed from the optimization problem without losing the optimality. In comparison with existing safe screening studies, triplet screening is particularly significant because of the huge number of possible triplets and the semidefinite constraint in the optimization problem. We demonstrate and verify the effectiveness of our screening rules by using several benchmark data sets.
引用
收藏
页码:2432 / 2491
页数:60
相关论文
共 44 条
[1]  
[Anonymous], 2016, ARXIV160706996
[2]  
[Anonymous], 1999, NONLINEAR PROGRAMMIN
[3]  
[Anonymous], 2010, ARXIV10094219
[4]  
[Anonymous], 2014, ARXIV14106880
[5]  
[Anonymous], 2009, P ADV NEUR INF PROC
[6]  
[Anonymous], ARXIV161204853
[7]  
[Anonymous], PROC CVPR IEEE
[8]  
[Anonymous], 2015, NIPS 2015 WORKSH OPT
[9]   2-POINT STEP SIZE GRADIENT METHODS [J].
BARZILAI, J ;
BORWEIN, JM .
IMA JOURNAL OF NUMERICAL ANALYSIS, 1988, 8 (01) :141-148
[10]   Least-squares covariance matrix adjustment [J].
Boyd, S ;
Xiao, L .
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2005, 27 (02) :532-546