Distance Metric Learning for Large Margin Nearest Neighbor Classification

被引:0
|
作者
Weinberger, Kilian Q. [1 ]
Saul, Lawrence K. [2 ]
机构
[1] Yahoo Res, Santa Clara, CA USA
[2] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
基金
美国国家科学基金会;
关键词
convex optimization; semi-definite programming; Mahalanobis distance; metric learning; multi-class classification; support vector machines;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The accuracy of k-nearest neighbor (kNN) classification depends significantly on the metric used to compute distances between different examples. In this paper, we show how to learn a Mahalanobis distance metric for kNN classification from labeled examples. The Mahalanobis metric can equivalently be viewed as a global linear transformation of the input space that precedes kNN classification using Euclidean distances. In our approach, the metric is trained with the goal that the k-nearest neighbors always belong to the same class while examples from different classes are separated by a large margin. As in support vector machines (SVMs), the margin criterion leads to a convex optimization based on the hinge loss. Unlike learning in SVMs, however, our approach requires no modification or extension for problems in multiway (as opposed to binary) classification. In our framework, the Mahalanobis distance metric is obtained as the solution to a semidefinite program. On several data sets of varying size and difficulty, we find that metrics trained in this way lead to significant improvements in kNN classification. Sometimes these results can be further improved by clustering the training examples and learning an individual metric within each cluster. We show how to learn and combine these local metrics in a globally integrated manner.
引用
收藏
页码:207 / 244
页数:38
相关论文
共 50 条
  • [31] Mixture correntropy-based robust distance metric learning for classification
    Yuan, Chao
    Zhou, Changsheng
    Peng, Jigen
    Li, Haiyang
    KNOWLEDGE-BASED SYSTEMS, 2024, 295
  • [32] SEMI-SUPERVISED DISTANCE METRIC LEARNING FOR VISUAL OBJECT CLASSIFICATION
    Cevikalp, Hakan
    Paredes, Roberto
    VISAPP 2009: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 1, 2009, : 315 - +
  • [33] A Proposal of Extended Cosine Measure for Distance Metric Learning in Text Classification
    Mikawa, Kenta
    Ishida, Takashi
    Goto, Masayuki
    2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 1741 - 1746
  • [34] Large margin nearest local mean classifier
    Chai, Jing
    Liu, Hongwei
    Chen, Bo
    Bao, Zheng
    SIGNAL PROCESSING, 2010, 90 (01) : 236 - 248
  • [35] A large margin time series nearest neighbour classification under locally weighted time warps
    Yuan, Jidong
    Douzal-Chouakria, Ahlame
    Yazdi, Saeed Varasteh
    Wang, Zhihai
    KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 59 (01) : 117 - 135
  • [36] Large margin metric learning based vehicle re-identification method
    Zhang S.-L.
    Ma S.-M.
    Gu Z.-Q.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2021, 55 (05): : 948 - 956
  • [37] A large margin time series nearest neighbour classification under locally weighted time warps
    Jidong Yuan
    Ahlame Douzal-Chouakria
    Saeed Varasteh Yazdi
    Zhihai Wang
    Knowledge and Information Systems, 2019, 59 : 117 - 135
  • [38] Safe Triplet Screening for Distance Metric Learning
    Yoshida, Tomoki
    Takeuchi, Ichiro
    Karasuyama, Masayuki
    KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 2653 - 2662
  • [39] Distance metric learning for graph structured data
    Tomoki Yoshida
    Ichiro Takeuchi
    Masayuki Karasuyama
    Machine Learning, 2021, 110 : 1765 - 1811
  • [40] Distance metric learning for graph structured data
    Yoshida, Tomoki
    Takeuchi, Ichiro
    Karasuyama, Masayuki
    MACHINE LEARNING, 2021, 110 (07) : 1765 - 1811