Learning Distance Metrics from Probabilistic Information

被引:4
作者
Huai, Mengdi [1 ]
Miao, Chenglin [2 ]
Li, Yaliang [3 ]
Suo, Qiuling [2 ]
Su, Lu [2 ]
Zhang, Aidong [1 ]
机构
[1] Univ Virginia, Dept Comp Sci, 85 Engineers Way, Charlottesville, VA 22904 USA
[2] SUNY Buffalo, Dept Comp Sci & Engn, 338 Davis Hall, Buffalo, NY 14260 USA
[3] Alibaba Grp, 500 108th Ave NE,Suite 800, Bellevue, WA 98004 USA
基金
美国国家科学基金会;
关键词
Metric learning; distance measure; probabilistic labels; ONLINE;
D O I
10.1145/3364320
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The goal of metric learning is to learn a good distance metric that can capture the relationships among instances, and its importance has long been recognized in many fields. An implicit assumption in the traditional settings of metric learning is that the associated labels of the instances are deterministic. However, in many real-world applications, the associated labels come naturally with probabilities instead of deterministic values, which makes it difficult for the existing metric-learning methods to work well in these applications. To address this challenge, in this article, we study how to effectively learn the distance metric from datasets that contain probabilistic information, and then propose several novel metric-learning mechanisms for two types of probabilistic labels, i.e., the instance-wise probabilistic label and the group-wise probabilistic label. Compared with the existing metric-learning methods, our proposed mechanisms are capable of learning distance metrics directly from the probabilistic labels with high accuracy. We also theoretically analyze the proposed mechanisms and conduct extensive experiments on real-world datasets to verify the desirable properties of these mechanisms.
引用
收藏
页数:33
相关论文
共 54 条
[1]  
[Anonymous], 2016, ARXIV161006671
[2]  
[Anonymous], 2007, P 24 INT C MACH LEAR
[3]  
Baghshah MS, 2009, 21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, P1217
[4]  
Bar-Hillel AB, 2005, J MACH LEARN RES, V6, P937
[5]  
Bertsekas D. P., 1999, Nonlinear Programming
[6]   Generalization bounds for metric and similarity learning [J].
Cao, Qiong ;
Guo, Zheng-Chu ;
Ying, Yiming .
MACHINE LEARNING, 2016, 102 (01) :115-132
[7]  
Chapelle O, 2008, J MACH LEARN RES, V9, P203
[8]  
Duchi J, 2009, J MACH LEARN RES, V10, P2899
[9]  
Fish B, 2017, PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1675
[10]  
Guillaumin M, 2010, LECT NOTES COMPUT SC, V6311, P634, DOI 10.1007/978-3-642-15549-9_46