Learning from Noisy Similar and Dissimilar Data

被引:3
作者
Dan, Soham [1 ]
Bao, Han [2 ,3 ]
Sugiyama, Masashi [2 ,3 ]
机构
[1] Univ Penn, Philadelphia, PA 19104 USA
[2] RIKEN, Ctr Adv Intelligence Project, Tokyo, Japan
[3] Univ Tokyo, Tokyo, Japan
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT II | 2021年 / 12976卷
关键词
Classification; Pairwise supervision; Noisy supervision;
D O I
10.1007/978-3-030-86520-7_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the widespread use of machine learning for classification, it becomes increasingly important to be able to use weaker kinds of supervision for tasks in which it is hard to obtain standard labeled data. One such kind of supervision is provided pairwise in the form of Similar (S) pairs (if two examples belong to the same class) and Dissimilar (D) pairs (if two examples belong to different classes). This kind of supervision is realistic in privacy-sensitive domains. Although the basic version of this problem has been studied recently, it is still unclear how to learn from such supervision under label noise, which is very common when the supervision is, for instance, crowd-sourced. In this paper, we close this gap and demonstrate how to learn a classifier from noisy S and D labeled pairs. We perform a detailed investigation of this problem under two realistic noise models and propose two algorithms to learn from noisy SD data. We also show important connections between learning from such pairwise supervision data and learning from ordinary class-labeled data. Finally, we perform experiments on synthetic and real-world datasets and show our noise-informed algorithms outperform existing baselines in learning from noisy pairwise data.
引用
收藏
页码:233 / 249
页数:17
相关论文
共 26 条
[1]  
Bao H, 2018, PR MACH LEARN RES, V80
[2]  
Bartlett P. L., 2003, Journal of Machine Learning Research, V3, P463, DOI 10.1162/153244303321897690
[3]   Convexity, classification, and risk bounds [J].
Bartlett, PL ;
Jordan, MI ;
McAuliffe, JD .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (473) :138-156
[4]   Local Rademacher complexities [J].
Bartlett, PL ;
Bousquet, O ;
Mendelson, S .
ANNALS OF STATISTICS, 2005, 33 (04) :1497-1537
[5]  
Basu S, 2009, CH CRC DATA MIN KNOW, P1
[6]  
Brochu E., 2007, Advances in Neural Information Processing Systems, P409
[7]  
Du S. S., 2018, INT C LEARNING REPRE
[8]  
Elkan C, 2001, INT JOINT C ARTIFICI, V2, P973
[9]   SOCIAL DESIRABILITY BIAS AND THE VALIDITY OF INDIRECT QUESTIONING [J].
FISHER, RJ .
JOURNAL OF CONSUMER RESEARCH, 1993, 20 (02) :303-315
[10]  
Fürnkranz J, 2010, PREFERENCE LEARNING, P65, DOI 10.1007/978-3-642-14125-6_4