A Regularization Approach for Instance-Based Superset Label Learning

被引:117
作者
Gong, Chen [1 ,2 ]
Liu, Tongliang [3 ]
Tang, Yuanyan [4 ,5 ]
Yang, Jian [1 ]
Yang, Jie
Tao, Dacheng [6 ,7 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Jiangsu, Peoples R China
[2] Shanghai Jiao Tong Univ, Inst Image Proc & Pattern Recognit, Shanghai 200240, Peoples R China
[3] Univ Technol Sydney, Fac Engn & Informat Technol, Sch Software, Ultimo, NSW 2007, Australia
[4] Univ Macau, Fac Sci & Technol, Macau 999078, Peoples R China
[5] Chongqing Univ, Coll Comp Sci, Chongqing 400000, Peoples R China
[6] Univ Sydney, Sch Informat Technol, Sydney, NSW 2008, Australia
[7] Univ Sydney, Fac Engn & Informat Technol, Sydney, NSW 2008, Australia
基金
澳大利亚研究理事会;
关键词
Concave convex procedure (CCCP); disambiguation; regularization; superset label learning (SLL);
D O I
10.1109/TCYB.2017.2669639
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Different from the traditional supervised learning in which each training example has only one explicit label, superset label learning (SLL) refers to the problem that a training example can be associated with a set of candidate labels, and only one of them is correct. Existing SLL methods are either regularization-based or instance-based, and the latter of which has achieved state-of-the-art performance. This is because the latest instance-based methods contain an explicit disambiguation operation that accurately picks up the groundtruth label of each training example from its ambiguous candidate labels. However, such disambiguation operation does not fully consider the mutually exclusive relationship among different candidate labels, so the disambiguated labels are usually generated in a nondiscriminative way, which is unfavorable for the instance-based methods to obtain satisfactory performance. To address this defect, we develop a novel regularization approach for instance-based superset label (RegISL) learning so that our instance-based method also inherits the good discriminative ability possessed by the regularization scheme. Specifically, we employ a graph to represent the training set, and require the examples that are adjacent on the graph to obtain similar labels. More importantly, a discrimination term is proposed to enlarge the gap of values between possible labels and unlikely labels for every training example. As a result, the intrinsic constraints among different candidate labels are deployed, and the disambiguated labels generated by RegISL are more discriminative and accurate than those output by existing instance-based algorithms. The experimental results on various tasks convincingly demonstrate the superiority of our RegISL to other typical SLL methods in terms of both training accuracy and test accuracy.
引用
收藏
页码:967 / 978
页数:12
相关论文
共 45 条
  • [1] Bertsekas DP, 2014, CONSTRAINED OPTIMIZA
  • [2] Briggs F, 2012, P 18 ACM SIGKDD INT, P534, DOI DOI 10.1145/2339530.2339616
  • [3] Ambiguously Labeled Learning Using Dictionaries
    Chen, Yi-Chen
    Patel, Vishal M.
    Chellappa, Rama
    Phillips, P. Jonathon
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2014, 9 (12) : 2076 - 2088
  • [4] Cid-Sueiro Jesus, 2014, Machine Learning and Knowledge Discovery in Databases. European Conference, ECML PKDD 2014. Proceedings: LNCS 8724, P197, DOI 10.1007/978-3-662-44848-9_13
  • [5] Cid-Sueiro Jesus, 2012, Proceedings of the 25th International Conference on Neural Information Processing Systems-Volume 1, NIPS'12, page, P1565
  • [6] Cour T, 2011, J MACH LEARN RES, V12, P1501
  • [7] Cour T, 2009, PROC CVPR IEEE, P919, DOI 10.1109/CVPRW.2009.5206667
  • [8] Fan RE, 2008, J MACH LEARN RES, V9, P1871
  • [9] LOCAL CONVERGENCE OF EXACT AND INEXACT AUGMENTED LAGRANGIAN METHODS UNDER THE SECOND-ORDER SUFFICIENT OPTIMALITY CONDITION
    Fernandez, D.
    Solodov, M. V.
    [J]. SIAM JOURNAL ON OPTIMIZATION, 2012, 22 (02) : 384 - 407
  • [10] Classification in the Presence of Label Noise: a Survey
    Frenay, Benoit
    Verleysen, Michel
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (05) : 845 - 869