Semi-Supervised Affinity Propagation with Soft Instance-Level Constraints

被引:25
作者
Arzeno, Natalia M. [1 ]
Vikalo, Haris [1 ]
机构
[1] Univ Texas Austin, Dept Elect & Comp Engn, Austin, TX 78712 USA
基金
美国国家科学基金会;
关键词
Clustering algorithms; graph algorithms; affinity propagation; semi-supervised learning; noisy pairwise constraints;
D O I
10.1109/TPAMI.2014.2359454
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Soft-constraint semi-supervised affinity propagation (SCSSAP) adds supervision to the affinity propagation (AP) clustering algorithm without strictly enforcing instance-level constraints. Constraint violations lead to an adjustment of the AP similarity matrix at every iteration of the proposed algorithm and to addition of a penalty to the objective function. This formulation is particularly advantageous in the presence of noisy labels or noisy constraints since the penalty parameter of SCSSAP can be tuned to express our confidence in instance-level constraints. When the constraints are noiseless, SCSSAP outperforms unsupervised AP and performs at least as well as the previously proposed semi-supervised AP and constrained expectation maximization. In the presence of label and constraint noise, SCSSAP results in a more accurate clustering than either of the aforementioned established algorithms. Finally, we present an extension of SCSSAP which incorporates metric learning in the optimization objective and can further improve the performance of clustering.
引用
收藏
页码:1041 / 1052
页数:12
相关论文
共 49 条
  • [1] [Anonymous], 2004, Proceedings, Twenty-First International Conference on Machine Learning, ICML 2004
  • [2] [Anonymous], 2006, Advances in neural information processing systems
  • [3] [Anonymous], 2003, P 20 INT C MACHINE L
  • [4] [Anonymous], 2004, Adv. Neural Inf. Process. Syst.
  • [5] [Anonymous], 2006, IEEE COMP SOC C CVPR
  • [6] [Anonymous], 2010, Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, July 25-28, 2010, DOI DOI 10.1145/1835804.183594
  • [7] Bache K, 2013, UCI machine learning repository
  • [8] Bishop Christopher, 2006, Pattern Recognition and Machine Learning, DOI 10.1117/1.2819119
  • [9] Robust supervised classification with mixture models: Learning from data with uncertain labels
    Bouveyron, Charles
    Girard, Stephane
    [J]. PATTERN RECOGNITION, 2009, 42 (11) : 2649 - 2658
  • [10] Identifying mislabeled training data
    Brodley, CE
    Friedl, MA
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1999, 11 : 131 - 167