Consensus rate-based label propagation for semi-supervised classification

被引:14
作者
Yu, Jaehong [1 ]
Kim, Seoung Bum [2 ]
机构
[1] NYU, Sch Med, Dept Populat Hlth, 650 First Ave, New York, NY 10016 USA
[2] Korea Univ, Dept Ind Management Engn, 145 Anam Ro, Seoul 02841, South Korea
基金
新加坡国家研究基金会;
关键词
Consensus rate; Label propagation; Semi-supervised classification; Smoothness assumption; CLUSTER; ENSEMBLES;
D O I
10.1016/j.ins.2018.06.074
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Label propagation is one of the most widely used semi-supervised classification methods. It utilizes neighborhood structures of observations to apply the smoothness assumption, which describes that observations close to each other are more likely to share a label. However, a single neighborhood structure cannot appropriately reflect intrinsic data structures, and hence, existing label propagation methods can fail to achieve superior performance. To overcome these limitations, we propose a label propagation algorithm based on consensus rates that are calculated by summarizing multiple clustering solutions to incorporate various properties of the data. Thus, the proposed algorithm can effectively reflect the intrinsic data structures, and yield accurate classification results. Experiments are conducted on various benchmark datasets to examine the properties of the proposed algorithm, and to compare it with the existing label propagation methods. The experimental results confirm that the proposed label propagation algorithm demonstrated superior performance compared to the existing methods. (C) 2018 Elsevier Inc. All rights reserved.
引用
收藏
页码:265 / 284
页数:20
相关论文
共 44 条
[1]  
Agovic A., 2009, TECHNICAL REPORT
[2]  
Anava Oren., 2016, ADV NEURAL INFORM PR, V29, P4916
[3]  
[Anonymous], 2006, Introduction to Data Mining
[4]  
[Anonymous], 2005, AISTAT
[5]  
[Anonymous], 2001, Matrix Analysis and Applied Linear Algebra
[6]  
[Anonymous], 2011, WWW
[7]  
Arthur D, 2007, PROCEEDINGS OF THE EIGHTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P1027
[8]  
Balasubramanian M, 2002, SCIENCE, V295
[9]  
Belkin M, 2006, J MACH LEARN RES, V7, P2399
[10]  
Chapelle O., 2009, SEMISUPERVISED LEARN, V20, P542