Lγ-PageRank for semi-supervised learning

被引:14
作者
Bautista, Esteban [1 ,2 ]
Abry, Patrice [2 ]
Goncalves, Paulo [1 ]
机构
[1] Univ Lyon, UCB Lyon 1, CNRS, ENS Lyon,INRIA,LIP,UMR 5668, F-69342 Lyon, France
[2] Univ Lyon, Univ Claude Bernard, Ens Lyon, CNRS,Lab Phys, F-69342 Lyon, France
关键词
Semi-supervised learning; PageRank; Laplacian powers; Diffusion on graphs; Signed graphs; Optimal tuning; MNIST;
D O I
10.1007/s41109-019-0172-x
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
PageRank for Semi-Supervised Learning has shown to leverage data structures and limited tagged examples to yield meaningful classification. Despite successes, classification performance can still be improved, particularly in cases of graphs with unclear clusters or unbalanced labeled data. To address such limitations, a novel approach based on powers of the Laplacian matrix L-gamma (gamma>0), referred to as L-gamma-PageRank, is proposed. Its theoretical study shows that it operates on signed graphs, where nodes belonging to one same class are more likely to share positive edges while nodes from different classes are more likely to be connected with negative edges. It is shown that by selecting an optimal gamma, classification performance can be significantly enhanced. A procedure for the automated estimation of the optimal gamma, from a unique observation of data, is devised and assessed. Experiments on several datasets demonstrate the effectiveness of both L-gamma-PageRank classification and the optimal gamma estimation.
引用
收藏
页数:20
相关论文
共 30 条
[1]  
Andersen R, 2007, LECT NOTES COMPUT SC, V4484, P1
[2]   Using PageRank to Locally Partition a Graph [J].
Andersen, Reid ;
Chung, Fan ;
Lang, Kevin .
INTERNET MATHEMATICS, 2007, 4 (01) :35-64
[3]  
[Anonymous], PHONEME DATABASE
[4]  
[Anonymous], 2007, P ICCM
[5]  
[Anonymous], 2008, Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP '08
[6]  
Avrachenkov K, P 2012 SIAM INT C DA, P966, DOI [10.1137/1.9781611972825.83, DOI 10.1137/1.9781611972825.83]
[7]  
Avrachenkov K., 2012, INT WIR COMM MOB COM
[8]   Mean Field Analysis of Personalized PageRank with Implications for Local Graph Clustering [J].
Avrachenkov, Konstantin ;
Kadavankandy, Arun ;
Litvak, Nelly .
JOURNAL OF STATISTICAL PHYSICS, 2018, 173 (3-4) :895-916
[9]  
Avrachenkov Konstantin., 2008, P 31 ANN INT ACM SIG, P873
[10]  
Bautista E., 2017, P 26 C 2017 JUAN LES