Graph-Based Semi-Supervised Learning: A Comprehensive Review

被引:189
作者
Song, Zixing [1 ]
Yang, Xiangli [2 ]
Xu, Zenglin [3 ,4 ]
King, Irwin [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, SMILE Lab, Chengdu 611731, Peoples R China
[3] Harbin Inst Technol, Sch Comp Sci & Technol, Shenzhen 518055, Peoples R China
[4] Peng Cheng Lab, Shenzhen 518066, Peoples R China
关键词
Taxonomy; Semisupervised learning; Manifolds; Codes; Training; Prediction algorithms; Image color analysis; Graph embedding; graph representation learning; graph-based semi-supervised learning (GSSL); semi-supervised learning (SSL); LABEL PROPAGATION; MANIFOLD REGULARIZATION; CONSTRUCTION; LAPLACIAN; FRAMEWORK;
D O I
10.1109/TNNLS.2022.3155478
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semi-supervised learning (SSL) has tremendous value in practice due to the utilization of both labeled and unlabelled data. An essential class of SSL methods, referred to as graph-based semi-supervised learning (GSSL) methods in the literature, is to first represent each sample as a node in an affinity graph, and then, the label information of unlabeled samples can be inferred based on the structure of the constructed graph. GSSL methods have demonstrated their advantages in various domains due to their uniqueness of structure, the universality of applications, and their scalability to large-scale data. Focusing on GSSL methods only, this work aims to provide both researchers and practitioners with a solid and systematic understanding of relevant advances as well as the underlying connections among them. The concentration on one class of SSL makes this article distinct from recent surveys that cover a more general and broader picture of SSL methods yet often neglect the fundamental understanding of GSSL methods. In particular, a significant contribution of this article lies in a newly generalized taxonomy for GSSL under the unified framework, with the most up-to-date references and valuable resources such as codes, datasets, and applications. Furthermore, we present several potential research directions as future work with our insights into this rapidly growing field.
引用
收藏
页码:8174 / 8194
页数:21
相关论文
共 154 条
[1]  
Ahmed Amr., 2013, Proceedings of the 22Nd International Conference on World Wide Web, P37
[2]  
Alam F., 2018, 12 INT AAAI C WEB SO, P556
[3]  
Aliannejadi M., 2014, ALTA, P98
[4]  
Ando R.K., 2007, Advances in neural information processing systems, P25, DOI [10.7551/mitpress/7503.003.0009, DOI 10.7551/MITPRESS/7503.003.0009]
[5]  
[Anonymous], 2002, Tech. Rep. CMU-CALD-02-107
[6]  
Argyriou A., 2005, Proc. of Advances in Neural Information Processing Systems 18, P67
[7]  
Argyriou A, 2009, J MACH LEARN RES, V10, P2507
[8]  
Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, DOI 10.48550/ARXIV.1409.0473]
[9]   An enhanced graph-based semi-supervised learning algorithm to detect fake users on Twitter [J].
BalaAnand, M. ;
Karthikeyan, N. ;
Karthik, S. ;
Varatharajan, R. ;
Manogaran, Gunasekaran ;
Sivaparthipan, C. B. .
JOURNAL OF SUPERCOMPUTING, 2019, 75 (09) :6085-6105
[10]  
Belkin M, 2004, 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS, P1000