Scalable Graph-Based Semi-Supervised Learning through Sparse Bayesian Model

被引:50
作者
Jiang, Bingbing [1 ]
Chen, Huanhuan [1 ]
Yuan, Bo [2 ]
Yao, Xin [2 ,3 ]
机构
[1] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230027, Anhui, Peoples R China
[2] Southern Univ Sci & Technol SUSTech, Shenzhen Key Lab Computat Intelligence, Dept Comp Sci & Engn, Shenzhen 518055, Guangdong, Peoples R China
[3] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England
基金
中国国家自然科学基金;
关键词
Semi-supervised learning; graph-based methods; sparse Bayesian model; incremental learning; large-scale data sets; CLASSIFICATION; ROBUSTNESS;
D O I
10.1109/TKDE.2017.2749574
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semi-supervised learning (SSL) concerns the problem of how to improve classifiers' performance through making use of prior knowledge from unlabeled data. Many SSL methods have been developed to integrate unlabeled data into the classifiers based on either the manifold or cluster assumption in recent years. In particular, the graph-based approaches, following the manifold assumption, have achieved a promising performance in many real-world applications. However, most of them work well on small-scale data sets only and lack probabilistic outputs. In this paper, a scalable graph-based SSL framework through sparse Bayesian model is proposed by defining a graph-based sparse prior. Based on the traditional Bayesian inference technique, a sparse Bayesian SSL algorithm ((SBSL)-L-2) is obtained, which can remove the irrelevant unlabeled samples and make probabilistic prediction for out-of-sample data. Moreover, in order to scale (SBSL)-L-2 to large-scale data sets, an incremental (SBSL)-L-2 ((ISBSL)-L-2) is derived. The key idea of (ISBSL)-L-2 is employing an incremental strategy and sequentially selecting parts of unlabeled samples that contribute to the learning instead of using all available unlabeled samples directly. (ISBSL)-L-2 has lower time and space complexities than previous SSL algorithms with the use of all unlabeled samples. Extensive experiments on various data sets verify that our algorithms can achieve comparable classification effectiveness and efficiency with much better scalability. Finally, the generalization error bound is derived based on robustness analysis.
引用
收藏
页码:2758 / 2771
页数:14
相关论文
共 50 条
  • [41] A graph-based semi-supervised learning algorithm for web page classification
    Liu, Rong
    Zhou, Jianzhong
    Liu, Ming
    ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 2, 2006, : 856 - +
  • [42] A general graph-based semi-supervised learning with novel class discovery
    Feiping Nie
    Shiming Xiang
    Yun Liu
    Changshui Zhang
    Neural Computing and Applications, 2010, 19 : 549 - 555
  • [43] Graph-based Semi-supervised Learning with Manifold Preprocessing for Image Classification
    Gong, Yun-Chao
    Liu, Feng
    Chen, Chuanliang
    2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 391 - +
  • [44] Instance selection method for improving graph-based semi-supervised learning
    Wang, Hai
    Wang, Shao-Bo
    Li, Yu-Feng
    FRONTIERS OF COMPUTER SCIENCE, 2018, 12 (04) : 725 - 735
  • [45] Graph-based Semi-Supervised Learning by Strengthening Local Label Consistency
    Li, Chen
    Peng, Xutan
    Peng, Hao
    Wu, Jia
    Wang, Lihong
    Yu, Philip S.
    Li, Jianxin
    Sun, Lichao
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 3201 - 3205
  • [46] Visual Texture Perception via Graph-based Semi-supervised Learning
    Zhang, Qin
    Dong, Junyu
    Zhong, Guoqiang
    NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615
  • [47] Multi Similarity Metric Fusion in Graph-Based Semi-Supervised Learning
    Bahrami, Saeedeh
    Bosaghzadeh, Alireza
    Dornaika, Fadi
    COMPUTATION, 2019, 7 (01)
  • [48] Nonnegative Sparse and KNN graph for semi-supervised learning
    Zhang, Yunbin
    Zhang, Chunmei
    Zhou, Qianqi
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN MECHANICAL ENGINEERING AND INDUSTRIAL INFORMATICS (AMEII 2016), 2016, 73 : 1178 - 1182
  • [49] Semi-Supervised Logistic Discrimination Via Graph-Based Regularization
    Kawano, Shuichi
    Misumi, Toshihiro
    Konishi, Sadanori
    NEURAL PROCESSING LETTERS, 2012, 36 (03) : 203 - 216
  • [50] From Cluster Assumption to Graph Convolution: Graph-Based Semi-Supervised Learning Revisited
    Wang, Zheng
    Ding, Hongming
    Pan, Li
    Li, Jianhua
    Gong, Zhiguo
    Yu, Philip S.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,