SemiBoost: Boosting for Semi-Supervised Learning

被引:199
作者
Mallapragada, Pavan Kumar [1 ]
Jin, Rong [1 ]
Jain, Anil K. [1 ]
Liu, Yi [1 ]
机构
[1] Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI 48823 USA
基金
美国国家科学基金会;
关键词
Machine learning; semi-supervised learning; semi-supervised improvement; manifold assumption; cluster assumption; boosting;
D O I
10.1109/TPAMI.2008.235
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semi-supervised learning has attracted a significant amount of attention in pattern recognition and machine learning. Most previous studies have focused on designing special algorithms to effectively exploit the unlabeled data in conjunction with labeled data. Our goal is to improve the classification accuracy of any given supervised learning algorithm by using the available unlabeled examples. We call this as the Semi-supervised improvement problem, to distinguish the proposed approach from the existing approaches. We design a metasemi-supervised learning algorithm that wraps around the underlying supervised algorithm and improves its performance using unlabeled data. This problem is particularly important when we need to train a supervised learning algorithm with a limited number of labeled examples and a multitude of unlabeled examples. We present a boosting framework for semi-supervised learning, termed as SemiBoost. The key advantages of the proposed semi-supervised learning approach are: 1) performance improvement of any supervised learning algorithm with a multitude of unlabeled data, 2) efficient computation by the iterative boosting algorithm, and 3) exploiting both manifold and cluster assumption in training classification models. An empirical study on 16 different data sets and text categorization demonstrates that the proposed framework improves the performance of several commonly used supervised learning algorithms, given a large number of unlabeled examples. We also show that the performance of the proposed algorithm, SemiBoost, is comparable to the state-of-the-art semi-supervised learning algorithms.
引用
收藏
页码:2000 / 2014
页数:15
相关论文
共 50 条
  • [41] Sharpened graph ensemble for semi-supervised learning
    Choi, Inae
    Park, Kanghee
    Shin, Hyunjung
    INTELLIGENT DATA ANALYSIS, 2013, 17 (03) : 387 - 398
  • [42] Historical inference based on semi-supervised learning
    Lee, Dong-gi
    Lee, Sangkuk
    Kim, Myungjun
    Shin, Hyunjung
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 106 : 121 - 131
  • [43] Boosting semi-supervised face recognition with raw faces
    Chen, Yunze
    Huang, Junjie
    Zhu, Zheng
    Long, Xianlei
    Gu, Qingyi
    IMAGE AND VISION COMPUTING, 2022, 125
  • [44] Boosting Semi-Supervised Face Recognition With Noise Robustness
    Liu, Yuchi
    Shi, Hailin
    Du, Hang
    Zhu, Rui
    Wang, Jun
    Zheng, Liang
    Mei, Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (02) : 778 - 787
  • [45] Boosting-Based Semi-Supervised AUC Optimization: Theory and Algorithm
    Yang Z.-Y.
    Xu Q.-Q.
    He Y.
    Cao X.-C.
    Huang Q.-M.
    Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (08): : 1598 - 1617
  • [46] Boosting semi-supervised learning under imbalanced regression via pseudo-labeling
    Zong, Nannan
    Su, Songzhi
    Zhou, Changle
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (19)
  • [47] Adaptive Active Learning for Semi-supervised Learning
    Li Y.-C.
    Xiao F.
    Chen Z.
    Li B.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (12): : 3808 - 3822
  • [48] POSITIVE UNLABELED LEARNING BY SEMI-SUPERVISED LEARNING
    Wang, Zhuowei
    Jiang, Jing
    Long, Guodong
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2976 - 2980
  • [49] Broad learning system for semi-supervised learning
    Liu, Zheng
    Huang, Shiluo
    Jin, Wei
    Mu, Ying
    NEUROCOMPUTING, 2021, 444 (444) : 38 - 47
  • [50] Augmentation Learning for Semi-Supervised Classification
    Frommknecht, Tim
    Zipf, Pedro Alves
    Fan, Quanfu
    Shvetsova, Nina
    Kuehne, Hilde
    PATTERN RECOGNITION, DAGM GCPR 2022, 2022, 13485 : 85 - 98