SemiBoost: Boosting for Semi-Supervised Learning

被引:199
作者
Mallapragada, Pavan Kumar [1 ]
Jin, Rong [1 ]
Jain, Anil K. [1 ]
Liu, Yi [1 ]
机构
[1] Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI 48823 USA
基金
美国国家科学基金会;
关键词
Machine learning; semi-supervised learning; semi-supervised improvement; manifold assumption; cluster assumption; boosting;
D O I
10.1109/TPAMI.2008.235
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semi-supervised learning has attracted a significant amount of attention in pattern recognition and machine learning. Most previous studies have focused on designing special algorithms to effectively exploit the unlabeled data in conjunction with labeled data. Our goal is to improve the classification accuracy of any given supervised learning algorithm by using the available unlabeled examples. We call this as the Semi-supervised improvement problem, to distinguish the proposed approach from the existing approaches. We design a metasemi-supervised learning algorithm that wraps around the underlying supervised algorithm and improves its performance using unlabeled data. This problem is particularly important when we need to train a supervised learning algorithm with a limited number of labeled examples and a multitude of unlabeled examples. We present a boosting framework for semi-supervised learning, termed as SemiBoost. The key advantages of the proposed semi-supervised learning approach are: 1) performance improvement of any supervised learning algorithm with a multitude of unlabeled data, 2) efficient computation by the iterative boosting algorithm, and 3) exploiting both manifold and cluster assumption in training classification models. An empirical study on 16 different data sets and text categorization demonstrates that the proposed framework improves the performance of several commonly used supervised learning algorithms, given a large number of unlabeled examples. We also show that the performance of the proposed algorithm, SemiBoost, is comparable to the state-of-the-art semi-supervised learning algorithms.
引用
收藏
页码:2000 / 2014
页数:15
相关论文
共 50 条
  • [31] Supervised and Semi-Supervised Learning for Failure Identification in Microwave Networks
    Musumeci, Francesco
    Magni, Luca
    Ayoub, Omran
    Rubino, Roberto
    Capacchione, Massimiliano
    Rigamonti, Gabriele
    Milano, Michele
    Passera, Claudio
    Tornatore, Massimo
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2021, 18 (02): : 1934 - 1945
  • [32] On Semi-Supervised Learning and Sparsity
    Balinsky, Alexander
    Balinsky, Helen
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 3083 - +
  • [33] Safe semi-supervised learning: a brief introduction
    Li, Yu-Feng
    Liang, De-Ming
    FRONTIERS OF COMPUTER SCIENCE, 2019, 13 (04) : 669 - 676
  • [34] Robust semi-supervised learning in open environments
    Guo, Lan-Zhe
    Jia, Lin-Han
    Shao, Jie-Jing
    Li, Yu-Feng
    FRONTIERS OF COMPUTER SCIENCE, 2025, 19 (08)
  • [35] A review of semi-supervised learning for text classification
    José Marcio Duarte
    Lilian Berton
    Artificial Intelligence Review, 2023, 56 : 9401 - 9469
  • [36] Semi-supervised Learning for False Alarm Reduction
    Chiu, Chien-Yi
    Lee, Yuh-Jye
    Chang, Chien-Chung
    Luo, Wen-Yang
    Huang, Hsiu-Chuan
    ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS, 2010, 6171 : 595 - +
  • [37] A review of semi-supervised learning for text classification
    Duarte, Jose Marcio
    Berton, Lilian
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (09) : 9401 - 9469
  • [38] An AdaBoost Algorithm for Multiclass Semi-Supervised Learning
    Tanha, Jafar
    van Someren, Maarten
    Afsarmanesh, Hamideh
    12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012), 2012, : 1116 - 1121
  • [39] Safe semi-supervised learning: a brief introduction
    Yu-Feng Li
    De-Ming Liang
    Frontiers of Computer Science, 2019, 13 : 669 - 676
  • [40] Semi-Supervised Learning with Partial Domain Models
    Armengol, Eva
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE OF THE CATALAN ASSOCIATION FOR ARTIFICIAL INTELLIGENCE, 2013, 256 : 151 - 154