On semi-supervised learning

被引:2
|
作者
Cholaquidis, A. [1 ]
Fraiman, R. [1 ]
Sued, M. [2 ]
机构
[1] Univ Republica, Fac Ciencias, Montevideo, Uruguay
[2] INst Calculo, Fac Ciencias Exactas & Nat, Buenos Aires, DF, Argentina
关键词
Semi-supervised learning; Small training sample; Consistency; PATTERN-RECOGNITION; ERROR;
D O I
10.1007/s11749-019-00690-2
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Major efforts have been made, mostly in the machine learning literature, to construct good predictors combining unlabelled and labelled data. These methods are known as semi-supervised. They deal with the problem of how to take advantage, if possible, of a huge amount of unlabelled data to perform classification in situations where there are few labelled data. This is not always feasible: it depends on the possibility to infer the labels from the unlabelled data distribution. Nevertheless, several algorithms have been proposed recently. In this work, we present a new method that, under almost necessary conditions, attains asymptotically the performance of the best theoretical rule when the size of the unlabelled sample goes to infinity, even if the size of the labelled sample remains fixed. Its performance and computational time are assessed through simulations and in the well- known "Isolet" real data of phonemes, where a strong dependence on the choice of the initial training sample is shown. The main focus of this work is to elucidate when and why semi-supervised learning works in the asymptotic regime described above. The set of necessary assumptions, although reasonable, show that semi-parametric methods only attain consistency for very well-conditioned problems.
引用
收藏
页码:914 / 937
页数:24
相关论文
共 50 条
  • [21] Semi-supervised Learning with Multimodal Perturbation
    Su, Lei
    Liao, Hongzhi
    Yu, Zhengtao
    Tang, Jiahua
    ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 1, PROCEEDINGS, 2009, 5551 : 651 - +
  • [22] Robust Semi-supervised Learning for Biometrics
    Yang, Nanhai
    Huang, Mingming
    He, Ran
    Wang, Xiukun
    LIFE SYSTEM MODELING AND INTELLIGENT COMPUTING, PT I, 2010, 6328 : 466 - 476
  • [23] Lγ-PageRank for semi-supervised learning
    Bautista, Esteban
    Abry, Patrice
    Goncalves, Paulo
    APPLIED NETWORK SCIENCE, 2019, 4 (01)
  • [24] Feature ranking for semi-supervised learning
    Petkovic, Matej
    Dzeroski, Saso
    Kocev, Dragi
    MACHINE LEARNING, 2023, 112 (11) : 4379 - 4408
  • [25] A Survey on Deep Semi-Supervised Learning
    Yang, Xiangli
    Song, Zixing
    King, Irwin
    Xu, Zenglin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (09) : 8934 - 8954
  • [26] SemiBoost: Boosting for Semi-Supervised Learning
    Mallapragada, Pavan Kumar
    Jin, Rong
    Jain, Anil K.
    Liu, Yi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (11) : 2000 - 2014
  • [27] Semi-supervised Learning with Gaussian Processes
    Li, Hongwei
    Li, Yakui
    Lu, Hanqing
    PROCEEDINGS OF THE 2008 CHINESE CONFERENCE ON PATTERN RECOGNITION (CCPR 2008), 2008, : 13 - 17
  • [28] Lagrangian supervised and semi-supervised extreme learning machine
    Ma, Jun
    Wen, Yakun
    Yang, Liming
    APPLIED INTELLIGENCE, 2019, 49 (02) : 303 - 318
  • [29] Semi-supervised learning in unbalanced networks with heterogeneous degree
    Li, Ting
    Ying, Ningchen
    Yu, Xianshi
    Jing, Bing-Yi
    STATISTICS AND ITS INTERFACE, 2024, 17 (03) : 501 - 516
  • [30] Analysis of active semi-supervised learning
    Berton, Lilian
    Mitsuishi, Felipe Baz
    Vega-Oliveros, Didier A.
    38TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2023, 2023, : 1122 - 1129