On semi-supervised learning

被引:2
|
作者
Cholaquidis, A. [1 ]
Fraiman, R. [1 ]
Sued, M. [2 ]
机构
[1] Univ Republica, Fac Ciencias, Montevideo, Uruguay
[2] INst Calculo, Fac Ciencias Exactas & Nat, Buenos Aires, DF, Argentina
关键词
Semi-supervised learning; Small training sample; Consistency; PATTERN-RECOGNITION; ERROR;
D O I
10.1007/s11749-019-00690-2
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Major efforts have been made, mostly in the machine learning literature, to construct good predictors combining unlabelled and labelled data. These methods are known as semi-supervised. They deal with the problem of how to take advantage, if possible, of a huge amount of unlabelled data to perform classification in situations where there are few labelled data. This is not always feasible: it depends on the possibility to infer the labels from the unlabelled data distribution. Nevertheless, several algorithms have been proposed recently. In this work, we present a new method that, under almost necessary conditions, attains asymptotically the performance of the best theoretical rule when the size of the unlabelled sample goes to infinity, even if the size of the labelled sample remains fixed. Its performance and computational time are assessed through simulations and in the well- known "Isolet" real data of phonemes, where a strong dependence on the choice of the initial training sample is shown. The main focus of this work is to elucidate when and why semi-supervised learning works in the asymptotic regime described above. The set of necessary assumptions, although reasonable, show that semi-parametric methods only attain consistency for very well-conditioned problems.
引用
收藏
页码:914 / 937
页数:24
相关论文
共 50 条
  • [41] Quantum semi-supervised kernel learning
    Saeedi, Seyran
    Panahi, Aliakbar
    Arodz, Tom
    QUANTUM MACHINE INTELLIGENCE, 2021, 3 (02)
  • [42] Lγ-PageRank for semi-supervised learning
    Esteban Bautista
    Patrice Abry
    Paulo Gonçalves
    Applied Network Science, 4
  • [43] On Consistency of Graph-based Semi-supervised Learning
    Du, Chengan
    Zhao, Yunpeng
    Wang, Feng
    2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 483 - 491
  • [44] Semi-Supervised Learning With Label Proportion
    Sun, Ningzhao
    Luo, Tingjin
    Zhuge, Wenzhang
    Tao, Hong
    Hou, Chenping
    Hu, Dewen
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (01) : 877 - 890
  • [45] Semi-Supervised Learning for Intelligent Surveillance
    de Freitas, Guilherme Correa
    Maximo, Marcos R. O. A.
    Verri, Filipe A. N.
    2022 LATIN AMERICAN ROBOTICS SYMPOSIUM (LARS), 2022 BRAZILIAN SYMPOSIUM ON ROBOTICS (SBR), AND 2022 WORKSHOP ON ROBOTICS IN EDUCATION (WRE), 2022, : 306 - 311
  • [46] Semi-Supervised Billinear Subspace Learning
    Xu, Dong
    Yan, Shuicheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2009, 18 (07) : 1671 - 1676
  • [47] Distributed Semi-Supervised Metric Learning
    Shen, Pengcheng
    Du, Xin
    Li, Chunguang
    IEEE ACCESS, 2016, 4 : 8558 - 8571
  • [48] Semi-supervised learning in knowledge discovery
    Klose, A
    Kruse, R
    FUZZY SETS AND SYSTEMS, 2005, 149 (01) : 209 - 233
  • [49] Quantum annealing for semi-supervised learning
    Zheng, Yu-Lin
    Zhang, Wen
    Zhou, Cheng
    Geng, Wei
    CHINESE PHYSICS B, 2021, 30 (04)
  • [50] Active learning for semi-supervised structural health monitoring
    Bull, L.
    Worden, K.
    Manson, G.
    Dervilis, N.
    JOURNAL OF SOUND AND VIBRATION, 2018, 437 : 373 - 388