A brief introduction to weakly supervised learning

被引:1137
作者
Zhou, Zhi-Hua [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
machine learning; weakly supervised learning; supervised learning; INSTANCE; CLASSIFICATION; NOISE;
D O I
10.1093/nsr/nwx106
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Supervised learning techniques construct predictive models by learning from a large number of training examples, where each training example has a label indicating its ground-truth output. Though current techniques have achieved great success, it is noteworthy that in many tasks it is difficult to get strong supervision information like fully ground-truth labels due to the high cost of the data-labeling process. Thus, it is desirable for machine-learning techniques to work with weak supervision. This article reviews some research progress of weakly supervised learning, focusing on three typical types of weak supervision: incomplete supervision, where only a subset of training data is given with labels; inexact supervision, where the training data are given with only coarse-grained labels; and inaccurate supervision, where the given labels are not always ground-truth.
引用
收藏
页码:44 / 53
页数:10
相关论文
共 103 条
  • [51] Lewis D. D., 1994, SIGIR '94. Proceedings of the Seventeenth Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, P3
  • [52] Li X., 2013, P 23 INT JOINT C ART, V13, P1479
  • [53] Towards Making Unlabeled Data Never Hurt
    Li, Yu-Feng
    Zhou, Zhi-Hua
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (01) : 175 - 188
  • [54] Li YF, 2013, J MACH LEARN RES, V14, P2151
  • [55] Liu G Q, 2012, PMLR, P253
  • [56] PAC learning axis-aligned rectangles with respect to product distributions from multiple-instance examples
    Long, PM
    Tan, L
    [J]. MACHINE LEARNING, 1998, 30 (01) : 7 - 21
  • [57] Miller DJ, 1997, ADV NEUR IN, V9, P571
  • [58] Identifying and handling mislabelled instances
    Muhlenbach, F
    Lallich, S
    Zighed, DA
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2004, 22 (01) : 89 - 109
  • [59] Nguyen H.T., 2004, P 21 INT C MACH LEAR, P79
  • [60] Text classification from labeled and unlabeled documents using EM
    Nigam, K
    McCallum, AK
    Thrun, S
    Mitchell, T
    [J]. MACHINE LEARNING, 2000, 39 (2-3) : 103 - 134