Repetitive Reprediction Deep Decipher for Semi-Supervised Learning

被引:0
作者
Wang, Guo-Hua [1 ]
Wu, Jianxin [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
来源
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2020年 / 34卷
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most recent semi-supervised deep learning (deep SSL) methods used a similar paradigm: use network predictions to update pseudo-labels and use pseudo-labels to update network parameters iteratively. However, they lack theoretical support and cannot explain why predictions are good candidates for pseudo-labels. In this paper, we propose a principled end-to-end framework named deep decipher (D2) for SSL. Within the D2 framework, we prove that pseudo-labels are related to network predictions by an exponential link function, which gives a theoretical support for using predictions as pseudo-labels. Furthermore, we demonstrate that updating pseudo-labels by network predictions will make them uncertain. To mitigate this problem, we propose a training strategy called repetitive reprediction (R2). Finally, the proposed R2-D2 method is tested on the large-scale ImageNet dataset and outperforms state-of-the-art methods by 5 percentage points.
引用
收藏
页码:6170 / 6177
页数:8
相关论文
共 22 条
  • [1] [Anonymous], 2011, PROC INT C NEURAL IN
  • [2] Deep Label Distribution Learning With Label Ambiguity
    Gao, Bin-Bin
    Xing, Chao
    Xie, Chen-Wei
    Wu, Jianxin
    Geng, Xin
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (06) : 2825 - 2838
  • [3] Gastaldi Xavier, 2017, SHAKE SHAKE REGULARI
  • [4] He K., 2015, ARXIV, p1505.00996, DOI 10.48550/arXiv
  • [5] Label Propagation for Deep Semi-supervised Learning
    Iscen, Ahmet
    Tolias, Giorgos
    Avrithis, Yannis
    Chum, Ondrej
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5065 - 5074
  • [6] Krizhevsky A., 2009, LEARNING MULTIPLE LA
  • [7] Laine S., 2017, ICLR POSTER
  • [8] Gradient-based learning applied to document recognition
    Lecun, Y
    Bottou, L
    Bengio, Y
    Haffner, P
    [J]. PROCEEDINGS OF THE IEEE, 1998, 86 (11) : 2278 - 2324
  • [9] Lee Dong-Hyun, 2013, WORKSH CHALL REPR LE, V3, P896, DOI DOI 10.1109/IEDM.2013.6724604
  • [10] Transductive Centroid Projection for Semi-supervised Large-Scale Recognition
    Liu, Yu
    Song, Guanglu
    Shao, Jing
    Jin, Xiao
    Wang, Xiaogang
    [J]. COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 : 72 - 89