Repetitive Reprediction Deep Decipher for Semi-Supervised Learning

被引：0

作者：

Wang, Guo-Hua ^{[1
]}

Wu, Jianxin ^{[1
]}

机构：

[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China

来源：

THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2020年 / 34卷

基金：

中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Most recent semi-supervised deep learning (deep SSL) methods used a similar paradigm: use network predictions to update pseudo-labels and use pseudo-labels to update network parameters iteratively. However, they lack theoretical support and cannot explain why predictions are good candidates for pseudo-labels. In this paper, we propose a principled end-to-end framework named deep decipher (D2) for SSL. Within the D2 framework, we prove that pseudo-labels are related to network predictions by an exponential link function, which gives a theoretical support for using predictions as pseudo-labels. Furthermore, we demonstrate that updating pseudo-labels by network predictions will make them uncertain. To mitigate this problem, we propose a training strategy called repetitive reprediction (R2). Finally, the proposed R2-D2 method is tested on the large-scale ImageNet dataset and outperforms state-of-the-art methods by 5 percentage points.

引用

页码：6170 / 6177

页数：8

共 22 条

[1] [Anonymous], 2011, PROC INT C NEURAL IN
[2] Deep Label Distribution Learning With Label Ambiguity
Gao, Bin-Bin
Xing, Chao
Xie, Chen-Wei
Wu, Jianxin
Geng, Xin
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (06) : 2825 - 2838
[3] Gastaldi Xavier, 2017, SHAKE SHAKE REGULARI
[4] He K., 2015, ARXIV, p1505.00996, DOI 10.48550/arXiv
[5] Label Propagation for Deep Semi-supervised Learning
Iscen, Ahmet
Tolias, Giorgos
Avrithis, Yannis
Chum, Ondrej
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5065 - 5074
[6] Krizhevsky A., 2009, LEARNING MULTIPLE LA
[7] Laine S., 2017, ICLR POSTER
[8] Gradient-based learning applied to document recognition
Lecun, Y
Bottou, L
Bengio, Y
Haffner, P
[J]. PROCEEDINGS OF THE IEEE, 1998, 86 (11) : 2278 - 2324
[9] Lee Dong-Hyun, 2013, WORKSH CHALL REPR LE, V3, P896, DOI DOI 10.1109/IEDM.2013.6724604
[10] Transductive Centroid Projection for Semi-supervised Large-Scale Recognition
Liu, Yu
Song, Guanglu
Shao, Jing
Jin, Xiao
Wang, Xiaogang
[J]. COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 : 72 - 89

← 1 2 3 →