Review on self-supervised image recognition using deep neural networks

Cited by: 122
Authors
Ohri, Kriti [1]
Kumar, Mukesh [1]
Affiliations
[1] Natl Inst Technol Patna, Dept CSE, Patna 800005, Bihar, India
Keywords
Self-supervised learning; Unsupervised learning; Semi-supervised learning; Transfer learning; Deep learning; Pretext tasks; Convolutional neural network; Contrastive learning; Online clustering
DOI
10.1016/j.knosys.2021.107090
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Deep learning has brought significant advances in image understanding tasks such as object detection, image classification, and image segmentation. However, the success of image recognition relies largely on supervised learning, which requires a huge number of human-annotated labels. Self-supervised learning offers a way to avoid the costly collection of labeled data and to serve domains where very few standard pre-trained models exist. It is a form of unsupervised learning that allows the network to learn rich visual features that help in performing downstream computer vision tasks such as image classification, object detection, and image segmentation. This paper provides a thorough review of self-supervised learning, which has the potential to revolutionize the computer vision field using unlabeled data. First, the motivation for self-supervised learning is discussed, along with other annotation-efficient learning schemes. Then, the general pipelines for supervised and self-supervised learning are illustrated. Next, various handcrafted pretext tasks that enable the learning of visual features from unlabeled image datasets are explained. The paper also highlights recent breakthroughs in self-supervised learning using contrastive learning and clustering methods that are outperforming supervised learning. Finally, performance comparisons of self-supervised techniques on evaluation tasks such as image classification and detection are presented. The paper concludes with practical considerations and open challenges for image recognition tasks in the self-supervised learning regime. Throughout the review, the core focus is on visual feature learning from images using self-supervised approaches. © 2021 Elsevier B.V. All rights reserved.
Pages: 22