Contrastive self-supervised learning: review, progress, challenges and future research directions

Cited by: 36
Authors
Kumar, Pranjal [1]
Rawat, Piyush [2]
Chauhan, Siddhartha [1]
Affiliations
[1] NIT Hamirpur, Hamirpur 177005, Himachal Pradesh, India
[2] Univ Petr & Energy Studies, Sch Comp Sci, Dept Syst, Dehra Dun 248007, Uttarakhand, India
Keywords
Contrastive learning; Self-supervised learning; Unsupervised learning; Data augmentation; Survey; REPRESENTATION; EMBEDDINGS
DOI
10.1007/s13735-022-00245-6
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In the last decade, deep supervised learning has had tremendous success. However, its flaws, such as its dependence on costly manual annotation of large datasets and its vulnerability to attacks, have prompted researchers to look for alternative models. Incorporating contrastive learning (CL) into self-supervised learning (SSL) has turned out to be an effective alternative. This paper provides a comprehensive review of CL methodology in terms of its approaches, encoding techniques and loss functions. It discusses the applications of CL in various domains such as Natural Language Processing (NLP), Computer Vision, and speech and text recognition and prediction. The paper presents an overview of and background on SSL to introduce the underlying ideas and concepts. It also performs a comparative study of the works that use CL methods for various downstream tasks in each domain. Finally, it discusses the limitations of current methods, as well as the need for additional techniques and future directions in order to make meaningful progress in this area.
Pages: 461-488
Page count: 28