Applying Self-Supervised Learning to Medicine: Review of the State of the Art and Medical Implementations

被引:41
作者
Chowdhury, Alexander [1 ]
Rosenthal, Jacob [1 ,2 ]
Waring, Jonathan [1 ]
Umeton, Renato [1 ,2 ,3 ,4 ]
机构
[1] Dana Farber Canc Inst, Dept Informat & Anal, Boston, MA 02215 USA
[2] Weill Cornell Med, Dept Pathol & Lab Med, New York, NY 10021 USA
[3] Harvard TH Chan Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[4] MIT, Dept Mech Engn, Dept Biol Engn, Cambridge, MA 02139 USA
来源
INFORMATICS-BASEL | 2021年 / 8卷 / 03期
关键词
self-supervised learning; healthcare; representation learning; medicine; computer vision; pathology; machine learning; CONVOLUTIONAL NEURAL-NETWORKS;
D O I
10.3390/informatics8030059
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Machine learning has become an increasingly ubiquitous technology, as big data continues to inform and influence everyday life and decision-making. Currently, in medicine and healthcare, as well as in most other industries, the two most prevalent machine learning paradigms are supervised learning and transfer learning. Both practices rely on large-scale, manually annotated datasets to train increasingly complex models. However, the requirement of data to be manually labeled leaves an excess of unused, unlabeled data available in both public and private data repositories. Self-supervised learning (SSL) is a growing area of machine learning that can take advantage of unlabeled data. Contrary to other machine learning paradigms, SSL algorithms create artificial supervisory signals from unlabeled data and pretrain algorithms on these signals. The aim of this review is two-fold: firstly, we provide a formal definition of SSL, divide SSL algorithms into their four unique subsets, and review the state of the art published in each of those subsets between the years of 2014 and 2020. Second, this work surveys recent SSL algorithms published in healthcare, in order to provide medical experts with a clearer picture of how they can integrate SSL into their research, with the objective of leveraging unlabeled data.
引用
收藏
页数:29
相关论文
共 121 条
[1]   Learning to See by Moving [J].
Agrawal, Pulkit ;
Carreira, Joao ;
Malik, Jitendra .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :37-45
[2]   Semisupervised learning using denoising autoencoders for brain lesion detection and segmentation [J].
Alex V. ;
Vaidhya K. ;
Thirunavukkarasu S. ;
Kesavadas C. ;
Krishnamurthi G. .
Journal of Medical Imaging, 2017, 4 (04)
[3]   Architectonic Mapping of the Human Brain beyond Brodmann [J].
Amunts, Katrin ;
Zilles, Karl .
NEURON, 2015, 88 (06) :1086-1107
[4]  
[Anonymous], 2012, Advances in Neural Information Processing Systems
[5]  
[Anonymous], 1987, P AAAI C ART INT
[6]  
Arjovsky M, 2017, PR MACH LEARN RES, V70
[7]  
Bachman P, 2019, ADV NEUR IN, V32
[8]   Self-Supervised Learning for Cardiac MR Image Segmentation by Anatomical Position Prediction [J].
Bai, Wenjia ;
Chen, Chen ;
Tarroni, Giacomo ;
Duan, Jinming ;
Guitton, Florian ;
Petersen, Steffen E. ;
Guo, Yike ;
Matthews, Paul M. ;
Rueckert, Daniel .
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT II, 2019, 11765 :541-549
[9]   Network Dissection: Quantifying Interpretability of Deep Visual Representations [J].
Bau, David ;
Zhou, Bolei ;
Khosla, Aditya ;
Oliva, Aude ;
Torralba, Antonio .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3319-3327
[10]  
Bromley J., 1993, International Journal of Pattern Recognition and Artificial Intelligence, V7, P669, DOI 10.1142/S0218001493000339