Self-Supervised Contrastive Learning for Medical Time Series: A Systematic Review

被引:30
作者
Liu, Ziyu [1 ]
Alavi, Azadeh [1 ]
Li, Minyi [2 ]
Zhang, Xiang [3 ]
机构
[1] RMIT, Sch Comp Technol, Melbourne, Vic 3000, Australia
[2] Coles, Melbourne, Vic 3123, Australia
[3] Univ N Carolina, Dept Comp Sci, Charlotte, NC 28223 USA
基金
美国国家科学基金会;
关键词
self-supervised learning; medical time series; deep learning; healthcare; pretext tasks; contrastive learning; systematic review; RESEARCH RESOURCE; SLEEP; EEG; HEALTH; DATABASE; CLASSIFICATION; FRAMEWORK; CHILDREN; DATASET;
D O I
10.3390/s23094221
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Medical time series are sequential data collected over time that measures health-related signals, such as electroencephalography (EEG), electrocardiography (ECG), and intensive care unit (ICU) readings. Analyzing medical time series and identifying the latent patterns and trends that lead to uncovering highly valuable insights for enhancing diagnosis, treatment, risk assessment, and disease progression. However, data mining in medical time series is heavily limited by the sample annotation which is time-consuming and labor-intensive, and expert-depending. To mitigate this challenge, the emerging self-supervised contrastive learning, which has shown great success since 2020, is a promising solution. Contrastive learning aims to learn representative embeddings by contrasting positive and negative samples without the requirement for explicit labels. Here, we conducted a systematic review of how contrastive learning alleviates the label scarcity in medical time series based on PRISMA standards. We searched the studies in five scientific databases (IEEE, ACM, Scopus, Google Scholar, and PubMed) and retrieved 1908 papers based on the inclusion criteria. After applying excluding criteria, and screening at title, abstract, and full text levels, we carefully reviewed 43 papers in this area. Specifically, this paper outlines the pipeline of contrastive learning, including pre-training, fine-tuning, and testing. We provide a comprehensive summary of the various augmentations applied to medical time series data, the architectures of pre-training encoders, the types of fine-tuning classifiers and clusters, and the popular contrastive loss functions. Moreover, we present an overview of the different data types used in medical time series, highlight the medical applications of interest, and provide a comprehensive table of 51 public datasets that have been utilized in this field. In addition, this paper will provide a discussion on the promising future scopes such as providing guidance for effective augmentation design, developing a unified framework for analyzing hierarchical time series, and investigating methods for processing multimodal data. Despite being in its early stages, self-supervised contrastive learning has shown great potential in overcoming the need for expert-created annotations in the research of medical time series.
引用
收藏
页数:34
相关论文
共 138 条
[1]   Survey on Self-Supervised Learning: Auxiliary Pretext Tasks and Contrastive Learning Methods in Imaging [J].
Albelwi, Saleh .
ENTROPY, 2022, 24 (04)
[2]  
Anguita D., 2013, 21 EUROPEAN S ARTIFI, V3, P437
[3]   A mind-brain-body dataset of MRI, EEG, cognition, emotion, and peripheral physiology in young and old adults [J].
Babayan, Anahit ;
Erbey, Miray ;
Kumral, Deniz ;
Reinelt, Janis D. ;
Reiter, Andrea M. F. ;
Roebbig, Josefin ;
Schaare, H. Lina ;
Uhlig, Marie ;
Anwander, Alfred ;
Bazin, Pierre-Louis ;
Horstmann, Annette ;
Lampe, Leonie ;
Nikulin, Vadim V. ;
Okon-Singer, Hadas ;
Preusser, Sven ;
Pampel, Andre ;
Rohr, Christiane S. ;
Sacher, Julia ;
Thoene-Otto, Angelika ;
Trapp, Sabrina ;
Nierhaus, Till ;
Altmann, Denise ;
Arelin, Katrin ;
Bloechl, Maria ;
Bongartz, Edith ;
Breig, Patric ;
Cesnaite, Elena ;
Chen, Sufang ;
Cozatl, Roberto ;
Czerwonatis, Saskia ;
Dambrauskaite, Gabriele ;
Dreyer, Maria ;
Enders, Jessica ;
Engelhardt, Melina ;
Fischer, Marie Michele ;
Forschack, Norman ;
Golchert, Johannes ;
Golz, Laura ;
Guran, C. Alexandrina ;
Hedrich, Susanna ;
Hentschel, Nicole ;
Hoffmann, Daria I. ;
Huntenburg, Julia M. ;
Jost, Rebecca ;
Kosatschek, Anna ;
Kunzendorf, Stella ;
Lammers, Hannah ;
Lauckner, Mark E. ;
Mahjoory, Keyvan ;
Kanaan, Ahmad S. .
SCIENTIFIC DATA, 2019, 6 (1)
[4]   Continuous and automatic mortality risk prediction using vital signs in the intensive care unit: a hybrid neural network approach [J].
Baker, Stephanie ;
Xiang, Wei ;
Atkinson, Ian .
SCIENTIFIC REPORTS, 2020, 10 (01)
[5]  
Ballas A, 2022, Arxiv, DOI [arXiv:2208.14845, DOI 10.48550/ARXIV.2208.14845]
[6]   SELF-SUPERVISED REPRESENTATION LEARNING FROM ELECTROENCEPHALOGRAPHY SIGNALS [J].
Banville, Hubert ;
Albuquerque, Isabela ;
Hyvarinen, Aapo ;
Moffat, Graeme ;
Engemann, Denis-Alexander ;
Gramfort, Alexandre .
2019 IEEE 29TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2019,
[7]  
BioWink GmbH, 2020, CLUE
[8]   The mPower study, Parkinson disease mobile data collected using ResearchKit [J].
Bot, Brian M. ;
Suver, Christine ;
Neto, Elias Chaibub ;
Kellen, Michael ;
Klein, Arno ;
Bare, Christopher ;
Doerr, Megan ;
Pratap, Abhishek ;
Wilbanks, John ;
Dorsey, E. Ray ;
Friend, Stephen H. ;
Trister, Andrew D. .
SCIENTIFIC DATA, 2016, 3
[9]  
Bousseljot Ralf, 1995, Nutzung der EKG-Signaldatenbank CARDIODAT der PTB uber das Internet, P317
[10]  
Brunner C., 2008, Institute for Knowledge Discovery (Laboratory of Brain-Computer Interfaces, V16, P1, DOI [10.21227/katb-zv89.Dataset, DOI 10.21227/KATB-ZV89.DATASET]