CONTRASTIVE SELF-SUPERVISED LEARNING FOR SPATIO-TEMPORAL ANALYSIS OF LUNG ULTRASOUND VIDEOS

被引:1
|
作者
Chen, Li [1 ]
Rubin, Jonathan [1 ]
Ouyang, Jiahong [1 ]
Balaraju, Naveen [1 ]
Patil, Shubham [1 ]
Mehanian, Courosh [2 ]
Kulhare, Sourabh [2 ]
Millin, Rachel [2 ]
Gregory, Kenton W. [3 ]
Gregory, Cynthia R. [3 ]
Zhu, Meihua [3 ]
Kessler, David O. [4 ]
Malia, Laurie [4 ]
Dessie, Almaz [4 ]
Rabiner, Joni [4 ]
Coneybeare, Di [4 ]
Shopsin, Bo [5 ]
Hersh, Andrew [6 ]
Madar, Cristian [7 ]
Shupp, Jeffrey [8 ]
Johnson, Laura S. [8 ]
Avila, Jacob [9 ]
Dwyer, Kristin [10 ]
Weimersheimer, Peter [11 ]
Raju, Balasundar [1 ]
Kruecker, Jochen [1 ]
Chen, Alvin [1 ]
机构
[1] Philips Res North Amer, Cambridge, MA 02141 USA
[2] Global Hlth Labs, Bellevue, WA USA
[3] Oregon Hlth & Sci Univ, Portland, OR 97201 USA
[4] Columbia Univ, Med Ctr, New York, NY 10027 USA
[5] NYU, New York, NY 10003 USA
[6] Brooke Army Med Ctr, Ft Sam Houston, TX 78234 USA
[7] Tripler Army Med Ctr, Honolulu, HI 96859 USA
[8] MedStar Washington Hosp Ctr, Washington, DC USA
[9] Univ Kentucky, Lexington, KY 40506 USA
[10] Brown Univ, Warren Alpert Med Sch, Providence, RI 02912 USA
[11] Univ Vermont, Larner Coll Med, Burlington, VT 05405 USA
关键词
Self-supervised learning; contrastive learning; spatio-temporal augmentation; lung ultrasound; DIAGNOSIS; ACCURACY;
D O I
10.1109/ISBI53787.2023.10230816
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Self-supervised learning (SSL) methods have shown promise for medical imaging applications by learning meaningful visual representations, even when the amount of labeled data is limited. Here, we extend state-of-the-art contrastive learning SSL methods to 2D+time medical ultrasound video data by introducing a modified encoder and augmentation method capable of learning meaningful spatio-temporal representations, without requiring constraints on the input data. We evaluate our method on the challenging clinical task of identifying lung consolidations (an important pathological feature) in ultrasound videos. Using a multi-center dataset of over 27k lung ultrasound videos acquired from over 500 patients, we show that our method can significantly improve performance on downstream localization and classification of lung consolidation. Comparisons against baseline models trained without SSL show that the proposed methods are particularly advantageous when the size of labeled training data is limited (e.g., as little as 5% of the training set).
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Contrastive Spatio-Temporal Pretext Learning for Self-Supervised Video Representation
    Zhang, Yujia
    Po, Lai-Man
    Xu, Xuyuan
    Liu, Mengyang
    Wang, Yexin
    Ou, Weifeng
    Zhao, Yuzhi
    Yu, Wing-Yin
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 3380 - 3389
  • [2] Self-supervised Spatio-temporal Representation Learning for Videos by Predicting Motion and Appearance Statistics
    Wang, Jiangliu
    Jiao, Jianbo
    Bao, Linchao
    He, Shengfeng
    Liu, Yunhui
    Liu, Wei
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4001 - 4010
  • [3] Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos
    Shen, Zhiqiang
    Sheng, Xiaoxiao
    Fan, Hehe
    Wang, Longguang
    Guo, Yulan
    Liu, Qiong
    Wen, Hao
    Zhou, Xi
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 16534 - 16543
  • [4] Video Cloze Procedure for Self-Supervised Spatio-Temporal Learning
    Luo, Dezhao
    Liu, Chang
    Zhou, Yu
    Yang, Dongbao
    Ma, Can
    Ye, Qixiang
    Wang, Weiping
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11701 - 11708
  • [5] Spatio-Temporal Self-Supervised Learning for Traffic Flow Prediction
    Ji, Jiahao
    Wang, Jingyuan
    Huang, Chao
    Wu, Junjie
    Xu, Boren
    Wu, Zhenhe
    Zhang, Junbo
    Zheng, Yu
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 4, 2023, : 4356 - 4364
  • [6] Self-Supervised Video Representation Learning by Uncovering Spatio-Temporal Statistics
    Wang, Jiangliu
    Jiao, Jianbo
    Bao, Linchao
    He, Shengfeng
    Liu, Wei
    Liu, Yun-hui
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (07) : 3791 - 3806
  • [7] Video Playback Rate Perception for Self-supervised Spatio-Temporal Representation Learning
    Yao, Yuan
    Liu, Chang
    Luo, Dezhao
    Zhou, Yu
    Ye, Qixiang
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6547 - 6556
  • [8] SELF-SUPERVISED SPATIO-TEMPORAL REPRESENTATION LEARNING OF SATELLITE IMAGE TIME SERIES
    Dumeur, Iris
    Valero, Silvia
    Inglada, Jordi
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 642 - 645
  • [9] Self-Supervised Spatio-Temporal Graph Learning for Point-of-Interest Recommendation
    Liu, Jiawei
    Gao, Haihan
    Shi, Chuan
    Cheng, Hongtao
    Xie, Qianlong
    APPLIED SCIENCES-BASEL, 2023, 13 (15):
  • [10] Joint spatio-temporal features constrained self-supervised electrocardiogram representation learning
    Ran, Ao
    Liu, Huafeng
    BIOMEDICAL ENGINEERING LETTERS, 2024, 14 (02) : 209 - 220