Structure-aware World Model for Probe Guidance via Large-scale Self-supervised Pre-train

被引:0
|
作者
Jiang, Haojun [1 ,2 ]
Li, Meng [2 ]
Sun, Zhenguo [2 ]
Jia, Ning [2 ]
Sun, Yu [2 ]
Luo, Shaqi [2 ]
Song, Shiji [1 ]
Huang, Gao [1 ,2 ]
机构
[1] Tsinghua Univ, Dept Automat, BNRist, Beijing, Peoples R China
[2] Beijing Acad Artificial Intelligence, Beijing, Peoples R China
来源
SIMPLIFYING MEDICAL ULTRASOUND, ASMUS 2024 | 2025年 / 15186卷
基金
国家重点研发计划;
关键词
Echocardiography; World Model; Structural Understanding; Self-supervised Pre-train; Probe Guidance;
D O I
10.1007/978-3-031-73647-6_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The complex structure of the heart leads to significant challenges in echocardiography, especially in acquisition cardiac ultrasound images. Successful echocardiography requires a thorough understanding of the structures on the two-dimensional plane and the spatial relationships between planes in three-dimensional space. In this paper, we innovatively propose a large-scale self-supervised pre-training method to acquire a cardiac structure-aware world model. The core innovation lies in constructing a self-supervised task that requires structural inference by predicting masked structures on a 2D plane and imagining another plane based on pose transformation in 3D space. To support large-scale pre-training, we collected over 1.36 million echocardiograms from ten standard views, along with their 3D spatial poses. In the downstream probe guidance task, we demonstrate that our pre-trained model consistently reduces guidance errors across the ten most common standard views on the test set with 0.29 million samples from 74 routine clinical scans, indicating that structure-aware pre-training benefits the scanning.
引用
收藏
页码:58 / 67
页数:10
相关论文
共 50 条
  • [1] Foundation Model for Endoscopy Video Analysis via Large-Scale Self-supervised Pre-train
    Wang, Zhao
    Liu, Chang
    Zhang, Shaoting
    Dou, Qi
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IX, 2023, 14228 : 101 - 111
  • [2] Structure-aware protein self-supervised learning
    Chen, Can
    Zhou, Jingbo
    Wang, Fan
    Liu, Xue
    Dou, Dejing
    BIOINFORMATICS, 2023, 39 (04)
  • [3] Self-supervised Learning for Large-scale Item Recommendations
    Yao, Tiansheng
    Yi, Xinyang
    Cheng, Derek Zhiyuan
    Yu, Felix
    Chen, Ting
    Menon, Aditya
    Hong, Lichan
    Chi, Ed H.
    Tjoa, Steve
    Kang, Jieqi
    Ettinger, Evan
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 4321 - 4330
  • [4] Large-Scale Self-Supervised Human Activity Recognition
    Zadeh, Mohammad Zaki
    Jaiswal, Ashish
    Pavel, Hamza Reza
    Hebri, Aref
    Kapoor, Rithik
    Makedon, Fillia
    PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS, PETRA 2022, 2022, : 298 - 299
  • [5] Self-Supervised Pretraining for Large-Scale Point Clouds
    Zhang, Zaiwei
    Bai, Min
    Li, Erran
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [6] Rethinking graph anomaly detection: A self-supervised Group Discrimination paradigm with Structure-Aware
    Yan, Junyi
    Zuo, Enguang
    Chen, Chen
    Chen, Cheng
    Zhong, Jie
    Li, Tianle
    Lv, Xiaoyi
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2735 - 2740
  • [7] Self-Supervised Graph Transformer on Large-Scale Molecular Data
    Rong, Yu
    Bian, Yatao
    Xu, Tingyang
    Xie, Weiyang
    Wei, Ying
    Huang, Wenbing
    Huang, Junzhou
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [8] Self-supervised contrastive representation learning for large-scale trajectories
    Li, Shuzhe
    Chen, Wei
    Yan, Bingqi
    Li, Zhen
    Zhu, Shunzhi
    Yu, Yanwei
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 148 : 357 - 366
  • [9] Automated Large-Scale Cell Annotation with Self-Supervised Learning
    Tang, Yuan Xi
    Huan, Le
    Xia, Can
    Lin, Fulai
    Zhao, Yundi
    JOURNAL OF THE AMERICAN COLLEGE OF SURGEONS, 2024, 239 (05) : S182 - S182
  • [10] WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
    Chen, Sanyuan
    Wang, Chengyi
    Chen, Zhengyang
    Wu, Yu
    Liu, Shujie
    Chen, Zhuo
    Li, Jinyu
    Kanda, Naoyuki
    Yoshioka, Takuya
    Xiao, Xiong
    Wu, Jian
    Zhou, Long
    Ren, Shuo
    Qian, Yanmin
    Qian, Yao
    Zeng, Michael
    Yu, Xiangzhan
    Wei, Furu
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (06) : 1505 - 1518