Cross-Modality Segmentation by Self-supervised Semantic Alignment in Disentangled Content Space

被引:0
|
作者
Yang, Junlin [1 ]
Li, Xiaoxiao [1 ]
Pak, Daniel [1 ]
Dvornek, Nicha C. [3 ]
Chapiro, Julius [3 ]
Lin, MingDe [3 ]
Duncan, James S. [1 ,2 ,3 ,4 ]
机构
[1] Yale Univ, Dept Biomed Engn, New Haven, CT 06511 USA
[2] Yale Univ, Dept Elect Engn, New Haven, CT USA
[3] Yale Sch Med, Dept Radiol & Biomed Imaging, New Haven, CT USA
[4] Yale Univ, Dept Stat & Data Sci, New Haven, CT USA
关键词
Cross modality; Self supervision; Domain adaptation;
D O I
10.1007/978-3-030-60548-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep convolutional networks have demonstrated state-of-the-art performance in a variety of medical image tasks, including segmentation. Taking advantage of images from different modalities has great clinical benefits. However, the generalization ability of deep networks on different modalities is challenging due to domain shift. In this work, we investigate the challenging unsupervised domain adaptation problem of cross-modality medical image segmentation. Cross-modality domain shift can be viewed as having two orthogonal components: appearance (modality) shift and content (anatomy) shift. Previous works using the popular adversarial training strategy emphasize the significant appearance/modality alignment caused by different physical principles while ignoring the content/anatomy alignment, which can be harmful for the downstream segmentation task. Here, we design a cross-modality segmentation pipeline, where self-supervision is introduced to achieve further semantic alignment specifically on the disentangled content space. In the self-supervision branch, in addition to rotation prediction, we also propose elastic transformation prediction as a new pretext task. We validate our model on cross-modality liver segmentation from CT to MR. Both quantitative and qualitative experimental results demonstrate that further semantic alignment through self-supervision can improve segmentation performance significantly, making the learned model more robust.
引用
收藏
页码:52 / 61
页数:10
相关论文
共 50 条
  • [1] Contrastive Image Synthesis and Self-supervised Feature Adaptation for Cross-Modality Biomedical Image Segmentation
    Hu, Xinrong
    Wang, Corey
    Shi, Yiyu
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2329 - 2338
  • [2] Self-supervised Feature Learning by Cross-modality and Cross-view Correspondences
    Jing, Longlong
    Zhang, Ling
    Tian, Yingli
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1581 - 1591
  • [3] Towards Self-supervised Face Labeling via Cross-modality Association
    Lu, Chris Xiaoxuan
    Kan, Xuan
    Rosa, Stefano
    Du, Bowen
    Wen, Hongkai
    Markham, Andrew
    Trigoni, Niki
    PROCEEDINGS OF THE 15TH ACM CONFERENCE ON EMBEDDED NETWORKED SENSOR SYSTEMS (SENSYS'17), 2017,
  • [4] Self-supervised Contrastive Cross-Modality Representation Learning for Spoken Question Answering
    You, Chenyu
    Chen, Nuo
    Zou, Yuexian
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 28 - 39
  • [5] Weakly supervised segmentation with cross-modality equivariant constraints
    Patel, Gaurav
    Dolz, Jose
    MEDICAL IMAGE ANALYSIS, 2022, 77
  • [6] Self-Supervised Bird's Eye View Motion Prediction with Cross-Modality Signals
    Fang, Shaoheng
    Liu, Zuhong
    Wang, Mingyu
    Xu, Chenxin
    Zhong, Yiqi
    Chen, Siheng
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 1726 - 1734
  • [7] Learning disentangled representation for self-supervised video object segmentation
    Hou, Wenjie
    Qin, Zheyun
    Xi, Xiaoming
    Lu, Xiankai
    Yin, Yilong
    NEUROCOMPUTING, 2022, 481 : 270 - 280
  • [8] Learning disentangled representation for self-supervised video object segmentation
    Hou, Wenjie
    Qin, Zheyun
    Xi, Xiaoming
    Lu, Xiankai
    Yin, Yilong
    Neurocomputing, 2022, 481 : 270 - 280
  • [9] Self-supervised vision transformers for semantic segmentation
    Gu, Xianfan
    Hu, Yingdong
    Wen, Chuan
    Gao, Yang
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2025, 251
  • [10] Cross-modality and self-supervised protein embedding for compound-protein affinity and contact prediction
    You, Yuning
    Shen, Yang
    BIOINFORMATICS, 2022, 38 : ii68 - ii74