Cross-Modality Segmentation by Self-supervised Semantic Alignment in Disentangled Content Space

被引:0
|
作者
Yang, Junlin [1 ]
Li, Xiaoxiao [1 ]
Pak, Daniel [1 ]
Dvornek, Nicha C. [3 ]
Chapiro, Julius [3 ]
Lin, MingDe [3 ]
Duncan, James S. [1 ,2 ,3 ,4 ]
机构
[1] Yale Univ, Dept Biomed Engn, New Haven, CT 06511 USA
[2] Yale Univ, Dept Elect Engn, New Haven, CT USA
[3] Yale Sch Med, Dept Radiol & Biomed Imaging, New Haven, CT USA
[4] Yale Univ, Dept Stat & Data Sci, New Haven, CT USA
关键词
Cross modality; Self supervision; Domain adaptation;
D O I
10.1007/978-3-030-60548-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep convolutional networks have demonstrated state-of-the-art performance in a variety of medical image tasks, including segmentation. Taking advantage of images from different modalities has great clinical benefits. However, the generalization ability of deep networks on different modalities is challenging due to domain shift. In this work, we investigate the challenging unsupervised domain adaptation problem of cross-modality medical image segmentation. Cross-modality domain shift can be viewed as having two orthogonal components: appearance (modality) shift and content (anatomy) shift. Previous works using the popular adversarial training strategy emphasize the significant appearance/modality alignment caused by different physical principles while ignoring the content/anatomy alignment, which can be harmful for the downstream segmentation task. Here, we design a cross-modality segmentation pipeline, where self-supervision is introduced to achieve further semantic alignment specifically on the disentangled content space. In the self-supervision branch, in addition to rotation prediction, we also propose elastic transformation prediction as a new pretext task. We validate our model on cross-modality liver segmentation from CT to MR. Both quantitative and qualitative experimental results demonstrate that further semantic alignment through self-supervision can improve segmentation performance significantly, making the learned model more robust.
引用
收藏
页码:52 / 61
页数:10
相关论文
共 50 条
  • [31] ThreeWays to Improve Semantic Segmentation with Self-Supervised Depth Estimation
    Hoyer, Lukas
    Dai, Dengxin
    Chen, Yuhua
    Koring, Adrian
    Saha, Suman
    Van Gool, Luc
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 11125 - 11135
  • [32] Semantic Consistent Unsupervised Domain Adaptation for Cross-Modality Medical Image Segmentation
    Zeng, Guodong
    Lerch, Till D.
    Schmaranzer, Florian
    Zheng, Guoyan
    Burger, Juergen
    Gerber, Kate
    Tannast, Moritz
    Siebenrock, Klaus
    Gerber, Nicolas
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 201 - 210
  • [33] CMANet: Cross-Modality Attention Network for Indoor-Scene Semantic Segmentation
    Zhu, Longze
    Kang, Zhizhong
    Zhou, Mei
    Yang, Xi
    Wang, Zhen
    Cao, Zhen
    Ye, Chenming
    SENSORS, 2022, 22 (21)
  • [34] Rethinking Self-Supervised Semantic Segmentation: Achieving End-to-End Segmentation
    Liu, Yue
    Zeng, Jun
    Tao, Xingzhen
    Fang, Gang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10036 - 10046
  • [35] XMP-Font: Self-Supervised Cross-Modality Pre-training for Few-Shot Font Generation
    Liu, Wei
    Liu, Fangyue
    Ding, Fei
    He, Qian
    Yi, Zili
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7895 - 7904
  • [36] MSSA: Multispectral Semantic Alignment for Cross-Modality Infrared-RGB Person Reidentification
    Chen, Qingshan
    Zhang, Moyan
    Quan, Zhenzhen
    Zhang, Yumeng
    Mozerov, Mikhail G.
    Zhai, Chao
    Li, Hongjuan
    Li, Yujun
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024,
  • [37] Self-supervised Image-specific Prototype Exploration for Weakly Supervised Semantic Segmentation
    Chen, Qi
    Yang, Lingxiao
    Lai, Jianhuang
    Xie, Xiaohua
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4278 - 4288
  • [38] Liver Segmentation via Learning Cross-Modality Content-Aware Representation
    Lin, Xingxiao
    Ji, Zexuan
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XIII, 2024, 14437 : 198 - 208
  • [39] FogAdapt: Self-supervised domain adaptation for semantic segmentation of foggy images
    Iqbal, Javed
    Hafiz, Rehan
    Ali, Mohsen
    NEUROCOMPUTING, 2022, 501 : 844 - 856
  • [40] Performance Prediction for Semantic Segmentation by a Self-Supervised Image Reconstruction Decoder
    Baer, Andreas
    Klingner, Marvin
    Loehdefink, Jonas
    Hueger, Fabian
    Schlicht, Peter
    Fingscheidt, Tim
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4398 - 4407