Deep Generative Adversarial Reinforcement Learning for Semi-Supervised Segmentation of Low-Contrast and Small Objects in Medical Images

被引:3
作者
Xu, Chenchu [1 ,2 ]
Zhang, Tong [3 ]
Zhang, Dong [4 ]
Zhang, Dingwen [2 ,5 ]
Han, Junwei [2 ,6 ]
机构
[1] Anhui Univ, Sch Comp Sci & Technol, Key Lab Intelligent Comp & Signal Proc, Minist Educ, Hefei 230601, Peoples R China
[2] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei 230601, Peoples R China
[3] Anhui Univ, Sch Comp Sci & Technol, Hefei, Anhui, Peoples R China
[4] Univ British Columbia, Dept Elect & Comp Engn, Vancouver, BC V6T 1Z4, Canada
[5] Fourth Mil Med Univ, Xiing Hosp, Dept Clin Immunol, Xian 710032, Shaanxi, Peoples R China
[6] Northwestern Polytech Univ, Sch Automat, Xian 710129, Peoples R China
基金
中国国家自然科学基金;
关键词
Image segmentation; Task analysis; Biomedical imaging; Generative adversarial networks; Optimization; Training; Reinforcement learning; Medical image segmentation; deep reinforcement learning (DRL); generative adversarial networks (GANs);
D O I
10.1109/TMI.2024.3383716
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Deep reinforcement learning (DRL) has demonstrated impressive performance in medical image segmentation, particularly for low-contrast and small medical objects. However, current DRL-based segmentation methods face limitations due to the optimization of error propagation in two separate stages and the need for a significant amount of labeled data. In this paper, we propose a novel deep generative adversarial reinforcement learning (DGARL) approach that, for the first time, enables end-to-end semi-supervised medical image segmentation in the DRL domain. DGARL ingeniously establishes a pipeline that integrates DRL and generative adversarial networks (GANs) to optimize both detection and segmentation tasks holistically while mutually enhancing each other. Specifically, DGARL introduces two innovative components to facilitate this integration in semi-supervised settings. First, a task-joint GAN with two discriminators links the detection results to the GAN's segmentation performance evaluation, allowing simultaneous joint evaluation and feedback. This ensures that DRL and GAN can be directly optimized based on each other's results. Second, a bidirectional exploration DRL integrates backward exploration and forward exploration to ensure the DRL agent explores the correct direction when forward exploration is disabled due to lack of explicit rewards. This mitigates the issue of unlabeled data being unable to provide rewards and rendering DRL unexplorable. Comprehensive experiments on three generalization datasets, comprising a total of 640 patients, demonstrate that our novel DGARL achieves 85.02% Dice and improves at least 1.91% for brain tumors, achieves 73.18% Dice and improves at least 4.28% for liver tumors, and achieves 70.85% Dice and improves at least 2.73% for pancreas compared to the ten most recent advanced methods, our results attest to the superiority of DGARL. Code is available at GitHub.
引用
收藏
页码:3072 / 3084
页数:13
相关论文
共 55 条
  • [1] A survey of inverse reinforcement learning: Challenges, methods and progress
    Arora, Saurabh
    Doshi, Prashant
    [J]. ARTIFICIAL INTELLIGENCE, 2021, 297 (297)
  • [2] Auer P., 2003, Journal of Machine Learning Research, V3, P397, DOI 10.1162/153244303321897663
  • [3] Bidirectional Copy-Paste for Semi-Supervised Medical Image Segmentation
    Bai, Yunhao
    Chen, Duowen
    Li, Qingli
    Shen, Wei
    Wang, Yan
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11514 - 11524
  • [4] Pseudo-label Guided Contrastive Learning for Semi-supervised Medical Image Segmentation
    Basak, Hritam
    Yin, Zhaozheng
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19786 - 19797
  • [5] Casanova A, 2020, Arxiv, DOI arXiv:2002.06583
  • [6] Combined Spiral Transformation and Model-Driven Multi-Modal Deep Learning Scheme for Automatic Prediction of TP53 Mutation in Pancreatic Cancer
    Chen, Xiahan
    Lin, Xiaozhu
    Shen, Qing
    Qian, Xiaohua
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (02) : 735 - 747
  • [7] Retrieval of Brain Tumors by Adaptive Spatial Pooling and Fisher Vector Representation
    Cheng, Jun
    Yang, Wei
    Huang, Meiyan
    Huang, Wei
    Jiang, Jun
    Zhou, Yujia
    Yang, Ru
    Zhao, Jie
    Feng, Yanqiu
    Feng, Qianjin
    Chen, Wufan
    [J]. PLOS ONE, 2016, 11 (06):
  • [8] Enhanced Performance of Brain Tumor Classification via Tumor Region Augmentation and Partition
    Cheng, Jun
    Huang, Wei
    Cao, Shuangliang
    Yang, Ru
    Yang, Wei
    Yun, Zhaoqiang
    Wang, Zhijian
    Feng, Qianjin
    [J]. PLOS ONE, 2015, 10 (10):
  • [9] Semi-supervised Brain Lesion Segmentation with an Adapted Mean Teacher Model
    Cui, Wenhui
    Liu, Yanlin
    Li, Yuxing
    Guo, Menghao
    Li, Yiming
    Li, Xiuli
    Wang, Tianle
    Zeng, Xiangzhu
    Ye, Chuyang
    [J]. INFORMATION PROCESSING IN MEDICAL IMAGING, IPMI 2019, 2019, 11492 : 554 - 565
  • [10] Diaz I, 2011, IEEE ENG MED BIO, P3934, DOI 10.1109/IEMBS.2011.6090977