Deep Generative Adversarial Reinforcement Learning for Semi-Supervised Segmentation of Low-Contrast and Small Objects in Medical Images

被引：3

作者：

Xu, Chenchu ^{[1
,2
]}

Zhang, Tong ^{[3
]}

Zhang, Dong ^{[4
]}

Zhang, Dingwen ^{[2
,5
]}

Han, Junwei ^{[2
,6
]}

机构：

[1] Anhui Univ, Sch Comp Sci & Technol, Key Lab Intelligent Comp & Signal Proc, Minist Educ, Hefei 230601, Peoples R China

[2] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei 230601, Peoples R China

[3] Anhui Univ, Sch Comp Sci & Technol, Hefei, Anhui, Peoples R China

[4] Univ British Columbia, Dept Elect & Comp Engn, Vancouver, BC V6T 1Z4, Canada

[5] Fourth Mil Med Univ, Xiing Hosp, Dept Clin Immunol, Xian 710032, Shaanxi, Peoples R China

[6] Northwestern Polytech Univ, Sch Automat, Xian 710129, Peoples R China

来源：

IEEE TRANSACTIONS ON MEDICAL IMAGING | 2024年 / 43卷 / 09期

基金：

中国国家自然科学基金;

关键词：

Image segmentation; Task analysis; Biomedical imaging; Generative adversarial networks; Optimization; Training; Reinforcement learning; Medical image segmentation; deep reinforcement learning (DRL); generative adversarial networks (GANs);

D O I：

10.1109/TMI.2024.3383716

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Deep reinforcement learning (DRL) has demonstrated impressive performance in medical image segmentation, particularly for low-contrast and small medical objects. However, current DRL-based segmentation methods face limitations due to the optimization of error propagation in two separate stages and the need for a significant amount of labeled data. In this paper, we propose a novel deep generative adversarial reinforcement learning (DGARL) approach that, for the first time, enables end-to-end semi-supervised medical image segmentation in the DRL domain. DGARL ingeniously establishes a pipeline that integrates DRL and generative adversarial networks (GANs) to optimize both detection and segmentation tasks holistically while mutually enhancing each other. Specifically, DGARL introduces two innovative components to facilitate this integration in semi-supervised settings. First, a task-joint GAN with two discriminators links the detection results to the GAN's segmentation performance evaluation, allowing simultaneous joint evaluation and feedback. This ensures that DRL and GAN can be directly optimized based on each other's results. Second, a bidirectional exploration DRL integrates backward exploration and forward exploration to ensure the DRL agent explores the correct direction when forward exploration is disabled due to lack of explicit rewards. This mitigates the issue of unlabeled data being unable to provide rewards and rendering DRL unexplorable. Comprehensive experiments on three generalization datasets, comprising a total of 640 patients, demonstrate that our novel DGARL achieves 85.02% Dice and improves at least 1.91% for brain tumors, achieves 73.18% Dice and improves at least 4.28% for liver tumors, and achieves 70.85% Dice and improves at least 2.73% for pancreas compared to the ten most recent advanced methods, our results attest to the superiority of DGARL. Code is available at GitHub.

引用

页码：3072 / 3084

页数：13

共 55 条

[1] A survey of inverse reinforcement learning: Challenges, methods and progress
Arora, Saurabh
Doshi, Prashant
[J]. ARTIFICIAL INTELLIGENCE, 2021, 297 (297)
[2] Auer P., 2003, Journal of Machine Learning Research, V3, P397, DOI 10.1162/153244303321897663
[3] Bidirectional Copy-Paste for Semi-Supervised Medical Image Segmentation
Bai, Yunhao
Chen, Duowen
Li, Qingli
Shen, Wei
Wang, Yan
[J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11514 - 11524
[4] Pseudo-label Guided Contrastive Learning for Semi-supervised Medical Image Segmentation
Basak, Hritam
Yin, Zhaozheng
[J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19786 - 19797
[5] Casanova A, 2020, Arxiv, DOI arXiv:2002.06583
[6] Combined Spiral Transformation and Model-Driven Multi-Modal Deep Learning Scheme for Automatic Prediction of TP53 Mutation in Pancreatic Cancer
Chen, Xiahan
Lin, Xiaozhu
Shen, Qing
Qian, Xiaohua
[J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (02) : 735 - 747
[7] Retrieval of Brain Tumors by Adaptive Spatial Pooling and Fisher Vector Representation
Cheng, Jun
Yang, Wei
Huang, Meiyan
Huang, Wei
Jiang, Jun
Zhou, Yujia
Yang, Ru
Zhao, Jie
Feng, Yanqiu
Feng, Qianjin
Chen, Wufan
[J]. PLOS ONE, 2016, 11 (06):
[8] Enhanced Performance of Brain Tumor Classification via Tumor Region Augmentation and Partition
Cheng, Jun
Huang, Wei
Cao, Shuangliang
Yang, Ru
Yang, Wei
Yun, Zhaoqiang
Wang, Zhijian
Feng, Qianjin
[J]. PLOS ONE, 2015, 10 (10):
[9] Semi-supervised Brain Lesion Segmentation with an Adapted Mean Teacher Model
Cui, Wenhui
Liu, Yanlin
Li, Yuxing
Guo, Menghao
Li, Yiming
Li, Xiuli
Wang, Tianle
Zeng, Xiangzhu
Ye, Chuyang
[J]. INFORMATION PROCESSING IN MEDICAL IMAGING, IPMI 2019, 2019, 11492 : 554 - 565
[10] Diaz I, 2011, IEEE ENG MED BIO, P3934, DOI 10.1109/IEMBS.2011.6090977

← 1 2 3 4 5 6 →