Self-Supervised Medical Image Segmentation Using Deep Reinforced Adaptive Masking

被引:4
作者
Xu, Zhenghua [1 ]
Liu, Yunxin [1 ]
Xu, Gang [2 ]
Lukasiewicz, Thomas [3 ,4 ]
机构
[1] Hebei Univ Technol, Sch Hlth Sci & Biomed Engn, State Key Lab Reliabil & Intelligence Elect Equipm, Tianjin 300401, Peoples R China
[2] Hebei Univ Technol, Sch Artificial Intelligence, Tianjin 300401, Peoples R China
[3] Univ Oxford, Dept Comp Sci, Oxford OX1 3QG, England
[4] Vienna Univ Technol, Inst Log & Computat, A-1040 Vienna, Austria
基金
中国国家自然科学基金;
关键词
Biomedical imaging; Image reconstruction; Image segmentation; Task analysis; Adaptation models; Self-supervised learning; Training; medical image segmentation; adaptive image masking; deep reinforcement learning;
D O I
10.1109/TMI.2024.3436608
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Self-supervised learning aims to learn transferable representations from unlabeled data for downstream tasks. Inspired by masked language modeling in natural language processing, masked image modeling (MIM) has achieved certain success in the field of computer vision, but its effectiveness in medical images remains unsatisfactory. This is mainly due to the high redundancy and small discriminative regions in medical images compared to natural images. Therefore, this paper proposes an adaptive hard masking (AHM) approach based on deep reinforcement learning to expand the application of MIM in medical images. Unlike predefined random masks, AHM uses an asynchronous advantage actor-critic (A3C) model to predict reconstruction loss for each patch, enabling the model to learn where masking is valuable. By optimizing the non-differentiable sampling process using reinforcement learning, AHM enhances the understanding of key regions, thereby improving downstream task performance. Experimental results on two medical image datasets demonstrate that AHM outperforms state-of-the-art methods. Additional experiments under various settings validate the effectiveness of AHM in constructing masked images.
引用
收藏
页码:180 / 193
页数:14
相关论文
共 42 条
[1]  
Bakas S, 2019, Arxiv, DOI arXiv:1811.02629
[2]   Data Descriptor: Advancing The Cancer Genome Atlas glioma MRI collections with expert segmentation labels and radiomic features [J].
Bakas, Spyridon ;
Akbari, Hamed ;
Sotiras, Aristeidis ;
Bilello, Michel ;
Rozycki, Martin ;
Kirby, Justin S. ;
Freymann, John B. ;
Farahani, Keyvan ;
Davatzikos, Christos .
SCIENTIFIC DATA, 2017, 4
[3]  
Bao H., 2021, arXiv, DOI DOI 10.48550/ARXIV.2106.08254
[4]  
Brown TB, 2020, ADV NEUR IN, V33
[5]   Association of genomic subtypes of lower-grade gliomas with shape features automatically extracted by a deep learning algorithm [J].
Buda, Mateusz ;
Saha, Ashirbani ;
Mazurowski, Maciej A. .
COMPUTERS IN BIOLOGY AND MEDICINE, 2019, 109 :218-225
[6]  
Caron M, 2020, ADV NEUR IN, V33
[7]  
Chen T, 2020, PR MACH LEARN RES, V119
[8]   Boundary IoU: Improving Object-Centric Image Segmentation Evaluation [J].
Cheng, Bowen ;
Girshick, Ross ;
Dollar, Piotr ;
Berg, Alexander C. ;
Kirillov, Alexander .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :15329-15337
[9]   FEEDBACK ON A PUBLICLY DISTRIBUTED IMAGE DATABASE: THE MESSIDOR DATABASE [J].
Decenciere, Etienne ;
Zhang, Xiwei ;
Cazuguel, Guy ;
Lay, Bruno ;
Cochener, Beatrice ;
Trone, Caroline ;
Gain, Philippe ;
Ordonez-Varela, John-Richard ;
Massin, Pascale ;
Erginay, Ali ;
Charton, Beatrice ;
Klein, Jean-Claude .
IMAGE ANALYSIS & STEREOLOGY, 2014, 33 (03) :231-234
[10]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171