Region Attention Transformer for Medical Image Restoration

被引:0
作者
Yang, Zhiwen [1 ]
Chen, Haowei [1 ]
Qian, Ziniu [1 ]
Zhou, Yang [1 ]
Zhang, Hui [2 ]
Zhao, Dan [3 ]
Wei, Bingzheng [4 ]
Xu, Yan [1 ]
机构
[1] Beihang Univ, Sch Biol Sci & Med Engn, State Key Lab Software Dev Environm, Key Lab Biomechan & Mech,Minist Educ,Beijing Adv, Beijing 100191, Peoples R China
[2] Tsinghua Univ, Dept Biomed Engn, Beijing 100084, Peoples R China
[3] Chinese Acad Med Sci & Peking Union Med Coll, Dept Gynecol Oncol, Natl Canc Ctr, Natl Clin Res Ctr Canc,Canc Hosp, Beijing 100021, Peoples R China
[4] ByteDance Inc, Beijing 100098, Peoples R China
来源
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT VII | 2024年 / 15007卷
关键词
Medical Image Restoration; Segment Anything Model; Region Attention; Focal Region Loss; Transformer; LOW-DOSE CT;
D O I
10.1007/978-3-031-72104-5_58
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transformer-based methods have demonstrated impressive results in medical image restoration, attributed to the multi-head self-attention (MSA) mechanism in the spatial dimension. However, the majority of existing Transformers conduct attention within fixed and coarsely partitioned regions (e.g. the entire image or fixed patches), resulting in interference from irrelevant regions and fragmentation of continuous image content. To overcome these challenges, we introduce a novel Region Attention Transformer (RAT) that utilizes a region-based multi-head self-attention mechanism (R-MSA). The R-MSA dynamically partitions the input image into non-overlapping semantic regions using the robust Segment Anything Model (SAM) and then performs self-attention within these regions. This region partitioning is more flexible and interpretable, ensuring that only pixels from similar semantic regions complement each other, thereby eliminating interference from irrelevant regions. Moreover, we introduce a focal region loss to guide our model to adaptively focus on recovering high-difficulty regions. Extensive experiments demonstrate the effectiveness of RAT in various medical image restoration tasks, including PET image synthesis, CT image denoising, and pathological image super-resolution. Code is available at https://github.com/RAT.
引用
收藏
页码:603 / 613
页数:11
相关论文
共 31 条
[1]  
Chan Chung, 2018, 2018 IEEE NUCL SCI S
[2]   Low-Dose CT With a Residual Encoder-Decoder Convolutional Neural Network [J].
Chen, Hu ;
Zhang, Yi ;
Kalra, Mannudeep K. ;
Lin, Feng ;
Chen, Yang ;
Liao, Peixi ;
Zhou, Jiliu ;
Wang, Ge .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2017, 36 (12) :2524-2535
[3]  
Chen H, 2017, I S BIOMED IMAGING, P143
[4]   Simple Baselines for Image Restoration [J].
Chen, Liangyu ;
Chu, Xiaojie ;
Zhang, Xiangyu ;
Sun, Jian .
COMPUTER VISION, ECCV 2022, PT VII, 2022, 13667 :17-33
[5]  
Chen YH, 2018, I S BIOMED IMAGING, P739
[6]   Image Super-Resolution Using Deep Convolutional Networks [J].
Dong, Chao ;
Loy, Chen Change ;
He, Kaiming ;
Tang, Xiaoou .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (02) :295-307
[7]   Highly aligned stromal collagen is a negative prognostic factor following pancreatic ductal adenocarcinoma resection [J].
Drifka, Cole R. ;
Loeffler, Agnes G. ;
Mathewson, Kara ;
Keikhosravi, Adib ;
Eickhoff, Jens C. ;
Liu, Yuming ;
Weber, Sharon M. ;
Kao, W. John ;
Eliceiri, Kevin W. .
ONCOTARGET, 2016, 7 (46) :76197-76213
[8]   ACCELERATED IMAGE-RECONSTRUCTION USING ORDERED SUBSETS OF PROJECTION DATA [J].
HUDSON, HM ;
LARKIN, RS .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 1994, 13 (04) :601-609
[9]   Spach Transformer: Spatial and Channel-Wise Transformer Based on Local and Global Self-Attentions for PET Image Denoising [J].
Jang, Se-In ;
Pan, Tinsu ;
Li, Ye ;
Heidari, Pedram ;
Chen, Junyu ;
Li, Quanzheng ;
Gong, Kuang .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2024, 43 (06) :2036-2049
[10]  
Kirillov A, 2023, Arxiv, DOI [arXiv:2304.02643, 10.48550/arXiv.2304.02643]