RUN: Rethinking the UNet Architecture for Efficient Image Restoration

被引:2
作者
Wu, Zhijian [1 ]
Li, Jun [2 ]
Xu, Chang [3 ]
Huang, Dingjiang [1 ]
Hoi, Steven C. H. [4 ]
机构
[1] East China Normal Univ, Sch Data Sci & Engn, Shanghai 200050, Peoples R China
[2] Nanjing Normal Univ, Sch Comp & Elect Informat, Nanjing 210098, Peoples R China
[3] Univ Sydney, Fac Engn, Sch Comp Sci, Sydney, NSW 2006, Australia
[4] Singapore Management Univ, Sch Comp & Informat Syst, Singapore 188065, Singapore
基金
中国国家自然科学基金;
关键词
Image restoration; Convolution; Computer architecture; Task analysis; Computational modeling; Computational efficiency; Transformers; Convolution block; heterogeneous operators; image restoration; self-attention; UNet architecture; NETWORK; ATTENTION;
D O I
10.1109/TMM.2024.3407656
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent advanced image restoration (IR) methods typically stack homogeneous operators hierarchically in the UNet architecture. To achieve higher accuracy, these models are now going deeper and more complex, making them resource-intensive. After comprehensively reviewing different operators within modern networks, we provide an in-depth analysis of their individual favorable properties and invent a novel efficient IR network by redesigning the UNet architecture (RUN) with heterogeneous operators. Specifically, we propose three heterogeneous operators for different relational interactions concerning the specificity of different hierarchical features of the UNet architecture. First, the spatial self-attention block (SSA Block) processes high-resolution top-level features by modeling pixel interactions from the spatial dimension. Second, the channel self-attention block (CSA Block) performs channel recalibration and information transmission for the bottom-level features with rich channels. Finally, a simple and efficient convolution block (Conv Block) is used to facilitate middle-order information propagation, which complements the self-attention mechanism to achieve local-global coupling. Based on these designs, our RUN enables more comprehensive information dissemination and interaction regardless of topological distance, thus achieving superior performance while maintaining desirable computational budgets. Extensive experiments show that our RUN achieves state-of-the-art results for a variety of IR tasks, including image deblurring, image denoising, image deraining, and low-light image enhancement.
引用
收藏
页码:10381 / 10394
页数:14
相关论文
共 114 条
[1]   NTIRE 2019 Challenge on Real Image Denoising: Methods and Results [J].
Abdelhamed, Abdelrahman ;
Timofte, Radu ;
Brown, Michael S. ;
Yu, Songhyun ;
Park, Bumjun ;
Jeong, Jechang ;
Jung, Seung-Won ;
Kim, Dong-Wook ;
Chung, Jae-Ryun ;
Liu, Jiaming ;
Wang, Yuzhi ;
Wu, Chi-Hao ;
Xu, Qin ;
Wang, Chuan ;
Cai, Shaofan ;
Ding, Yifan ;
Fan, Haoqiang ;
Wang, Jue ;
Zhang, Kai ;
Zuo, Wangmeng ;
Zhussip, Magauiya ;
Park, Dong Won ;
Soltanayev, Shakarim ;
Chun, Se Young ;
Xiong, Zhiwei ;
Chen, Chang ;
Haris, Muhammad ;
Akita, Kazutoshi ;
Yoshida, Tomoki ;
Shakhnarovich, Greg ;
Ukita, Norimichi ;
Zamir, Syed Waqas ;
Arora, Aditya ;
Khan, Salman ;
Khan, Fahad Shahbaz ;
Shao, Ling ;
Ko, Sung-Jea ;
Lim, Dong-Pan ;
Kim, Seung-Wook ;
Ji, Seo-Won ;
Lee, Sang-Won ;
Tang, Wenyi ;
Fan, Yuchen ;
Zhou, Yuqian ;
Liu, Ding ;
Huang, Thomas S. ;
Meng, Deyu ;
Zhang, Lei ;
Yong, Hongwei ;
Zhao, Yiyun .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, :2197-2210
[2]   A High-Quality Denoising Dataset for Smartphone Cameras [J].
Abdelhamed, Abdelrahman ;
Lin, Stephen ;
Brown, Michael S. .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1692-1700
[3]  
Abuolaim Abdullah, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12355), P111, DOI 10.1007/978-3-030-58607-2_7
[4]   NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study [J].
Agustsson, Eirikur ;
Timofte, Radu .
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :1122-1131
[5]  
Ali A ..., 2021, Adv. Neural Inf. Process. Syst, V34, P20014, DOI DOI 10.48550/ARXIV.2106.09681
[6]   Densely Residual Laplacian Super-Resolution [J].
Anwar, Saeed ;
Barnes, Nick .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (03) :1192-1204
[7]   Real Image Denoising with Feature Attention [J].
Anwar, Saeed ;
Barnes, Nick .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3155-3164
[8]   Contour Detection and Hierarchical Image Segmentation [J].
Arbelaez, Pablo ;
Maire, Michael ;
Fowlkes, Charless ;
Malik, Jitendra .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (05) :898-916
[9]  
Ba Jimmy Lei, 2016, arXiv
[10]  
Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13