Cross-Scale KNN Image Transformer for Image Restoration

被引:3
|
作者
Lee, Hunsang [1 ]
Choi, Hyesong [2 ]
Sohn, Kwanghoon [1 ]
Min, Dongbo [2 ]
机构
[1] Yonsei Univ, Sch Elect & Elect Engn, Seoul, South Korea
[2] Ewha Womans Univ, Dept Comp Sci & Engn, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Image restoration; Transformers; Noise reduction; Complexity theory; Computer vision; Convolutional neural networks; Feature extraction; denoising; deblurring; deraining; transformer; self-attention; k-nn search; low-level vision; ALGORITHMS; NETWORK;
D O I
10.1109/ACCESS.2023.3242556
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Numerous image restoration approaches have been proposed based on attention mechanism, achieving superior performance to convolutional neural networks (CNNs) based counterparts. However, they do not leverage the attention model in a form fully suited to the image restoration tasks. In this paper, we propose an image restoration network with a novel attention mechanism, called cross-scale $k$ -NN image Transformer (CS-KiT), that effectively considers several factors such as locality, non-locality, and cross-scale aggregation, which are essential to image restoration. To achieve locality and non-locality, the CS-KiT builds $k$ -nearest neighbor relation of local patches and aggregates similar patches through local attention. To induce cross-scale aggregation, we ensure that each local patch embraces different scale information with scale-aware patch embedding (SPE) which predicts an input patch scale through a combination of multi-scale convolution branches. We show the effectiveness of the CS-KiT with experimental results, outperforming state-of-the-art restoration approaches on image denoising, deblurring, and deraining benchmarks.
引用
收藏
页码:13013 / 13027
页数:15
相关论文
共 50 条
  • [41] ELMformer: Efficient Raw Image Restoration with a Locally Multiplicative Transformer
    Ma, Jiaqi
    Yan, Shengyuan
    Zhang, Lefei
    Wang, Guoli
    Zhang, Qian
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5842 - 5852
  • [42] Region Attention Transformer for Medical Image Restoration
    Yang, Zhiwen
    Chen, Haowei
    Qian, Ziniu
    Zhou, Yang
    Zhang, Hui
    Zhao, Dan
    Wei, Bingzheng
    Xu, Yan
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT VII, 2024, 15007 : 603 - 613
  • [43] Hyperspectral Image Super-Resolution Network Based on Cross-Scale Nonlocal Attention
    Li, Shuangliang
    Tian, Yugang
    Wang, Cheng
    Wu, Hongxian
    Zheng, Shaolan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [44] A dual encoder LDCT image denoising model based on cross-scale skip connections☆
    Wang, Lifang
    Wang, Yali
    Ren, Wenjing
    Yu, Jing
    Chang, Xiaoyan
    Guo, Xiaodong
    Hu, Lihua
    NEUROCOMPUTING, 2025, 613
  • [45] RFormer: Transformer-Based Generative Adversarial Network for Real Fundus Image Restoration on a New Clinical Benchmark
    Deng, Zhuo
    Cai, Yuanhao
    Chen, Lu
    Gong, Zheng
    Bao, Qiqi
    Yao, Xue
    Fang, Dong
    Yang, Wenming
    Zhang, Shaochong
    Ma, Lan
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (09) : 4645 - 4655
  • [46] Reffusion: Enhancement Conditional Diffusion Framework with Dual Domain Interaction Transformer for image restoration
    Xie, Dirui
    Hu, Xiaofang
    Xiao, He
    Zhou, Yue
    Duan, Shukai
    KNOWLEDGE-BASED SYSTEMS, 2025, 311
  • [47] GridFormer: Residual Dense Transformer with Grid Structure for Image Restoration in Adverse Weather Conditions
    Wang, Tao
    Zhang, Kaihao
    Shao, Ziqian
    Luo, Wenhan
    Stenger, Bjorn
    Lu, Tong
    Kim, Tae-Kyun
    Liu, Wei
    Li, Hongdong
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (10) : 4541 - 4563
  • [48] Prompt-guided and degradation prior supervised transformer for adverse weather image restoration
    Liu, Weihan
    Shao, Mingwen
    Meng, Lingzhuang
    Qiao, Yuanjian
    Bao, Zhiyuan
    APPLIED INTELLIGENCE, 2025, 55 (02)
  • [49] Prior Knowledge-Guided Transformer for Remote Sensing Image Captioning
    Meng, Lingwu
    Wang, Jing
    Yang, Yang
    Xiao, Liang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 : 1 - 13
  • [50] Interactive Concept Network Enhanced Transformer for Remote Sensing Image Captioning
    Zhang, Cheng
    Ren, Zhongle
    Hou, Biao
    Meng, Jianhua
    Li, Weibin
    Jiao, Licheng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63