RVSRT: Real-time Video Super Resolution Transformer

被引:0
|
作者
Ou, Linlin [1 ,2 ]
Chen, Yuanping [2 ]
机构
[1] Chinese Acad Sci, Comp Network Informat Ctr, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
来源
FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022 | 2022年 / 12705卷
关键词
Video super resolution; vision transformer; deep learning;
D O I
10.1117/12.2680156
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video super-resolution is the task of converting low-resolution video to high-resolution video. Existing methods with better intuitive effects are mainly based on convolutional neural networks (CNNs), but the architecture is heavy, resulting in a slow inference structure. Aiming at this problem, this paper proposes a real-time video super-resolution Transformer (RVSRT) can quickly complete the super-resolution task while considering the visual fluency of video frame switching. Unlike traditional methods based on CNNs, this paper does not process video frames separately with different network modules in the temporal domain, but batches adjacent frames through a single UNet-style structure end-to-end Transformer network architecture. Moreover, this paper creatively sets up two-stage interpolation sampling before and after the end-to-end network to maximize the performance of the traditional CV algorithm. The experimental results show that compared with SOTA TMNet [1], RVSRT has only 20% of the network size (2.3M vs 12.3M, parameters) while ensuring comparable performance, and the speed is increased by 80% (26.2 fps vs 14.3 fps, frame size is 720*576).
引用
收藏
页数:5
相关论文
共 50 条
  • [21] 3RE-Net: Joint Loss-REcovery and Super-REsolution Neural Network for REal-Time Video
    Ge, Liming
    Jiang, David Zhaochen
    Bao, Wei
    ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2023, PT I, 2024, 14471 : 165 - 177
  • [22] Real-Time Environment Monitoring Using a Lightweight Image Super-Resolution Network
    Yu, Qiang
    Liu, Feiqiang
    Xiao, Long
    Liu, Zitao
    Yang, Xiaomin
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (11)
  • [23] Neural Super-Resolution in Real-Time Rendering Using Auxiliary Feature Enhancement
    Zhong, Zhihua
    Chen, Guanlin
    Wang, Rui
    Huo, Yuchi
    JOURNAL OF DATABASE MANAGEMENT, 2023, 34 (03)
  • [24] Blind Super Resolution of Real-Life Video Sequences
    Faramarzi, Esmaeil
    Rajan, Dinesh
    Fernandes, Felix C. A.
    Christensen, Marc P.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (04) : 1544 - 1555
  • [25] SAR Analysis for Real Video Super- Resolution Improvement
    Li, Shi
    Chen, Xiaodiao
    Zhou, Steven Zhiying
    Pan, Wanbin
    Wang, Yigang
    IEEE ACCESS, 2020, 8 (08): : 54816 - 54832
  • [26] CTVSR: Collaborative Spatial-Temporal Transformer for Video Super-Resolution
    Tang, Jun
    Lu, Chenyan
    Liu, Zhengxue
    Li, Jiale
    Dai, Hang
    Ding, Yong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 5018 - 5032
  • [27] Space-time super-resolution for satellite video: A joint framework based on multi-scale spatial-temporal transformer
    Xiao, Yi
    Yuan, Qiangqiang
    He, Jiang
    Zhang, Qiang
    Sun, Jing
    Su, Xin
    Wu, Jialian
    Zhang, Liangpei
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 108
  • [28] A Real-Time Convolutional Neural Network for Super-Resolution on FPGA With Applications to 4K UHD 60 fps Video Services
    Kim, Yongwoo
    Choi, Jae-Seok
    Kim, Munchurl
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (08) : 2521 - 2534
  • [29] Gradient information distillation network for real-time single-image super-resolution
    Bin Meng
    Lining Wang
    Zheng He
    Gwanggil Jeon
    Qingyu Dou
    Xiaomin Yang
    Journal of Real-Time Image Processing, 2021, 18 : 333 - 344
  • [30] Gradient information distillation network for real-time single-image super-resolution
    Meng, Bin
    Wang, Lining
    He, Zheng
    Jeon, Gwanggil
    Dou, Qingyu
    Yang, Xiaomin
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2021, 18 (02) : 333 - 344