RVSRT: Real-time Video Super Resolution Transformer

Cited by: 0

Authors:

Ou, Linlin [1,2]

Chen, Yuanping [2]

Affiliations:

[1] Chinese Acad Sci, Comp Network Informat Ctr, Beijing, Peoples R China

[2] Univ Chinese Acad Sci, Beijing, Peoples R China

Source:

FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022 | 2022 / Vol. 12705

Keywords:

Video super resolution; vision transformer; deep learning;

DOI:

10.1117/12.2680156

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Video super-resolution is the task of converting low-resolution video to high-resolution video. Existing methods with the best visual quality are mainly based on convolutional neural networks (CNNs), but their architectures are heavy, which makes inference slow. To address this problem, this paper proposes a real-time video super-resolution Transformer (RVSRT) that completes the super-resolution task quickly while preserving the visual smoothness of transitions between video frames. Unlike traditional CNN-based methods, this paper does not process video frames separately with different network modules in the temporal domain. Instead, it batches adjacent frames through a single end-to-end Transformer network with a UNet-style structure. Moreover, this paper places two-stage interpolation sampling before and after the end-to-end network to make the most of traditional computer vision (CV) algorithms. The experimental results show that, compared with the state-of-the-art (SOTA) TMNet [1], RVSRT has only about 20% of the network size (2.3M vs 12.3M parameters) while maintaining comparable performance, and its speed is about 80% higher (26.2 fps vs 14.3 fps at a frame size of 720×576).
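The size and speed comparison in the abstract can be checked with a few lines of arithmetic. The sketch below uses only the four figures quoted above; the variable names are illustrative and do not come from the paper.

```python
# Figures quoted in the abstract: parameters in millions, speed in fps at 720x576.
rvsrt_params_m, tmnet_params_m = 2.3, 12.3
rvsrt_fps, tmnet_fps = 26.2, 14.3

# Fraction of TMNet's parameter count that RVSRT uses.
param_ratio = rvsrt_params_m / tmnet_params_m

# Relative frame-rate gain of RVSRT over TMNet.
speedup = rvsrt_fps / tmnet_fps - 1.0

print(f"parameter ratio: {param_ratio:.1%}")
print(f"speed increase:  {speedup:.1%}")
```

On these figures the ratios work out to roughly 18.7% of the parameters and an 83% higher frame rate, consistent with the rounded 20% and 80% stated in the abstract.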


Pages: 5
