Resampling video super-resolution based on multi-scale guided optical flow

被引：0

作者：

Li, Puying ^{[1
]}

Zhu, Fuzhen ^{[1
]}

Liu, Yong ^{[1
]}

Zhang, Qi ^{[1
]}

机构：

[1] Heilongjiang Univ, Sch Elect Engn, Harbin 150080, Peoples R China

来源：

COMPUTERS & ELECTRICAL ENGINEERING | 2025年 / 123卷

关键词：

Video super-resolution; Transformer; Multi-scale adaptive flow estimation; Resampling; NETWORKS;

D O I：

10.1016/j.compeleceng.2025.110176

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Existing video super-resolution (VSR) methods are inadequate for dealing with inter-frame motion and spatial distortion problems, especially in high-motion scenes, which tend to lead to loss of details and degradation of reconstruction quality. To address these challenges, this paper puts forward a resampling video super-resolution algorithm based on multiscale guided optical flow. The method combines multi-scale guided optical flow estimation to address the issue of interframe motion and a resampling deformable convolution module to address the issue of spatial distortion. Specifically, features are first extracted from low-quality video frames using a convolutional layer, followed by feature extraction with Residual Swin Transformer Blocks (RSTBs). In the feature alignment module, a multiscale-guided optical flow estimation approach is employed, which addresses the inter-frame motion problem across different video segments and performs video frame interpolation and super-resolution reconstruction simultaneously. Furthermore, spatial alignment is achieved by integrating resampling into the deformable convolution module, mitigating spatial distortion. Finally, multiple Residual Swin Transformer Blocks (RSTBs) are used to extract and fuse features, and pixel rearrangement layers are employed to reconstruct high-quality video frames. The experimental results on the REDS, Vid4, and UDM10 datasets show that our method significantly outperforms current state-of-the-art (SOTA) techniques, with improvements of 0.61 dB in Peak Signal-to-Noise Ratio (PSNR) and 0.0121 in Structural Similarity (SSIM), validating the effectiveness and superiority of the method.

引用

页数：14

共 50 条

[1] Multi-scale Residual Dense Block for Video Super-Resolution
Cui, Hetao
Sun, Quansen
INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: VISUAL DATA ENGINEERING, PT I, 2019, 11935 : 424 - 434
[2] Bidirectional Multi-scale Deformable Attention for Video Super-Resolution
Zhenghua Zhou
Boxiang Xue
Hai Wang
Jianwei Zhao
Multimedia Tools and Applications, 2024, 83 : 27809 - 27830
[3] Bidirectional Multi-scale Deformable Attention for Video Super-Resolution
Zhou, Zhenghua
Xue, Boxiang
Wang, Hai
Zhao, Jianwei
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (09) : 27809 - 27830
[4] Video super-resolution based on multi-scale 3D convolution
Zhan K.
Sun Y.
Li Y.
Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2021, 48 (05): : 8 - 14
[5] Attention-guided video super-resolution with recurrent multi-scale spatial–temporal transformer
Wei Sun
Xianguang Kong
Yanning Zhang
Complex & Intelligent Systems, 2023, 9 : 3989 - 4002
[6] Optical flow for video super-resolution: a survey
Tu, Zhigang
Li, Hongyan
Xie, Wei
Liu, Yuanzhong
Zhang, Shifu
Li, Baoxin
Yuan, Junsong
ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (08) : 6505 - 6546
[7] Attention-guided video super-resolution with recurrent multi-scale spatial-temporal transformer
Sun, Wei
Kong, Xianguang
Zhang, Yanning
COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (04) : 3989 - 4002
[8] Optical flow for video super-resolution: a survey
Zhigang Tu
Hongyan Li
Wei Xie
Yuanzhong Liu
Shifu Zhang
Baoxin Li
Junsong Yuan
Artificial Intelligence Review, 2022, 55 : 6505 - 6546
[9] LightVSR: A Lightweight Video Super-Resolution Model with Multi-Scale Feature Aggregation
Huang, Guanglun
Li, Nachuan
Liu, Jianming
Zhang, Minghe
Zhang, Li
Li, Jun
APPLIED SCIENCES-BASEL, 2025, 15 (03):
[10] Video Super-Resolution Using Multi-Scale and Non-Local Feature Fusion
Li, Yanghui
Zhu, Hong
Hou, Qian
Wang, Jing
Wu, Wenhuan
ELECTRONICS, 2022, 11 (09)

← 1 2 3 4 5 →