Real-time video services are gaining popularity in daily life, yet limited network bandwidth can constrain the delivered video quality. Video Super-Resolution (VSR) has emerged as a key technology for enhancing user experience by reconstructing high-resolution (HR) videos. Existing real-time VSR frameworks have primarily emphasized spatial quality metrics such as PSNR and SSIM, which neglect temporal coherence, a critical factor in the overall quality of super-resolved videos. Inspired by Video Quality Assessment (VQA) strategies, we propose a dual-frame training framework and a lightweight multi-branch network for real-time VSR. These designs fully exploit the spatio-temporal correlations between consecutive frames to enable efficient video restoration. Furthermore, we incorporate ST-RRED, a powerful VQA metric that separately measures spatial and temporal consistency in line with human perception, into our loss functions. This guides the network to synthesize quality-aware perceptual features across both space and time for realistic reconstruction. Our model is highly efficient, achieving near real-time processing of 4K videos. Compared with the state-of-the-art lightweight model MRVSR, ours is more compact and faster: 60% smaller (0.483M vs. 1.21M parameters) and 106% quicker (96.44 fps vs. 46.7 fps on 1080p frames), with significantly improved perceptual quality.
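
To make the dual-frame, quality-aware training idea concrete, the sketch below pairs a per-frame spatial penalty with a temporal-consistency penalty on consecutive frames. It is a simplified illustration only, not the paper's ST-RRED-based loss: the Charbonnier penalty, the weights w_spatial and w_temporal, and all function names are assumptions introduced here for exposition.

    # Minimal sketch (assumed, not the paper's implementation) of a dual-frame
    # loss combining a spatial term on each frame with a temporal term that
    # compares frame-to-frame changes of the SR pair against the HR pair.
    import torch

    def charbonnier(x, eps=1e-6):
        """Smooth L1-style penalty, averaged over all elements."""
        return torch.sqrt(x * x + eps * eps).mean()

    def dual_frame_loss(sr_prev, sr_curr, hr_prev, hr_curr,
                        w_spatial=1.0, w_temporal=0.5):
        """Spatial fidelity on both super-resolved frames plus a
        temporal-consistency penalty on their frame differences."""
        spatial = charbonnier(sr_prev - hr_prev) + charbonnier(sr_curr - hr_curr)
        temporal = charbonnier((sr_curr - sr_prev) - (hr_curr - hr_prev))
        return w_spatial * spatial + w_temporal * temporal

    if __name__ == "__main__":
        # Toy check with random tensors shaped (batch, channels, height, width).
        hr_prev, hr_curr = torch.rand(2, 3, 64, 64), torch.rand(2, 3, 64, 64)
        sr_prev, sr_curr = torch.rand(2, 3, 64, 64), torch.rand(2, 3, 64, 64)
        print(dual_frame_loss(sr_prev, sr_curr, hr_prev, hr_curr).item())

In this toy form, the temporal term only penalizes mismatched motion between consecutive frames; the actual ST-RRED formulation measures spatial and temporal entropic differences aligned with human perception.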