Real-Time Lightweight Video Super-Resolution With RRED-Based Perceptual Constraint

被引:1
作者
Wu, Xinyi [1 ]
Lopez-Tapia, Santiago [1 ]
Wang, Xijun [1 ]
Molina, Rafael [2 ]
Katsaggelos, Aggelos K. [1 ]
机构
[1] Northwestern Univ, Dept Comp Sci & Elect Engn, Evanston, IL 60208 USA
[2] Comp Sci & Artificial Intelligence Dept, Granada 18071, Spain
关键词
Streaming media; Real-time systems; Quality assessment; Video recording; Spatial resolution; Computational modeling; Circuits and systems; Video super-resolution; real-time; spatio-temporal consistency; video quality assessment; QUALITY ASSESSMENT; NETWORK;
D O I
10.1109/TCSVT.2024.3405827
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Real-time video services are gaining popularity in our daily life, yet limited network bandwidth can constrain the delivered video quality. Video Super Resolution (VSR) technology emerges as a key solution to enhance user experience by reconstructing high-resolution (HR) videos. The existing real-time VSR frameworks have primarily emphasized spatial quality metrics like PSNR and SSIM, which often lack consideration of temporal coherence, a critical factor for accurately reflecting the overall quality of super-resolved videos. Inspired by Video Quality Assessment (VQA) strategies, we propose a dual-frame training framework and a lightweight multi-branch network to address VSR processing in real time. Such designs thoroughly leverage the spatio-temporal correlations between consecutive frames so as to ensure efficient video restoration. Furthermore, we incorporate ST-RRED, a powerful VQA approach that separately measures spatial and temporal consistency aligning with human perception principles, into our loss functions. This guides us to synthesize quality-aware perceptual features across both space and time for realistic reconstruction. Our model demonstrates remarkable efficiency, achieving near real-time processing of 4K videos. Compared to the state-of-the-art lightweight model MRVSR, ours is more compact and faster, 60% smaller in size (0.483M vs. 1.21M parameters), and 106% quicker (96.44fps vs. 46.7fps on 1080p frames), with significantly improved perceptual quality.
引用
收藏
页码:10310 / 10325
页数:16
相关论文
empty
未找到相关数据