Semantic guidance incremental network for efficiency video super-resolution

Cited by: 0

Authors:

He, Xiaonan [1]

Xia, Yukun [2]

Qiao, Yuansong [1]

Lee, Brian [1]

Ye, Yuhang [1]

Affiliations:

[1] Technol Univ Shannon Midlands Midwest, Univ Rd, Athlone N37 HD68, Ireland

[2] Jiangxi Coll Foreign Studies, Nanchang 330099, Jiangxi, Peoples R China

Source:

VISUAL COMPUTER | 2024

Keywords:

Video super-resolution; Semantic guidance; Efficiency; Convolutional neural network;

DOI:

10.1007/s00371-024-03488-y

Chinese Library Classification (CLC) number:

TP31 [Computer Software];

Discipline classification code:

081202; 0835;

Abstract:

In video streaming, bandwidth constraints significantly affect client-side video quality. Addressing this, deep neural networks offer a promising avenue for implementing video super-resolution (VSR) at the user end, leveraging advancements in modern hardware, including mobile devices. The principal challenge in VSR is the computational intensity involved in processing temporal/spatial video data. Conventional methods, uniformly processing entire scenes, often result in inefficient resource allocation. This is evident in the over-processing of simpler regions and insufficient attention to complex regions, leading to edge artifacts in merged regions. Our innovative approach employs semantic segmentation and spatial frequency-based categorization to divide each video frame into regions of varying complexity: simple, medium, and complex. These are then processed through an efficient incremental model, optimizing computational resources. A key innovation is the sparse temporal/spatial feature transformation layer, which mitigates edge artifacts and ensures seamless integration of regional features, enhancing the naturalness of the super-resolution outcome. Experimental results demonstrate that our method significantly boosts VSR efficiency while maintaining effectiveness. This marks a notable advancement in streaming video technology, optimizing video quality with reduced computational demands. This approach, featuring semantic segmentation, spatial frequency analysis, and an incremental network structure, represents a substantial improvement over traditional VSR methodologies, addressing the core challenges of efficiency and quality in high-resolution video streaming.
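
As a rough illustration of the spatial frequency-based categorization mentioned in the abstract, the following Python sketch labels fixed-size patches of a grayscale frame as simple, medium, or complex. The complexity measure (root mean square of horizontal and vertical pixel differences), the patch size, the thresholds, and the function names are all assumptions made for this example and are not taken from the paper; the semantic segmentation step and the incremental network are not modelled here.

```python
import numpy as np

# Hypothetical label values for the three complexity classes.
SIMPLE, MEDIUM, COMPLEX = 0, 1, 2


def spatial_frequency(patch: np.ndarray) -> float:
    """Spatial-frequency-style score: RMS of horizontal and vertical differences.

    This is an assumed measure for illustration, not the paper's definition.
    """
    patch = patch.astype(np.float64)
    row_diff = np.diff(patch, axis=1)  # differences between horizontal neighbours
    col_diff = np.diff(patch, axis=0)  # differences between vertical neighbours
    return float(np.sqrt(np.mean(row_diff ** 2) + np.mean(col_diff ** 2)))


def classify_regions(frame: np.ndarray, patch_size: int = 32,
                     thresholds: tuple = (2.0, 10.0)) -> np.ndarray:
    """Return a grid of labels, one per non-overlapping patch of `frame`.

    `thresholds` = (low, high) are illustrative cut-offs, not tuned values.
    Pixels beyond the last full patch are ignored.
    """
    low, high = thresholds
    rows, cols = frame.shape[0] // patch_size, frame.shape[1] // patch_size
    labels = np.zeros((rows, cols), dtype=np.int64)
    for i in range(rows):
        for j in range(cols):
            patch = frame[i * patch_size:(i + 1) * patch_size,
                          j * patch_size:(j + 1) * patch_size]
            score = spatial_frequency(patch)
            if score < low:
                labels[i, j] = SIMPLE
            elif score < high:
                labels[i, j] = MEDIUM
            else:
                labels[i, j] = COMPLEX
    return labels


# Synthetic 64x64 frame: flat, gentle ramp, checkerboard, flat.
frame = np.zeros((64, 64))
frame[:32, 32:] = 4 * np.arange(32)[None, :]                    # smooth ramp
frame[32:, :32] = (np.indices((32, 32)).sum(axis=0) % 2) * 255  # checkerboard
labels = classify_regions(frame)
print(labels)
```

In a pipeline of the kind the abstract describes, a label grid like this could decide how much of the network is applied to each region, with the simplest regions receiving the least computation.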

Citation

Pages: 4899-4911

Page count: 13

References: 31 in total

[1] Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation
Caballero, Jose
Ledig, Christian
Aitken, Andrew
Acosta, Alejandro
Totz, Johannes
Wang, Zehan
Shi, Wenzhe
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2848 - 2857
[2] BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment
Chan, Kelvin C. K.
Zhou, Shangchen
Xu, Xiangyu
Loy, Chen Change
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5962 - 5971
[3] Chan Kelvin CK, 2022, P IEEE CVF C COMP VI, P5972, DOI 10.48550/ARXIV.2104.13371
[4] Performance analysis of efficient video transmission using EvalSVC, EvalVid-NT, EvalVid
Dawood, M. Sheik
Benazer, S. Sakena
Karthick, R.
Ganesh, R. Senthil
Mary, S. Sugirtha
[J]. MATERIALS TODAY-PROCEEDINGS, 2021, 46 : 3848 - 3850
[5] Controlling Perceptual Factors in Neural Style Transfer
Gatys, Leon A.
Ecker, Alexander S.
Bethge, Matthias
Hertzmann, Aaron
Shechtman, Eli
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3730 - 3738
[6] RSTT: Real-time Spatial Temporal Transformer for Space-Time Video Super-Resolution
Geng, Zhicheng
Liang, Luming
Ding, Tianyu
Zharkov, Ilya
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17420 - 17430
[7] ClassSR: A General Framework to Accelerate Super-Resolution Networks by Data Characteristic
Kong, Xiangtao
Zhao, Hengyuan
Qiao, Yu
Dong, Chao
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12011 - 12020
[8] Patch-based Evaluation of Image Segmentation
Ledig, Christian
Shi, Wenzhe
Bai, Wenjia
Rueckert, Daniel
[J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3065 - 3072
[9] Deep Neural Network-based Enhancement for Image and Video Streaming Systems: A Survey and Future Directions
Lee, Royson
Venieris, Stylianos I.
Lane, Nicholas D.
[J]. ACM COMPUTING SURVEYS, 2021, 54 (08)
[10] Towards High-Quality and Efficient Video Super-Resolution via Spatial-Temporal Data Overfitting
Li, Gen
Ji, Jie
Qin, Minghai
Niu, Wei
Ren, Bin
Afghah, Fatemeh
Guo, Linke
Ma, Xiaolong
[J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10259 - 10269
