Semantic guidance incremental network for efficiency video super-resolution

被引:0
作者
He, Xiaonan [1 ]
Xia, Yukun [2 ]
Qiao, Yuansong [1 ]
Lee, Brian [1 ]
Ye, Yuhang [1 ]
机构
[1] Technol Univ Shannon Midlands Midwest, Univ Rd, Athlone N37 HD68, Ireland
[2] Jiangxi Coll Foreign Studies, Nanchang 330099, Jiangxi, Peoples R China
关键词
Video super-resolution; Semantic guidance; Efficiency; Convolutional neural network;
D O I
10.1007/s00371-024-03488-y
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In video streaming, bandwidth constraints significantly affect client-side video quality. Addressing this, deep neural networks offer a promising avenue for implementing video super-resolution (VSR) at the user end, leveraging advancements in modern hardware, including mobile devices. The principal challenge in VSR is the computational intensity involved in processing temporal/spatial video data. Conventional methods, uniformly processing entire scenes, often result in inefficient resource allocation. This is evident in the over-processing of simpler regions and insufficient attention to complex regions, leading to edge artifacts in merged regions. Our innovative approach employs semantic segmentation and spatial frequency-based categorization to divide each video frame into regions of varying complexity: simple, medium, and complex. These are then processed through an efficient incremental model, optimizing computational resources. A key innovation is the sparse temporal/spatial feature transformation layer, which mitigates edge artifacts and ensures seamless integration of regional features, enhancing the naturalness of the super-resolution outcome. Experimental results demonstrate that our method significantly boosts VSR efficiency while maintaining effectiveness. This marks a notable advancement in streaming video technology, optimizing video quality with reduced computational demands. This approach, featuring semantic segmentation, spatial frequency analysis, and an incremental network structure, represents a substantial improvement over traditional VSR methodologies, addressing the core challenges of efficiency and quality in high-resolution video streaming.
引用
收藏
页码:4899 / 4911
页数:13
相关论文
共 31 条
  • [1] Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation
    Caballero, Jose
    Ledig, Christian
    Aitken, Andrew
    Acosta, Alejandro
    Totz, Johannes
    Wang, Zehan
    Shi, Wenzhe
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2848 - 2857
  • [2] BasicVSR plus plus : Improving Video Super-Resolution with Enhanced Propagation and Alignment
    Chan, Kelvin C. K.
    Zhou, Shangchen
    Xu, Xiangyu
    Loy, Chen Change
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5962 - 5971
  • [3] Chan Kelvin CK, 2022, P IEEE CVF C COMP VI, P5972, DOI DOI 10.48550/ARXIV.2104.13371
  • [4] Performance analysis of efficient video transmission using EvalSVC, EvalVid-NT, EvalVid
    Dawood, M. Sheik
    Benazer, S. Sakena
    Karthick, R.
    Ganesh, R. Senthil
    Mary, S. Sugirtha
    [J]. MATERIALS TODAY-PROCEEDINGS, 2021, 46 : 3848 - 3850
  • [5] Controlling Perceptual Factors in Neural Style Transfer
    Gatys, Leon A.
    Ecker, Alexander S.
    Bethge, Matthias
    Hertzmann, Aaron
    Shechtman, Eli
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3730 - 3738
  • [6] RSTT: Real-time Spatial Temporal Transformer for Space-Time Video Super-Resolution
    Geng, Zhicheng
    Liang, Luming
    Ding, Tianyu
    Zharkov, Ilya
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17420 - 17430
  • [7] ClassSR: A General Framework to Accelerate Super-Resolution Networks by Data Characteristic
    Kong, Xiangtao
    Zhao, Hengyuan
    Qiao, Yu
    Dong, Chao
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12011 - 12020
  • [8] Patch-based Evaluation of Image Segmentation
    Ledig, Christian
    Shi, Wenzhe
    Bai, Wenjia
    Rueckert, Daniel
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3065 - 3072
  • [9] Deep Neural Network-based Enhancement for Image and Video Streaming Systems: A Survey and Future Directions
    Lee, Royson
    Venieris, Stylianos, I
    Lane, Nicholas D.
    [J]. ACM COMPUTING SURVEYS, 2021, 54 (08)
  • [10] Towards High-Quality and Efficient Video Super-Resolution via Spatial-Temporal Data Overfitting
    Li, Gen
    Ji, Jie
    Qin, Minghai
    Niu, Wei
    Ren, Bin
    Afghah, Fatemeh
    Guo, Linke
    Ma, Xiaolong
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10259 - 10269