Lightweight Video Super-Resolution for Compressed Video

被引:4
作者
Kwon, Ilhwan [1 ]
Li, Jun [1 ]
Prasad, Mukesh [1 ]
机构
[1] Univ Technol Sydney, Sch Comp Sci, FEIT, Sydney 2007, Australia
关键词
video super-resolution; video compression; motion vector; spatio-temporal consistency; CONVOLUTION;
D O I
10.3390/electronics12030660
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video compression technology for Ultra-High Definition (UHD) and 8K UHD video has been established and is being widely adopted by major broadcasting companies and video content providers, allowing them to produce high-quality videos that meet the demands of today's consumers. However, high-resolution video content broadcasting is not an easy problem to be resolved in the near future due to limited resources in network bandwidth and data storage. An alternative solution to overcome the challenges of broadcasting high-resolution video content is to downsample UHD or 8K video at the transmission side using existing infrastructure, and then utilizing Video Super-Resolution (VSR) technology at the receiving end to recover the original quality of the video content. Current deep learning-based methods for Video Super-Resolution (VSR) fail to consider the fact that the delivered video to viewers goes through a compression and decompression process, which can introduce additional distortion and loss of information. Therefore, it is crucial to develop VSR methods that are specifically designed to work with the compression-decompression pipeline. In general, various information in the compressed video is not utilized enough to realize the VSR model. This research proposes a highly efficient VSR network making use of data from decompressed video such as frame type, Group of Pictures (GOP), macroblock type and motion vector. The proposed Convolutional Neural Network (CNN)-based lightweight VSR model is suitable for real-time video services. The performance of the model is extensively evaluated through a series of experiments, demonstrating its effectiveness and applicability in practical scenarios.
引用
收藏
页数:18
相关论文
共 59 条
[1]  
antmedia.io, VID BITR VS RES 4 KE
[2]  
Blau Y, 2020, Arxiv, DOI [arXiv:1711.06077, 10.1109/CVPR.2018.00652, DOI 10.1109/CVPR.2018.00652]
[3]   Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation [J].
Caballero, Jose ;
Ledig, Christian ;
Aitken, Andrew ;
Acosta, Alejandro ;
Totz, Johannes ;
Wang, Zehan ;
Shi, Wenzhe .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2848-2857
[4]  
cdvl.org, US
[5]   iSeeBetter: Spatio-temporal video super-resolution using recurrent generative back-projection networks [J].
Chadha, Aman ;
Britto, John ;
Roja, M. Mani .
COMPUTATIONAL VISUAL MEDIA, 2020, 6 (03) :307-317
[6]   Compressed Domain Deep Video Super-Resolution [J].
Chen, Peilin ;
Yang, Wenhan ;
Wang, Meng ;
Sun, Long ;
Hu, Kangkang ;
Wang, Shiqi .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 (30) :7156-7169
[7]  
Clark A, 2019, Arxiv, DOI arXiv:1907.06571
[8]   Compression Artifacts Reduction by a Deep Convolutional Network [J].
Dong, Chao ;
Deng, Yubin ;
Loy, Chen Change ;
Tang, Xiaoou .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :576-584
[9]   Image Super-Resolution Using Deep Convolutional Networks [J].
Dong, Chao ;
Loy, Chen Change ;
He, Kaiming ;
Tang, Xiaoou .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (02) :295-307
[10]   A Survey of Natural Language Generation [J].
Dong, Chenhe ;
Li, Yinghui ;
Gong, Haifan ;
Chen, Miaoxin ;
Li, Junxin ;
Shen, Ying ;
Yang, Min .
ACM COMPUTING SURVEYS, 2023, 55 (08)