Online Video Super-Resolution With Convolutional Kernel Bypass Grafts

被引：18

作者：

Xiao, Jun ^{[1
]}

Jiang, Xinyang ^{[2
]}

Zheng, Ningxin ^{[2
]}

Yang, Huan ^{[2
]}

Yang, Yifan ^{[2
]}

Yang, Yuqing ^{[2
]}

Li, Dongsheng ^{[2
]}

Lam, Kin-Man ^{[1
]}

机构：

[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Hong Kong, Peoples R China

[2] Microsoft Res Asia, Shanghai 200232, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2023年 / 25卷

关键词：

Feature extraction; Complexity theory; Streaming media; Kernel; Superresolution; Computational modeling; Video sequences; Video super-resolution; deep lightweight model; video restoration;

D O I：

10.1109/TMM.2023.3243615

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Deep learning-based models have achieved remarkable performance in video super-resolution (VSR) in recent years, but most of these models are less applicable to online video applications. These methods solely consider the distortion quality and ignore crucial requirements for online applications, e.g., low latency and low model complexity. In this paper, we focus on online video transmission in which VSR algorithms are required to generate high-resolution video sequences frame by frame in real time. To address such challenges, we propose an extremely low-latency VSR algorithm based on a novel kernel knowledge transfer method, named the convolutional kernel bypass graft (CKBG). First, we design a lightweight network structure that does not require future frames as inputs and saves extra time for caching these frames. Then, our proposed CKBG method enhances this lightweight base model by bypassing the original network with "kernel grafts," which are extra convolutional kernels containing the prior knowledge of the external pretrained image SR models. During the testing phase, we further accelerate the grafted multibranch network by converting it into a simple single-path structure. The experimental results show that our proposed method can process online video sequences up to 110 FPS with very low model complexity and competitive SR performance.

引用

页码：8972 / 8987

页数：16

共 60 条

[1] Fast, Accurate, and Lightweight Super-Resolution with Cascading Residual Network [J].

Ahn, Namhyuk ;

Kang, Byungkon ;

Sohn, Kyung-Ah .

COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 :256-272

[2] Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation [J].

Caballero, Jose ;

Ledig, Christian ;

Aitken, Andrew ;

Acosta, Alejandro ;

Totz, Johannes ;

Wang, Zehan ;

Shi, Wenzhe .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2848-2857

[3] Toward Real-World Single Image Super-Resolution: A New Benchmark and A New Model [J].

Cai, Jianrui ;

Zeng, Hui ;

Yong, Hongwei ;

Cao, Zisheng ;

Zhang, Lei .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3086-3095

[4] BasicVSR plus plus : Improving Video Super-Resolution with Enhanced Propagation and Alignment [J].

Chan, Kelvin C. K. ;

Zhou, Shangchen ;

Xu, Xiangyu ;

Loy, Chen Change .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :5962-5971

[5] BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond [J].

Chan, Kelvin C. K. ;

Wang, Xintao ;

Yu, Ke ;

Dong, Chao ;

Loy, Chen Change .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :4945-4954

[6] Audio Matters in Video Super-Resolution by Implicit Semantic Guidance [J].

Chen, Yanxiang ;

Zhao, Pengcheng ;

Qi, Meibin ;

Zhao, Yang ;

Jia, Wei ;

Wang, Ronggang .

IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 :4128-4142

[7]

Chu XX, 2020, Img Proc Comp Vis Re, V12538, P99, DOI 10.1007/978-3-030-66823-5_6

[8] Deformable Convolutional Networks [J].

Dai, Jifeng ;

Qi, Haozhi ;

Xiong, Yuwen ;

Li, Yi ;

Zhang, Guodong ;

Hu, Han ;

Wei, Yichen .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773

[9] Diverse Branch Block: Building a Convolution as an Inception-like Unit [J].

Ding, Xiaohan ;

Zhang, Xiangyu ;

Han, Jungong ;

Ding, Guiguang .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :10881-10890

[10] RepVGG: Making VGG-style ConvNets Great Again [J].

Ding, Xiaohan ;

Zhang, Xiangyu ;

Ma, Ningning ;

Han, Jungong ;

Ding, Guiguang ;

Sun, Jian .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13728-13737

← 1 2 3 4 5 6 →