CycMuNet+: Cycle-Projected Mutual Learning for Spatial-Temporal Video Super-Resolution

Cited by: 19
Authors
Hu, Mengshun [1 ]
Jiang, Kui [2 ]
Wang, Zheng [1 ]
Bai, Xiang [3 ]
Hu, Ruimin [1 ]
Affiliations
[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
[2] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
[3] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan 430074, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Superresolution; Optical flow; Task analysis; Image reconstruction; Correlation; Iterative methods; Performance evaluation; Spatial-temporal video super-resolution; spatial video super-resolution; temporal video super-resolution; cycle-projected; mutual learning; up-projection unit; down-projection unit; SPACE-TIME SUPERRESOLUTION;
DOI
10.1109/TPAMI.2023.3293522
CLC Number
TP18 [Artificial Intelligence Theory];
Discipline Codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Spatial-Temporal Video Super-Resolution (ST-VSR) aims to generate high-quality videos with higher resolution (HR) and higher frame rate (HFR). Quite intuitively, pioneering two-stage methods complete ST-VSR by directly combining two sub-tasks, Spatial Video Super-Resolution (S-VSR) and Temporal Video Super-Resolution (T-VSR), but ignore the reciprocal relations between them: 1) T-VSR to S-VSR: temporal correlations help accurate spatial detail representation; 2) S-VSR to T-VSR: abundant spatial information contributes to the refinement of temporal prediction. To this end, we propose a one-stage Cycle-projected Mutual learning network (CycMuNet) for ST-VSR, which makes full use of spatial-temporal correlations via mutual learning between S-VSR and T-VSR. Specifically, we propose to exploit the mutual information between them via iterative up- and down-projections, where spatial and temporal features are fully fused and distilled, helping high-quality video reconstruction. In addition, we show interesting extensions for efficient network design (CycMuNet+), such as parameter sharing and dense connections on projection units, as well as a feedback mechanism in CycMuNet. Besides extensive experiments on benchmark datasets, we also evaluate our proposed CycMuNet(+) on the individual S-VSR and T-VSR tasks, demonstrating that our method significantly outperforms the state-of-the-art methods.
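The iterative up- and down-projections mentioned in the abstract follow the general back-projection pattern: an initial projection to the target space, a re-projection back to the source space, and a residual correction. A minimal sketch of that pattern, assuming plain nearest-neighbor/average resampling in place of the paper's learned projection layers (the function names `up_projection` and `down_projection` are illustrative, not taken from the paper's code):

```python
import numpy as np

def upsample(x, s=2):
    # Nearest-neighbor upsampling by factor s along both spatial axes.
    return x.repeat(s, axis=0).repeat(s, axis=1)

def downsample(x, s=2):
    # Average-pool downsampling by factor s along both spatial axes.
    h, w = x.shape
    return x.reshape(h // s, s, w // s, s).mean(axis=(1, 3))

def up_projection(lr, s=2):
    """Back-projection-style up-projection: project up, re-project
    down, and correct the estimate with the LR-space residual."""
    hr0 = upsample(lr, s)          # initial HR estimate
    lr0 = downsample(hr0, s)       # map estimate back to LR space
    err = lr0 - lr                 # residual in LR space
    return hr0 - upsample(err, s)  # residual-corrected HR features

def down_projection(hr, s=2):
    """Symmetric down-projection unit in HR-to-LR direction."""
    lr0 = downsample(hr, s)
    hr0 = upsample(lr0, s)
    err = hr0 - hr
    return lr0 - downsample(err, s)
```

In the paper these resampling operators are learned convolutional (de)projection layers and the units are stacked and cycled between the spatial and temporal branches; the sketch only illustrates the project/re-project/correct structure of a single unit.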
Pages: 13376-13392 (17 pages)