Multi-Level Alignments for Compressed Video Super-Resolution

被引:0
|
作者
Wei, Liu [1 ]
Ye, Mao [1 ]
Ji, Luping [1 ]
Gan, Yan [2 ]
Li, Shuai [3 ]
Li, Xue [4 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[2] Chongqing Univ, Coll Comp Sci, Chongqing 400044, Peoples R China
[3] Shandong Univ, Sch Control Sci & Engn, Jinan 250100, Peoples R China
[4] Univ Queensland, Sch Elect Engn & Comp Sci, Brisbane, Qld 4072, Australia
基金
中国国家自然科学基金;
关键词
Streaming media; Transformers; Superresolution; Convolution; Video recording; Quality assessment; Encoding; Compressed video super-resolution; Transformer; Compressed video quality enhancement; SUPER RESOLUTION; ENHANCEMENT; EFFICIENCY;
D O I
10.1109/TCE.2024.3411144
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Due to the limited transmission bandwidth, to meet the application needs of consumer electronics products, there exists an approach to down-sample a video and then compress it to satisfy the limited bandwidth. The existing compressed video super-resolution methods pay more attention to the gain of low-frequency information in the video and process high-frequency information roughly. Besides, the geometric alignment information among temporal frames as well as the global information is also poorly extracted due to the limitation of the convolution operation. To address these limitations, we propose a Transformer based multi-level Alignments method to recover high-frequency and global information for compressed Video Super-Resolution (TAVSR). Specifically, a dual-branch alignment network is proposed. One branch is for recovering high-frequency information based on intra-frame which is compressed at original resolution; another branch is for low-frequency information in the continuous inter-frames at a lower resolution. For each branch, global and local alignments are performed respectively. To achieve global pixel movement alignment between the current frame and intra/inter-frame, Transformer based U-shape Network (TUNet) is proposed to estimate deformable convolution offsets, which performs much better than convolution in the geometric distance formulation from texture. By contrast, the local information is implicitly aligned using TUNet to keep the details. A multi-stage fusion module is further proposed to fuse aligned features to obtain the original resolution frame with enhanced quality. Extensive experiments show that the proposed method achieves the best rate-distortion (R-D) performance on JCT-VC test sequences compared with the most advanced methods.
引用
收藏
页码:5101 / 5114
页数:14
相关论文
共 50 条
  • [1] Lightweight Video Super-Resolution for Compressed Video
    Kwon, Ilhwan
    Li, Jun
    Prasad, Mukesh
    ELECTRONICS, 2023, 12 (03)
  • [2] Compressed Domain Deep Video Super-Resolution
    Chen, Peilin
    Yang, Wenhan
    Wang, Meng
    Sun, Long
    Hu, Kangkang
    Wang, Shiqi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 (30) : 7156 - 7169
  • [3] A NOVEL ALGORITHM OF SUPER-RESOLUTION RECONSTRUCTION FOR COMPRESSED VIDEO
    Xu Zhongqiang Zhu Xiuchang (Information Industry Ministry and Jiangsu Province Key Lab of Image Processing & Image Communication
    Journal of Electronics(China), 2007, (03) : 363 - 368
  • [4] Edge-Oriented Compressed Video Super-Resolution
    Wang, Zheng
    Quan, Guancheng
    He, Gang
    SENSORS, 2024, 24 (01)
  • [5] Super-resolution mosaicing from MPEG compressed video
    Kramer, P
    Hadar, O
    Benois-Pineau, J
    Domenger, JP
    2005 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), VOLS 1-5, 2005, : 801 - 804
  • [6] Super-resolution mosaicing from MPEG compressed video
    Kramer, P.
    Hadar, O.
    Benois-Pineau, J.
    Domenger, J. -P.
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2007, 22 (10) : 845 - 865
  • [7] Multi-level Feature Fusion Network for Single Image Super-Resolution
    Zhang, Xinxia
    Zhang, Xiaoqin
    Zhao, Li
    Jiang, Runhua
    Huang, Pengcheng
    Xu, Jiawei
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 3361 - 3368
  • [8] Super-resolution imaging with an achromatic multi-level diffractive microlens array
    Banerji, Sourangsu
    Meem, Monjurul
    Majumder, Apratim
    Sensale-Rodriguez, Berardi
    Menon, Rajesh
    OPTICS LETTERS, 2020, 45 (22) : 6158 - 6161
  • [9] Multi-level Feature Fusion Mechanism for Single Image Super-Resolution
    Lyn, Jiawen
    2020 THE 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTICS AND CONTROL ENGINEERING (IRCE 2020), 2020, : 52 - 57
  • [10] MFFN: image super-resolution via multi-level features fusion network
    Chen, Yuantao
    Xia, Runlong
    Yang, Kai
    Zou, Ke
    VISUAL COMPUTER, 2024, 40 (02): : 489 - 504