An IBC Reference Block Enhancement Model Based on GAN for Screen Content Video Coding

被引:0
|
作者
Yang, Pengjian [1 ]
Wang, Jun [2 ]
Zhong, Guangyu [1 ]
Zhang, Pengyuan [3 ]
Zhang, Lai [1 ]
Liang, Fan [1 ]
Yang, Jianxin [2 ]
机构
[1] Sun Yat Sen Univ, Elect & Informat Technol, Guangzhou, Peoples R China
[2] Zhuhai Jieli Technol Co Ltd, Zhuhai, Peoples R China
[3] Wuhan Res Inst Posts & Telecommun, Wuhan, Peoples R China
来源
关键词
Generative Adversarial Network (GAN); Versatile Video Coding (VVC); Intra block copy (IBC); Screen content coding (SCC);
D O I
10.1007/978-3-030-98355-0_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a special kind of video coding, screen content coding (SCC) has received widespread attention because of the popularity of online classes and conferences. However, few people use neural networks to improve the compression efficiency of SCC. Intra block copy (IBC) is one of the most important coding tools in SCC, which can save half of the bitrate. Due to the need to copy the content of the reference block, the performance of IBC mode largely depends on the quality of the reference block. In the standard encoding process of Versatile Video Coding (VVC), the IBC reference block is not filtered, and there are still serious compression artifacts. This will result in a decrease in IBC search accuracy and SCC compression efficiency. Inspired by in-loop filtering, we propose an IBC reference blocks enhancement network based on GAN (IREGAN) to filter the reference blocks before IBC estimation, which can improve the quality of IBC reference block and the accuracy of IBC matching. In addition to the generator used for image enhancement, our model also includes a variance-based classifier and a discriminator obtained from adversarial training The classifier can effectively improve the efficiency of the model and the discriminator can improve the robustness of the entire system. Experimental results demon-strate the performance gains of IREGAN with VTM10.0, offering about 6.98% BDBR reduction, 0.71dB BDPSNR gains in average (luminance). SSIM increased by 0.0113 and the number of blocks using IBC mode is increased by 1.42%.
引用
收藏
页码:15 / 26
页数:12
相关论文
共 50 条
  • [1] IBC-Mirror Mode for Screen Content Coding for the Next Generation Video Coding Standards
    Cao, Jian
    Qiu, Zhen
    Li, Zhengren
    Liang, Fan
    Wang, Jun
    2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 471 - 474
  • [2] Intra Block Copy Mirror Mode for Screen Content Coding in Versatile Video Coding
    Cao, Jian
    Liang, Fan
    Wang, Jun
    IEEE ACCESS, 2021, 9 : 31390 - 31400
  • [3] Intra block copy mirror mode for screen content coding in versatile video coding
    Cao, Jian
    Liang, Fan
    Wang, Jun
    IEEE Access, 2021, 9 : 31390 - 31400
  • [4] Screen Content Video Coding and Applications
    Peng, Wen-Hsiao
    Xu, Jizheng
    Cohen, Robert A.
    Ostermann, Joern
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2016, 6 (04) : 389 - 392
  • [5] Enhancement Layer Coding for Chroma Sub-Sampled Screen Content Video
    Heindel, Andreas
    Prestele, Benjamin
    Gehlert, Alexander
    Kaup, Andre
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (02) : 788 - 801
  • [6] Hash-Based Block Matching for Screen Content Coding
    Zhu, Weijia
    Ding, Wenpeng
    Xu, Jizheng
    Shi, Yunhui
    Yin, Baocai
    IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (07) : 935 - 944
  • [7] A content-based rate control algorithm for screen content video coding
    Yang, Yang
    Shen, Liquan
    Yang, Hao
    An, Ping
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 60 : 328 - 338
  • [8] An Open Video Dataset For Screen Content Coding
    Wang, Yingbin
    Zhao, Xin
    Xu, Xiaozhong
    Liu, Shan
    Lei, Zhijun
    Afonso, Mariana
    Norkin, Andrey
    Daede, Thomas
    2022 PICTURE CODING SYMPOSIUM (PCS), 2022, : 301 - 305
  • [9] A STAIRCASE TRANSFORM CODING SCHEME FOR SCREEN CONTENT VIDEO CODING
    Chen, Cheng
    Han, Jingning
    Xu, Yaowu
    Bankoski, James
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 2365 - 2369
  • [10] Block Partitioning Decision Based on Content Complexity for Future Video Coding
    Zhang, Yanhong
    Zhao, Yao
    Lin, Chunyu
    Liu, Meiqin
    IMAGE AND GRAPHICS, ICIG 2019, PT III, 2019, 11903 : 70 - 80