Tailored AVX2 Transform Kernels for Versatile Video Coding

Authors
Siivonen, Kari [1]
Sainio, Joose [1]
Mercat, Alexandre [1]
Vanne, Jarno [1]
Affiliations
[1] Tampere Univ, Ultra Video Grp, Tampere, Finland
Source
2023 IEEE NORDIC CIRCUITS AND SYSTEMS CONFERENCE, NORCAS | 2023
Funding
Academy of Finland;
Keywords
Versatile Video Coding (VVC); transform; complexity reduction; Advanced Vector Extensions 2 (AVX2); practical encoder implementation;
DOI
10.1109/NorCAS58970.2023.10305449
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Transform coding tools play an integral part in video codecs due to their substantial impact on coding efficiency. The latest video coding standard, Versatile Video Coding (VVC), makes the most of these tools by introducing new DST7, DCT8, and non-square transforms alongside the conventional DCT2 transform. This paper proposes optimized AVX2 kernels for all these transforms to speed up VVC coding. Unlike existing solutions, our kernels are specially tailored for each VVC transform type and block size. Accelerating our open-source uvg266 VVC encoder with the proposed kernels yields a speedup of up to 1.1x under the all-intra (AI) coding condition without any coding overhead. Our implementations make forward DCT2 and DST7/DCT8 transforms 4.0x and 6.7x as fast as their respective scalar implementations in the VTM reference encoder. They also outpace the AVX2 kernels of the practical VVenC encoder by factors of 3.0x and 2.8x. With inverse transforms, the respective speedups rise to as much as 5.3x, 11.1x, 3.4x, and 3.0x.
Pages: 6