An Efficient and Flexible Complexity Control Method for Versatile Video Coding

被引:1
作者
Zhao, Yan [1 ]
Zhu, Chen [1 ]
Xu, Jun [1 ]
Lu, Guo [1 ]
Song, Li [1 ,2 ]
Ma, Siwei [1 ,3 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, MoE Key Lab Artificial Intelligence, AI Inst, Shanghai 200240, Peoples R China
[3] Peking Univ, Natl Engn Res Ctr Visual Technol, Sch Comp Sci, Beijing 100871, Peoples R China
关键词
Complexity theory; Encoding; Estimation; Resource management; Visualization; Standards; Partitioning algorithms; Distortion; Copper; Video recording; Versatile video coding (VVC); complexity control; complexity estimation; block partition; motion estimation; PARTITION; ENCODERS; SUPPORT;
D O I
10.1109/TBC.2024.3475811
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recently, numerous complexity control approaches have been proposed to achieve the target encoding complexity. However, only few of them were developed for VVC encoders. This paper fills this gap by proposing an efficient and flexible complexity control approach for VVC. The support for both Acceleration Ratio Control (ARC) and Encoding Time Control (ETC) makes our method highly versatile for various applications. At first, we introduce a sequence-level complexity estimation model to merge the ARC and ETC tasks. Then, four key modules are involved for complexity control: complexity allocation, complexity estimation, encoding configuration decision, and feedback. Specifically, we hierarchically allocate the complexity budget to three coding levels: GOP, frame, and Basic Unit (BU). Each BU's allocation weight is decided by its SSIM distortion, whereby the perceptual quality can be ensured. The multi-complexity configurations are established by altering the partition depth and number of reference frames. Via tuning each BU's configuration according to its target acceleration ratio and adaptively updating the control strategies based on the feedback, our scheme can precisely realize any achievable acceleration targets within one-pass encoding. Moreover, each BU's un-accelerated reference encoding time, which is used to calculate its target acceleration ratio, is estimated by SVR models. Experiments prove that for both the ARC and ETC tasks, our scheme can precisely achieve a wide range of complexity targets (30% similar to 100%) with negligible RD loss in PSNR and SSIM, outperforming other state-of-the-art methods.
引用
收藏
页码:96 / 110
页数:15
相关论文
共 49 条
[1]  
Abdi H., 2010, ENCY RES DESIGN, V3, P1, DOI DOI 10.4135/9781412961288.N168
[2]  
Bjontegaard G., 2001, VCEG M
[3]   VVC Complexity and Software Implementation Analysis [J].
Bossen, Frank ;
Suehring, Karsten ;
Wieckowski, Adam ;
Liu, Shan .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (10) :3765-3778
[4]  
Bross B., 2020, ISO/IEC Standard ITU-T JVET-S2001
[5]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[6]   Pareto-Based Method for High Efficiency Video Coding With Limited Encoding Time [J].
Correa, Guilherme ;
Assuncao, Pedro A. ;
Agostini, Luciano Volcan ;
da Silva Cruz, Luis A. .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (09) :1734-1745
[7]   Complexity Control of High Efficiency Video Encoders for Power-Constrained Devices [J].
Correa, Guilherme ;
Assuncao, Pedro ;
Agostini, Luciano ;
da Silva Cruz, Luis A. .
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2011, 57 (04) :1866-1874
[8]   Hierarchical Complexity Control of HEVC for Live Video Encoding [J].
Deng, Xin ;
Xu, Mai ;
Li, Chen .
IEEE ACCESS, 2016, 4 :7014-7027
[9]  
Deng X, 2014, 2014 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING CONFERENCE, P225, DOI 10.1109/VCIP.2014.7051545
[10]   Subjective-Driven Complexity Control Approach for HEVC [J].
Deng, Xin ;
Xu, Mai ;
Jiang, Lai ;
Sun, Xiaoyan ;
Wang, Zulin .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (01) :91-106