Complexity reduction methods for Versatile Video Coding: A comparative review

Cited by: 0
Authors
Filipe, Jose N. [1 ,2 ]
Tavora, Luis M. N. [3 ,4 ]
Faria, Sergio M. M. [3 ,4 ]
Navarro, Antonio [1 ,2 ]
Assuncao, Pedro A. A. [3 ,4 ]
Affiliations
[1] Inst Telecomunicacoes, Campus Univ Santiago, P-3810193 Aveiro, Portugal
[2] Univ Aveiro, Campus Univ Santiago, P-3810193 Aveiro, Portugal
[3] Inst Telecomunicacoes, Campus 2, P-2411901 Leiria, Portugal
[4] Politecn Leiria, Campus 2, P-2411901 Leiria, Portugal
Keywords
Video coding complexity; VVC; Fast coding decisions; Complexity reduction metrics; FAST CU PARTITION; MODE DECISION METHOD; ALGORITHM; PREDICTION; GRADIENT
DOI
10.1016/j.dsp.2025.105021
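Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract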
To deal with the growing demand for ultra-high definition video and increasingly challenging requirements of services and applications, Versatile Video Coding (VVC) is the most recent standard, significantly improving the coding efficiency in comparison with its predecessors. However, since such improvement is obtained at the cost of a great increase in computational complexity, worldwide research has been underway to find methods capable of reducing the VVC complexity without compromising its coding efficiency. Given the number and diversity of the methods presented in the literature in recent years, every new research work only analyses and compares with a limited number of previous works, very often selected without clear criteria. Furthermore, we have found that the usual way to assess and compare complexity reduction methods, based on the tradeoff between complexity gain (in percentage) and coding efficiency loss (in Bjøntegaard delta rate (BD-Rate)), fails to be a valid performance indicator across the whole operational range. This paper is a contribution to establish a research reference by presenting a comprehensive comparative review of methods specifically proposed for complexity reduction of VVC. Two new performance comparison metrics are also proposed using the complexity gain and BD-Rate loss as parameters. One of them is characterised by a linear behaviour and the other is based on the distance to an efficiency frontier defined by the maximum complexity gain for a given BD-Rate loss. This comparative study takes into account different versions of the software implementation through a normalisation approach, which allows fair comparison of different methods implemented on different encoder software versions. In general, it is shown that Machine Learning (ML) based methods usually outperform heuristic ones while fast methods for intra coding mode estimation present the highest complexity reduction opportunities.
Overall, this paper provides a novel comparative study of 83 methods and proposes fair performance comparison metrics that are useful for further research in the field and also for future developments on hybrid video coding approaches with reduced computational complexity.
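The abstract does not give the formulas of the two proposed metrics, so the following is only a minimal sketch of the ideas it names, not the authors' definitions. It assumes that complexity gain is the usual encoding-time saving in percent, and that the efficiency frontier is the best complexity gain achieved by any method whose BD-Rate loss does not exceed a given value. The function names and the sample numbers are hypothetical and chosen purely for illustration.

```python
from bisect import bisect_right


def complexity_gain(t_ref, t_fast):
    """Encoding-time saving in percent relative to the reference encoder (assumed definition)."""
    return 100.0 * (t_ref - t_fast) / t_ref


def efficiency_frontier(points):
    """Build a frontier from (bd_rate_loss, gain) pairs.

    Returns (loss, best_gain) pairs sorted by loss, where best_gain is the
    maximum gain among all methods with BD-Rate loss <= loss.
    """
    frontier = []
    best = float("-inf")
    # Sort by loss; for equal loss, consider the higher gain first.
    for loss, gain in sorted(points, key=lambda p: (p[0], -p[1])):
        if gain > best:
            best = gain
            frontier.append((loss, gain))
    return frontier


def gap_to_frontier(point, frontier):
    """Vertical distance (in gain percentage points) from a method to the frontier."""
    loss, gain = point
    losses = [l for l, _ in frontier]
    i = bisect_right(losses, loss)
    if i == 0:
        return 0.0  # no frontier point at or below this BD-Rate loss
    return max(0.0, frontier[i - 1][1] - gain)


# Hypothetical (BD-Rate loss %, complexity gain %) pairs, not taken from the paper.
methods = [(0.5, 20.0), (1.0, 45.0), (1.2, 40.0), (2.0, 60.0)]
frontier = efficiency_frontier(methods)
gap = gap_to_frontier((1.2, 40.0), frontier)
```

In this sketch a method lying on the frontier has a gap of zero, while a method that is dominated by another one with lower BD-Rate loss and higher gain gets a positive gap. The paper may use a different distance (for example, a Euclidean distance to the frontier) and a different normalisation across encoder software versions.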
Pages: 22
Related papers
104 in total
  • [1] Cisco, White Paper: Cisco Annual Internet Report (2018–2023), (2020)
  • [2] Cisco, White paper: Cisco Visual Networking Index: Forecast and Methodology, 2017-2022, (2018)
  • [3] Wien M., Boyce J.M., Stockhammer T., Peng W., Guest editorial immersive video coding and transmission, IEEE J. Emerg. Sel. Top. Circuits Syst., 9, 1, pp. 1-4, (2019)
  • [4] Skupin R., Sanchez Y., Wang Y., Hannuksela M.M., Boyce J., Wien M., Standardization status of 360 degree video coding and delivery, IEEE Visual Communications and Image Processing (VCIP), pp. 1-4, (2017)
  • [5] Mercat A., Makinen A., Sainio J., Lemmetti A., Viitanen M., Vanne J., Comparative rate-distortion-complexity analysis of VVC and HEVC video codecs, IEEE Access, 9, pp. 67813-67828, (2021)
  • [6] Siqueira I., Correa G., Grellert M., Rate-distortion and complexity comparison of HEVC and VVC video encoders, 2020 IEEE 11th Latin American Symposium on Circuits & Systems (LASCAS), pp. 1-4, (2020)
  • [7] Filipe J.N., Carreira J., Tavora L.M.N., Faria S.M.M., Navarro A., Assuncao P.A.A., Complexity estimation for load balancing of 360-degree intra Versatile Video Coding, 2020 IEEE Workshop on Signal Processing Systems (SiPS), pp. 1-5, (2020)
  • [8] Correa G., Assuncao P.A., Agostini L.V., da Silva Cruz L.A., Fast HEVC encoding decisions using data mining, IEEE Trans. Circuits Syst. Video Technol., 25, 4, pp. 660-673, (2015)
  • [9] Correa G., Agostini L., da Silva Cruz L.A., Fast H.264/AVC to HEVC transcoder based on data mining and decision trees, 2016 IEEE International Symposium on Circuits and Systems (ISCAS), pp. 2539-2542, (2016)
  • [10] Borges A., Zatt B., Porto M., Agostini L., Correa G., Complexity scalable HEVC-to-AV1 transcoding based on coding tree depth inheritance, 2019 27th European Signal Processing Conference (EUSIPCO), pp. 1-5, (2019)