Advances in Video Compression System Using Deep Neural Network: A Review and Case Studies

被引:34
作者
Ding, Dandan [1 ]
Ma, Zhan [2 ]
Chen, Di [3 ]
Chen, Qingshuang [4 ]
Liu, Zoe [5 ]
Zhu, Fengqing [4 ]
机构
[1] Hangzhou Normal Univ, Sch Informat Sci & Engn, Hangzhou 311121, Peoples R China
[2] Nanjing Univ, Sch Elect Sci & Engn, Nanjing 210093, Peoples R China
[3] Google Inc, Mountain View, CA 94043 USA
[4] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
[5] Visionular Inc, Los Altos, CA 94022 USA
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Encoding; Video compression; Video coding; Streaming media; Visualization; Quality of experience; Spatiotemporal phenomena; Deep learning; Neural networks; Adaptive filters; deep neural networks (DNNs); neural video coding; texture analysis; PREDICTION; MODEL; SEGMENTATION; STATISTICS; FRAMEWORK; STANDARD; SCALE;
D O I
10.1109/JPROC.2021.3059994
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Significant advances in video compression systems have been made in the past several decades to satisfy the near-exponential growth of Internet-scale video traffic. From the application perspective, we have identified three major functional blocks, including preprocessing, coding, and postprocessing, which have been continuously investigated to maximize the end-user quality of experience (QoE) under a limited bit rate budget. Recently, artificial intelligence (AI)-powered techniques have shown great potential to further increase the efficiency of the aforementioned functional blocks, both individually and jointly. In this article, we review recent technical advances in video compression systems extensively, with an emphasis on deep neural network (DNN)-based approaches, and then present three comprehensive case studies. On preprocessing, we show a switchable texture-based video coding example that leverages DNN-based scene understanding to extract semantic areas for the improvement of a subsequent video coder. On coding, we present an end-to-end neural video coding framework that takes advantage of the stacked DNNs to efficiently and compactly code input raw videos via fully data-driven learning. On postprocessing, we demonstrate two neural adaptive filters to, respectively, facilitate the in-loop and postfiltering for the enhancement of compressed frames. Finally, a companion website hosting the contents developed in this work can be accessed publicly at https://purdueviper.github.io/dnn-coding/.
引用
收藏
页码:1494 / 1520
页数:27
相关论文
共 50 条
  • [21] Review and Evaluation of End-to-End Video Compression with Deep-Learning
    Yasin, Hajar Maseeh
    Ameen, Siddeeq Yosef
    2021 INTERNATIONAL CONFERENCE OF MODERN TRENDS IN INFORMATION AND COMMUNICATION TECHNOLOGY INDUSTRY (MTICTI 2021), 2021, : 81 - 88
  • [22] Background subtraction for video sequence using deep neural network
    Dai, Yuan
    Yang, Long
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (35) : 82281 - 82302
  • [23] Universal Deep Neural Network Compression
    Choi, Yoojin
    El-Khamy, Mostafa
    Lee, Jungwon
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2020, 14 (04) : 715 - 726
  • [24] Deep Hybrid Neural Network With Attention Mechanism for Video Hash Retrieval Method
    Wu, Kaixing
    Xu, Li
    IEEE ACCESS, 2023, 11 : 47956 - 47966
  • [25] A Deep Neural Network based Multimodal Video Recognition System for Caring
    Yan, Chao
    Xu, Jiahua
    Klopfer, Bastian
    Nuernberger, Andreas
    PROCEEDINGS OF THE 2020 IEEE INTERNATIONAL CONFERENCE ON HUMAN-MACHINE SYSTEMS (ICHMS), 2020, : 472 - 476
  • [26] Parallel Blockwise Knowledge Distillation for Deep Neural Network Compression
    Blakeney, Cody
    Li, Xiaomin
    Yan, Yan
    Zong, Ziliang
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (07) : 1765 - 1776
  • [27] Review prognosis system to predict employees job satisfaction using deep neural network
    Rustam, Furqan
    Ashraf, Imran
    Shafique, Rahman
    Mehmood, Arif
    Ullah, Saleem
    Sang Choi, Gyu
    COMPUTATIONAL INTELLIGENCE, 2021, 37 (02) : 964 - 990
  • [28] Literature Review of Deep Network Compression
    Alqahtani, Ali
    Xie, Xianghua
    Jones, Mark W.
    INFORMATICS-BASEL, 2021, 8 (04):
  • [29] Identification of industrial air compression system using neural network
    Khong, Fan-Hao
    Abd Samad, Md Fahmi
    Tamadaran, Brahmataran
    INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING, 2022, 13 (05)
  • [30] Recurrent Neural Network-Based Video Compression
    Montajabi, Zahra
    Ghassab, Vahid Khorasani
    Bouguila, Nizar
    2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 925 - 930