Innovative Insights: A Review of Deep Learning Methods for Enhanced Video Compression

被引:0
|
作者
Khadir, Mohammad [1 ]
Farukh Hashmi, Mohammad [1 ]
Kotambkar, Deepali M. [2 ]
Gupta, Aditya [3 ]
机构
[1] Natl Inst Technol, Dept Elect & Commun Engn, Warangal 506004, India
[2] Ramdeobaba Univ, Dept Elect Engn, Nagpur 440013, India
[3] Univ Agder, Dept Informat & Commun Technol, N-4886 Grimstad, Norway
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Video compression; Video coding; Streaming media; Surveys; Transform coding; Deep learning; Reviews; Convolutional neural networks; Generative adversarial networks; Convolutional neural network; deep learning; deep neural network; generative adversarial network and video compression; NEURAL-NETWORKS; AUTO-ENCODER; CODEC; RECONSTRUCTION; DATABASE;
D O I
10.1109/ACCESS.2024.3450814
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video Compression (VC) is a significant aspect of multimedia technology, in which the goal to minimize the size of video data, while also preserving its perceptual quality, for effective transmission and storage. Traditional approaches such as transform coding, predictive coding, and entropy coding are some of the much earlier discovered approaches in this area. VC is a challenging concept which plays a significant role in the effective transmission of data with low storage and minimum bandwidth requirements. However, the limited processing power, storage, memory, lower compression rate and lower resolution are some factors that impact the functionality and performance of VC. This survey aims to encompass a comprehensive review of present DL approaches for VC, especially the application of advanced DL-based Neural Network (NN) algorithms that are developed for solving the aforementioned challenges of VC. Adaptability of DL algorithms is exploited to enhance the potential quality of compressed videos and to positively influence lossless video compression outcomes. The DL approaches include Deep Neural Networks (DNN) methods such as Convolutional Neural Networks (CNN), Generative Adversarial Networks (GAN), Recurrent Neural Networks (RNN), Deep Recurrent Auto-Encoder (DRAE), etc. This survey examines the relationships, strengths as well as problem statements of DL-based compression approaches of VC. Furthermore, this survey also deliberates on datasets, hardware specifications, comparative analysis, and research directions. This survey embeds DL-based computer vision approaches, with hardware accelerators like GPU and FPGA, to minimize the complexity of in a model. This survey aims to overcome the limitations of VC, such as the varying effectiveness of specific encoder approaches, the challenges in utilizing hardware accelerators, low-resource devices, and difficulties in managing the large-scale databases. Integrating DL-based approaches with existing standard codecs remains a significant challenge. Ensuring compatibility, interoperability, and standardization is important for widespread adoption and integration. Enhancing the interpretability and control of DL approaches permit for better customization of compression settings, allowing the users to balance bit rate and quality according to their specific requirements. To gather relevant studies, widespread VC datasets are researched and utilized such as, Ultra-Video-Group dataset (UVG), Video Trace Library (VTL), etc. The selection criteria for this study of VC techniques and deep learning (DL) approaches are chosen to focus on the integration of DL with codecs, which is a primary research area of interest. This integration provides valuable insights into advanced DL applications in overcoming challenges associated with VC. Frameworks such as TensorFlow, Keras, PyTorch are utilized to classify the approaches according to their fundamental NN architectures.
引用
收藏
页码:125706 / 125725
页数:20
相关论文
共 50 条
  • [41] Classification of Taekwondo Techniques using Deep Learning Methods: First Insights
    Barbosa, Paulo
    Cunha, Pedro
    Carvalho, Vitor
    Soares, Filomena
    BIODEVICES: PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES - VOL 1: BIODEVICES, 2021, : 201 - 208
  • [42] A Review on Deep Learning Techniques for Video Prediction
    Oprea, Sergiu
    Martinez-Gonzalez, Pablo
    Garcia-Garcia, Alberto
    Castro-Vargas, John Alejandro
    Orts-Escolano, Sergio
    Garcia-Rodriguez, Jose
    Argyros, Antonis
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (06) : 2806 - 2826
  • [43] Video Processing Using Deep Learning Techniques: A Systematic Literature Review
    Sharma, Vijeta
    Gupta, Manjari
    Kumar, Ajai
    Mishra, Deepti
    IEEE ACCESS, 2021, 9 : 139489 - 139507
  • [44] Deep Learning-Based Chroma Prediction for Intra Versatile Video Coding
    Zhu, Linwei
    Zhang, Yun
    Wang, Shiqi
    Kwong, Sam
    Jin, Xin
    Qiao, Yu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (08) : 3168 - 3181
  • [45] A Comprehensive Review of Deep Learning Approaches for Animal Detection on Video Data
    Kumar, Prashanth
    Luo, Suhuai
    Shaukat, Kamran
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (11) : 1420 - 1437
  • [46] Overview of Research in the field of Video Compression using Deep Neural Networks
    Birman, Raz
    Segal, Yoram
    Hadar, Ofer
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (17-18) : 11699 - 11722
  • [47] Deep Learning-Based Video Coding: A Review and a Case Study
    Liu, Dong
    Li, Yue
    Lin, Jianping
    Li, Houqiang
    Wu, Feng
    ACM COMPUTING SURVEYS, 2020, 53 (01)
  • [48] Study on deep CNN as preprocessing for video compression
    Bhosale, Kavita Arjun
    Kuk, Seungho
    Park, Sang-hyo
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XLIV, 2021, 11842
  • [49] Deep video compression with conditional feature coding
    Pientka, Sophie
    Pfaff, Jonathan
    Schwarz, Heiko
    Marpe, Detlev
    Wiegand, Thomas
    2024 PICTURE CODING SYMPOSIUM, PCS 2024, 2024,
  • [50] A Transformer based deep conditional video compression
    Lu G.
    Zhong T.
    Geng J.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2024, 50 (02): : 442 - 448