Metaheuristic-based vector quantization approach: a new paradigm for neural network-based video compression

被引：9

作者：

Darwish, Saad M. ^{[1
]}

Almajtomi, Ahmed A. J. ^{[2
]}

机构：

[1] Alexandria Univ, Inst Grad Studies & Res, Dept Informat Technol, 163 Horreya Ave,POB 832, Alexandria 21526, Egypt

[2] Al Nahrain Univ, Coll Sci, Dept Comp Sci, Baghdad, Iraq

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2021年 / 80卷 / 05期

关键词：

Video compression; Intelligent vector quantization; Optimal codebook; Optimization; ALGORITHM;

D O I：

10.1007/s11042-020-10003-7

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Video compression has great significance in the communication of motion pictures. Video compression techniques try to remove the different types of redundancy within or between video sequences. In the temporal domain, the video compression techniques remove the redundancies between the highly correlated consequence frames of the video. In the spatial domain, the video compression techniques remove the redundancies between the highly correlated consequence pixels (samples) in the same frame. Evolving neural-networks based video coding research efforts are focused on improving existing video codecs by performing better predictions that are incorporated within the same codec framework or holistic methods of end-to-end video compression schemes. Current neural network-based video compression adapts static codebook to achieve compression that leads to learning inability from new samples. This paper proposes a modified video compression model that adapts the genetic algorithm to build an optimal codebook for adaptive vector quantization that is used as an activation function inside the neural network's hidden layer. Background subtraction algorithm is employed to extract motion objects within frames to generate the context-based initial codebook. Furthermore, Differential Pulse Code Modulation (DPCM) is utilized for lossless compression of significant wavelet coefficients; whereas low energy coefficients are lossy compressed using Learning Vector Quantization (LVQ) neural networks. Finally, Run Length Encoding (RLE) is engaged to encode the quantized coefficients to achieve a higher compression ratio. Experiments have proven the system's ability to achieve higher compression ratio with acceptable efficiency measured by PSNR.

引用

页码：7367 / 7396

页数：30

共 47 条

[1]

Afrabandpey H, 2014, 2014 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), P1, DOI 10.1109/ICCKE.2014.6993337

[2]

Atheeshwar M, 2014, INT J ADV RES ENG TE, V2, P5

[3]

Bernatin T, 2014, 2014 INTERNATIONAL CONFERENCE ON CONTROL, INSTRUMENTATION, COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICCICCT), P452, DOI 10.1109/ICCICCT.2014.6993004

[4]

Boufares O, 2016, INT J ADV COMPUT SC, V7, P29

[5] Codebook Optimization in Vector Quantization using Genetic Algorithm [J].

Chavan, Pramod Uttamrao ;

Chavan, Pratibha Pramod ;

Dandawate, Yogesh Haribhau .

SECOND INTERNATIONAL CONFERENCE ON COMPUTER AND ELECTRICAL ENGINEERING, VOL 1, PROCEEDINGS, 2009, :280-+

[6]

Chen T, 2017, ADV MAT SCI ENG, V2017, P1, DOI DOI 10.1109/NAPS.2017.8107189

[7] Design of Efficient Perspective Affine Motion Estimation/Compensation for Versatile Video Coding (VVC) Standard [J].

Choi, Young-Ju ;

Jun, Dong-San ;

Cheong, Won-Sik ;

Kim, Byung-Gyu .

ELECTRONICS, 2019, 8 (09)

[8]

Duch W, 2005, P 15 INT C SCI BUS M, P11

[9]

Elmolla AM, 2015, INT J COMPUT SCI TEL, V6, P7

[10]

Elsayad AM, 2016, TECHNICAL REPORT

← 1 2 3 4 5 →