Temporal Adaptive Learned Surveillance Video Compression

被引：0

作者：

Zhao, Yu ^{[1
]}

Ye, Mao ^{[1
]}

Ji, Luping ^{[1
]}

Guo, Hongwei ^{[2
]}

Zhu, Ce ^{[3
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China

[2] Honghe Univ, Sch Engn, Mengzi 661100, Yunnan, Peoples R China

[3] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu, Peoples R China

来源：

IEEE TRANSACTIONS ON BROADCASTING | 2025年 / 71卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Video compression; Image coding; Surveillance; Encoding; Video coding; Adaptation models; Predictive models; temporal adaptation; surveillance video; PREDICTION; HEVC;

D O I：

10.1109/TBC.2024.3434736

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

As the amount of surveillance video data increases at an exponential rate, the need for efficient video compression algorithms becomes increasingly urgent. The inter-frame compression schemes of existing surveillance video compression methods predict the current frame through the previous frame, causing the error to gradually increase because the quality of the reference frame decreases progressively. In this paper, we propose a Temporal Adaptive enhancement method for Learned surveillance video Compression (TALC). The proposed TALC has two modules: Forward Temporal Adaptive (FTA) module and Backward Temporal Adaptive (BTA) module which are put before and after motion and residual bits transmission modules respectively. These two modules have the same network structure which consists of a Temporal Adaptive Selection (TAS) block and a Feature Enhancement (FE) block. TAS block can analyze the extent which errors accumulate in optical flow and residuals, then select the corresponding enhancement sub-block; while FE block consists of several enhancement sub-blocks according to different levels of error accumulation. The proposed TALC has strong versatility and low coupling, which can be applied in almost all learned video compression frameworks as a plugin. Experimental results show that the proposed TALC method can significantly improve the coding performance of learned surveillance video compression networks without changing the original basic structure.

引用

页码：142 / 153

页数：12

共 74 条

[1] Fusion of Global and Local Motion Estimation Using Foreground Objects for Distributed Video Coding
Abou-Elailah, Abdalbassir
Dufaux, Frederic
Farah, Joumana
Cagnazzo, Marco
Srivastava, Anuj
Pesquet-Popescu, Beatrice
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2015, 25 (06) : 973 - 987
[2] Agustsson E, 2020, PROC CVPR IEEE, P8500, DOI 10.1109/CVPR42600.2020.00853
[3] [Anonymous], 2024, Video Test Media
[4] [Anonymous], 2019, J. Inst. Electron. Comput.
[5] Bjontegaard G., 2001, VCEGM33
[6] Brutzer S, 2011, PROC CVPR IEEE
[7] Foreground/background bit allocation for region-of-interest coding
Chai, D
Ngan, KN
Bouzerdoum, A
[J]. 2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, 2000, : 923 - 926
[8] A Robust Moving Object Detection in Multi-Scenario Big Data for Video Surveillance
Chen, Bo-Hao
Shi, Ling-Feng
Ke, Xiao
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (04) : 982 - 995
[9] Dhar P. K., 2012, Int. J. Signal Process., Image Process. Pattern Recognit., V5, P93
[10] Neural Inter-Frame Compression for Video Coding
Djelouah, Abdelaziz
Campos, Joaquim
Schaub-Meyer, Simone
Schroers, Christopher
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6430 - 6438

← 1 2 3 4 5 6 7 8 →