Temporal Adaptive Learned Surveillance Video Compression

被引:0
作者
Zhao, Yu [1 ]
Ye, Mao [1 ]
Ji, Luping [1 ]
Guo, Hongwei [2 ]
Zhu, Ce [3 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[2] Honghe Univ, Sch Engn, Mengzi 661100, Yunnan, Peoples R China
[3] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
Video compression; Image coding; Surveillance; Encoding; Video coding; Adaptation models; Predictive models; temporal adaptation; surveillance video; PREDICTION; HEVC;
D O I
10.1109/TBC.2024.3434736
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As the amount of surveillance video data increases at an exponential rate, the need for efficient video compression algorithms becomes increasingly urgent. The inter-frame compression schemes of existing surveillance video compression methods predict the current frame through the previous frame, causing the error to gradually increase because the quality of the reference frame decreases progressively. In this paper, we propose a Temporal Adaptive enhancement method for Learned surveillance video Compression (TALC). The proposed TALC has two modules: Forward Temporal Adaptive (FTA) module and Backward Temporal Adaptive (BTA) module which are put before and after motion and residual bits transmission modules respectively. These two modules have the same network structure which consists of a Temporal Adaptive Selection (TAS) block and a Feature Enhancement (FE) block. TAS block can analyze the extent which errors accumulate in optical flow and residuals, then select the corresponding enhancement sub-block; while FE block consists of several enhancement sub-blocks according to different levels of error accumulation. The proposed TALC has strong versatility and low coupling, which can be applied in almost all learned video compression frameworks as a plugin. Experimental results show that the proposed TALC method can significantly improve the coding performance of learned surveillance video compression networks without changing the original basic structure.
引用
收藏
页码:142 / 153
页数:12
相关论文
共 74 条
  • [1] Fusion of Global and Local Motion Estimation Using Foreground Objects for Distributed Video Coding
    Abou-Elailah, Abdalbassir
    Dufaux, Frederic
    Farah, Joumana
    Cagnazzo, Marco
    Srivastava, Anuj
    Pesquet-Popescu, Beatrice
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2015, 25 (06) : 973 - 987
  • [2] Agustsson E, 2020, PROC CVPR IEEE, P8500, DOI 10.1109/CVPR42600.2020.00853
  • [3] [Anonymous], 2024, Video Test Media
  • [4] [Anonymous], 2019, J. Inst. Electron. Comput.
  • [5] Bjontegaard G., 2001, VCEGM33
  • [6] Brutzer S, 2011, PROC CVPR IEEE
  • [7] Foreground/background bit allocation for region-of-interest coding
    Chai, D
    Ngan, KN
    Bouzerdoum, A
    [J]. 2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, 2000, : 923 - 926
  • [8] A Robust Moving Object Detection in Multi-Scenario Big Data for Video Surveillance
    Chen, Bo-Hao
    Shi, Ling-Feng
    Ke, Xiao
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (04) : 982 - 995
  • [9] Dhar P. K., 2012, Int. J. Signal Process., Image Process. Pattern Recognit., V5, P93
  • [10] Neural Inter-Frame Compression for Video Coding
    Djelouah, Abdelaziz
    Campos, Joaquim
    Schaub-Meyer, Simone
    Schroers, Christopher
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6430 - 6438