Object Detection-Based Video Compression

被引:3
|
作者
Kim, Myung-Jun [1 ]
Lee, Yung-Lyul [1 ]
机构
[1] Sejong Univ, Dept Comp Engn, Seoul 05006, South Korea
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 09期
基金
新加坡国家研究基金会;
关键词
object detection; video compression; VVC (Versatile Video Coding); video coding application; quantization;
D O I
10.3390/app12094525
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Video compression is designed to provide good subjective image quality, even at a high-compression ratio. In addition, video quality metrics have been used to show the results can maintain a high Peak Signal-to-Noise Ratio (PSNR), even at high compression. However, there are many difficulties in object recognition on the decoder side due to the low image quality caused by high compression. Accordingly, providing good image quality for the detected objects is necessary for the given total bitrate for utilizing object detection in a video decoder. In this paper, object detection-based video compression by the encoder and decoder is proposed that allocates lower quantization parameters to the detected-object regions and higher quantization parameters to the background. Therefore, better image quality is obtained for the detected objects on the decoder side. Object detection-based video compression consists of two types: Versatile Video Coding (VVC) and object detection. In this paper, the decoder performs the decompression process by receiving the bitstreams in the object-detection decoder and the VVC decoder. In the proposed method, the VVC encoder and decoder are processed based on the information obtained from object detection. In a random access (RA) configuration, the average Bjontegaard Delta (BD)-rates of Y, Cb, and Cr increased by 2.33%, 2.67%, and 2.78%, respectively. In an All Intra (AI) configuration, the average BD-rates of Y, Cb, and Cr increased by 0.59%, 1.66%, and 1.42%, respectively. In an RA configuration, the averages of Delta Y-PSNR, Delta Cb-PSNR, and Delta Cr-PSNR for the object-detected areas improved to 0.17%, 0.23%, and 0.04%, respectively. In an AI configuration, the averages of Delta Y-PSNR, Delta Cb-PSNR, and Delta Cr-PSNR for the object-detected areas improved to 0.71%, 0.30%, and 0.30%, respectively. Subjective image quality was also improved in the object-detected areas.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Object Detection-Based One-Shot Imitation Learning with an RGB-D Camera
    Shao, Quanquan
    Qi, Jin
    Ma, Jin
    Fang, Yi
    Wang, Weiming
    Hu, Jie
    APPLIED SCIENCES-BASEL, 2020, 10 (03):
  • [22] Vehicle trajectory recognition based on video object detection
    Wang, Saisai
    Wang, Ping
    Wang, Jun
    Jin, Yinli
    PROCEEDINGS OF THE 15TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2020), 2020, : 1679 - 1683
  • [23] VMS Video Search System based on Object Detection
    Ko, Jong Gook
    Park, Jong Youl
    2015 IEEE 4TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE), 2015, : 252 - 253
  • [24] Performance Evaluation of Semantic Video Compression using Multi-cue Object Detection
    AL-Shakarji, Noor M.
    Bunyak, Filiz
    Aliakbarpour, Hadi
    Seetharaman, Guna
    Palaniappan, Kannappan
    2019 IEEE APPLIED IMAGERY PATTERN RECOGNITION WORKSHOP (AIPR), 2019,
  • [25] Model Compression in Object Detection
    Salvi, Andrey de Aguiar
    Barros, Rodrigo C.
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [26] An object detection-based model for automated screening of stem-cells senescence during drug screening
    Ren, Yu
    Song, Youyi
    Li, Mingzhu
    He, Liangge
    Xiao, Chunlun
    Yang, Peng
    Zhang, Yongtao
    Zhao, Cheng
    Wang, Tianfu
    Zhou, Guangqian
    Lei, Baiying
    NEURAL NETWORKS, 2025, 183
  • [27] An Efficient Model Compression Method for CNN Based Object Detection
    Qian, Liuchen
    Fu, Yuzhuo
    Liu, Ting
    PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 766 - 769
  • [28] Object-based Surveillance Video Compression using Foreground Motion Compensation
    Babu, R. Venkatesh
    Makur, Anamitra
    2006 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1- 5, 2006, : 584 - +
  • [29] Moving object detection in aerial video based on spatiotemporal saliency
    Shen Hao
    Li Shuxiao
    Zhu Chengfei
    Chang Hongxing
    Zhang Jinglan
    CHINESE JOURNAL OF AERONAUTICS, 2013, 26 (05) : 1211 - 1217
  • [30] Fusion strategies for context based object detection in video sequences
    Paletta, L
    Greindl, C
    Goyal, A
    IMAGE AND VIDEO COMMUNICATIONS AND PROCESSING 2003, PTS 1 AND 2, 2003, 5022 : 522 - 529