END-TO-END LEARNED SCALABLE MULTILAYER FEATURE COMPRESSION FOR MACHINE VISION TASKS

被引:0
作者
Chen, Qiaoxi [1 ]
Gao, Changsheng [1 ]
Liu, Dong [1 ]
机构
[1] Univ Sci & Technol China, Hefei 230026, Anhui, Peoples R China
来源
2024 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2024年
关键词
Feature compression; machine vision; scalable compression;
D O I
10.1109/ICIP51287.2024.10647798
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the field of Video Coding for Machines (VCM), scalable feature compression has attracted attention for its potential to support a variety of machine vision tasks. However, the existing scalable feature compression methods exhibit limited performance. To address this problem, we propose an end-to-end learned scalable multilayer feature compression method in this paper. First, we propose to leverage an end-to-end feature compression method, which can efficiently exploit redundancy among features through a learning approach, to improve compression efficiency. Second, we introduce a novel strategy involving the use of the transformed latent of the base layer as the conditional information for the enhancement layer. Given the learnable nature of our compression method, we propose to optimize the base layer and the enhancement layer jointly. The joint optimization encourages the base layer to produce more suitable conditional information for the enhancement layer. Comparative experiments against existing feature compression and image compression methods verify our approach's remarkable performance improvements.
引用
收藏
页码:1781 / 1787
页数:7
相关论文
共 20 条
[1]  
Ball‚ J, 2018, Arxiv, DOI arXiv:1802.01436
[2]   Lossy Intermediate Deep Learning Feature Compression and Evaluation [J].
Chen, Zhuo ;
Fan, Kui ;
Wang, Shiqi ;
Duan, Ling-Yu ;
Lin, Weisi ;
Kot, Alex C. .
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, :2414-2422
[3]   Task-Aware Quantization Network for JPEG Image Compression [J].
Choi, Jinyoung ;
Han, Bohyung .
COMPUTER VISION - ECCV 2020, PT XX, 2020, 12365 :309-324
[4]  
Gailly J.-l., 1992, GNU Operating System
[5]   Towards Task-Generic Image Compression: A Study of Semantics-Oriented Metrics [J].
Gao, Changsheng ;
Liu, Dong ;
Li, Li ;
Wu, Feng .
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 :721-735
[6]   ELIC: Efficient Learned Image Compression with Unevenly Grouped Space-Channel Contextual Adaptive Coding [J].
He, Dailan ;
Yang, Ziming ;
Peng, Weikun ;
Ma, Rui ;
Qin, Hongwei ;
Wang, Yan .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :5708-5717
[7]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[8]   Collaborative Scalable Visual Compression for Human-Centered Videos [J].
Huang, Haofeng ;
Yang, Wenhan ;
Xiang, Wei ;
Liu, Jiaying ;
Duan, Ling-Yu .
2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, :2988-2992
[9]   End-to-End Learnable Multi-Scale Feature Compression for VCM [J].
Kim, Yeongwoong ;
Jeong, Hyewon ;
Yu, Janghyun ;
Kim, Younhee ;
Lee, Jooyoung ;
Jeong, Se Yoon ;
Kim, Hui Yong .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) :3156-3167
[10]  
King DB, 2015, ACS SYM SER, V1214, P1, DOI 10.1021/bk-2015-1214.ch001