Human-Machine Collaborative Image Compression Method Based on Implicit Neural Representations

被引:1
作者
Li, Huanyang [1 ]
Zhang, Xinfeng [1 ]
机构
[1] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China
关键词
Image compression; image coding for machine; implicit neural representation;
D O I
10.1109/JETCAS.2024.3386639
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the explosive increase in the volume of images intended for analysis by AI, image coding for machine have been proposed to transmit information in a machine-interpretable format, thereby enhancing image compression efficiency. However, such efficient coding schemes often lead to issues like loss of image details and features, and unclear semantic information due to high data compression ratio, making them less suitable for human vision domains. Thus, it is a critical problem to balance image visual quality and machine vision accuracy at a given compression ratio. To address these issues, we introduce a human-machine collaborative image coding framework based on Implicit Neural Representations (INR), which effectively reduces the transmitted information for machine vision tasks at the decoding side while maintaining high-efficiency image compression for human vision against INR compression framework. To enhance the model's perception of images for machine vision, we design a semantic embedding enhancement module to assist in understanding image semantics. Specifically, we employ the Swin Transformer model to initialize image features, ensuring that the embedding of the compression model are effectively applicable to downstream visual tasks. Extensive experimental results demonstrate that our method significantly outperforms other image compression methods in classification tasks while ensuring image compression efficiency.
引用
收藏
页码:198 / 208
页数:11
相关论文
共 62 条
[51]   Double JPEG compression forensics based on a convolutional neural network [J].
Wang Q. ;
Zhang R. .
EURASIP Journal on Information Security, 2016 (1)
[52]   Deep Image Compression Toward Machine Vision: A Unified Optimization Framework [J].
Wang, Shurun ;
Wang, Zhao ;
Wang, Shiqi ;
Ye, Yan .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (06) :2979-2989
[53]  
Xia XL, 2017, 2017 2ND INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2017), P783, DOI 10.1109/ICIVC.2017.7984661
[54]   Zero-Shot Learning - The Good, the Bad and the Ugly [J].
Xian, Yongqin ;
Schiele, Bernt ;
Akata, Zeynep .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3077-3086
[55]   Multimodal End-to-End Autonomous Driving [J].
Xiao, Yi ;
Codevilla, Felipe ;
Gurram, Akhil ;
Urfalioglu, Onay ;
Lopez, Antonio M. .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (01) :537-547
[56]   Reluplex made more practical: Leaky ReLU [J].
Xu, Jin ;
Li, Zishan ;
Du, Bowen ;
Zhang, Miaomiao ;
Liu, Jing .
2020 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2020, :703-709
[57]   Towards Coding for Human and Machine Vision: Scalable Face Image Coding [J].
Yang, Shuai ;
Hu, Yueyu ;
Yang, Wenhan ;
Duan, Ling-Yu ;
Liu, Jiaying .
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 :2957-2971
[58]  
Yang Wenhao, 2021, ARXIV
[59]   Enhanced Quantified Local Implicit Neural Representation for Image Compression [J].
Zhang, Gai ;
Zhang, Xinfeng ;
Tang, Lv .
IEEE SIGNAL PROCESSING LETTERS, 2023, 30 :1742-1746
[60]   Deep Reinforcement Learning Assisted Federated Learning Algorithm for Data Management of IIoT [J].
Zhang, Peiying ;
Wang, Chao ;
Jiang, Chunxiao ;
Han, Zhu .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (12) :8475-8484