Toward Robust Neural Image Compression: Adversarial Attack and Model Finetuning

被引:15
作者
Chen, Tong [1 ]
Ma, Zhan [1 ]
机构
[1] Nanjing Univ, Sch Elect Sci & Engn, Nanjing 210093, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Neural image compression; model robustness; adversarial attack; adversarial training;
D O I
10.1109/TCSVT.2023.3276442
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deep neural network-based image compression has been extensively studied. However, the model robustness which is crucial to practical application is largely overlooked. We propose to examine the robustness of prevailing learned image compression models by injecting negligible adversarial perturbation into the original source image. Severe distortion in decoded reconstruction reveals the general vulnerability in existing methods regardless of their settings (e.g., network architecture, loss function, quality scale). A variety of defense strategies including geometric self-ensemble based pre-processing, and adversarial training, are investigated against the adversarial attack to improve the model's robustness. Later the defense efficiency is further exemplified in real-life image recompression case studies. Overall, our methodology is simple, effective, and generalizable, making it attractive for developing robust learned image compression solutions. All materials are made publicly accessible at https://njuvision.github.io/RobustNIC for reproducible research.
引用
收藏
页码:7842 / 7856
页数:15
相关论文
共 68 条
[1]   DISCRETE COSINE TRANSFORM [J].
AHMED, N ;
NATARAJAN, T ;
RAO, KR .
IEEE TRANSACTIONS ON COMPUTERS, 1974, C 23 (01) :90-93
[2]  
Nguyen A, 2015, PROC CVPR IEEE, P427, DOI 10.1109/CVPR.2015.7298640
[3]  
Bai Y, 2022, Arxiv, DOI [arXiv:2103.08307, DOI 10.48550/ARXIV.2103.08307]
[4]  
Ball‚ J, 2017, Arxiv, DOI arXiv:1611.01704
[5]  
Balle J, 2016, Arxiv, DOI arXiv:1511.06281
[6]  
Ball‚ J, 2018, Arxiv, DOI arXiv:1802.01436
[7]  
Begaint Jean., 2020, arXiv
[8]   Overview of the Versatile Video Coding (VVC) Standard and its Applications [J].
Bross, Benjamin ;
Wang, Ye-Kui ;
Ye, Yan ;
Liu, Shan ;
Chen, Jianle ;
Sullivan, Gary J. ;
Ohm, Jens-Rainer .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (10) :3736-3764
[9]  
Carlini N, 2017, Arxiv, DOI [arXiv:1608.04644, DOI 10.48550/ARXIV.1608.04644, 10.48550/arXiv.1608.04644]
[10]   End-to-End Learnt Image Compression via Non-Local Attention Optimization and Improved Context Modeling [J].
Chen, Tong ;
Liu, Haojie ;
Ma, Zhan ;
Shen, Qiu ;
Cao, Xun ;
Wang, Yao .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :3179-3191