Exploring Multi-View Pixel Contrast for General and Robust Image Forgery Localization

被引:8
作者
Lou, Zijie [1 ,2 ]
Cao, Gang [1 ,2 ]
Guo, Kun [1 ,2 ]
Yu, Lifang [3 ]
Weng, Shaowei [4 ]
机构
[1] Commun Univ China, Sch Comp & Cyber Sci, Beijing 100024, Peoples R China
[2] Commun Univ China, State Key Lab Media Convergence & Commun, Beijing, Peoples R China
[3] Beijing Inst Grap Commun, Dept Informat Engn, Beijing 100026, Peoples R China
[4] Fujian Univ Technol, Fujian Prov Key Lab Big Data Min & Applicat, Fuzhou 350118, Peoples R China
基金
中国国家自然科学基金;
关键词
Location awareness; Forgery; Training; Head; Contrastive learning; Feature extraction; Protocols; Forensics; Testing; Accuracy; Digital forensics; image forensics; image forgery localization; multi-view contrastive learning; large scale testing;
D O I
10.1109/TIFS.2025.3541957
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Image forgery localization, which aims to segment tampered regions in an image, is a fundamental yet challenging digital forensic task. While some deep learning-based forensic methods have achieved impressive results, they directly learn pixel-to-label mappings without fully exploiting the relationship between pixels in the feature space. To address such deficiency, we propose a Multi-view Pixel-wise Contrastive algorithm (MPC) for image forgery localization. Specifically, we first pre-train the feature extraction backbone network with a supervised contrastive loss to model pixel relationships in view of within-image, cross-scale and cross-modality. That is aimed at increasing intra-class compactness and inter-class separability. Then the localization head is fine-tuned using cross-entropy loss, resulting in a better forged pixel localizer. The MPC is trained on three different scale training datasets to make a comprehensive and fair comparison with existing image forgery localization algorithms. Extensive test results on over ten public datasets show that the proposed MPC achieves higher generalization performance and robustness than the state-of-the-arts. It is particularly noteworthy that our approach maintains a high level of localization accuracy under various post-processing combinations that approximate real-world scenarios, as well as when confronted with novel intelligent editing techniques. Finally, comprehensive and detailed ablation experiments demonstrate the reasonableness of MPC.
引用
收藏
页码:2329 / 2341
页数:13
相关论文
共 65 条
[61]   Contrastive Learning for Label Efficient Semantic Segmentation [J].
Zhao, Xiangyun ;
Vemulapalli, Raviteja ;
Mansfield, Philip Andrew ;
Gong, Boqing ;
Green, Bradley ;
Shapira, Lior ;
Wu, Ying .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :10603-10613
[62]  
Zhou P, 2020, AAAI CONF ARTIF INTE, V34, P13058
[63]   Learning Rich Features for Image Manipulation Detection [J].
Zhou, Peng ;
Han, Xintong ;
Morariu, Vlad I. ;
Davis, Larry S. .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1053-1061
[64]   Effective image tampering localization with multi-scale ConvNeXt feature fusion [J].
Zhu, Haochen ;
Cao, Gang ;
Zhao, Mo ;
Tian, Huawei ;
Lin, Weiguo .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 98
[65]   Self-Adversarial Training Incorporating Forgery Attention for Image Forgery Localization [J].
Zhuo, Long ;
Tan, Shunquan ;
Li, Bin ;
Huang, Jiwu .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2022, 17 :819-834