Weakly-supervised cross-contrastive learning network for image manipulation detection and localization

被引:0
|
作者
Bai, Ruyi [1 ]
机构
[1] Shanxi Univ, Coll Automat & Software, Taiyuan 030006, Shanxi, Peoples R China
关键词
Weakly-supervised; Image manipulation detection and localization; Cross-contrastive learning; ATTENTION;
D O I
10.1016/j.knosys.2025.113033
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the significant reduction in the cost of image manipulation due to advancements in image editing tools, it is crucial to investigate methods for detecting image manipulation. Currently, mainstream methods are based on various types of deep learning models, which have achieved some success. However, these models largely rely on pixel-level ground truth annotations for supervision, leading to an increase in image-level false positives due to limited real images. Obtaining GT annotations is time-consuming and labor-intensive, and the supervised model has a high demand for tampering mask. To address these limitations, we propose a Weakly-Supervised CrossContrastive Learning (WSCCL) network that can detect and locate image manipulation based solely on imagelevel labels ('real'/'tampered'). Specifically, we first leverage a dual-stream encoder-decoder architecture to extract visual and noise features separately and generate corresponding prediction distribution maps. We then adopt an adaptive approach to fuse prediction distribution maps, obtaining weakly-supervised pseudo-label. We design the Cross-Contrastive Learning Module(CCLM) using different aggregation methods for different layer features in the encoder, and apply cross-contrastive learning for the fusion features and the predicted features maps generated by the decoder. Finally, WSCCL compares the similarity between the reconstructed image obtained from the decoder and the predicted distribution map to make the pseudo-label closer to the real GT. Furthermore, extensive experiments confirm that our approach based on weakly supervised learning is comparable to supervised learning, both at the image-level and pixel-level. WSCCL exhibits strong adaptability to various types of manipulation and high resistance to attacks. This study demonstrates that our weakly supervised learning method can compete fully with supervised learning, regardless of the level of manipulation or annotation.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Exploring weakly-supervised image manipulation localization with tampering Edge-based class activation map
    Zhou, Yang
    Wang, Hongxia
    Zeng, Qiang
    Zhang, Rui
    Meng, Sijiang
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [22] Weakly-Supervised Temporal Action Localization via Cross-Stream Collaborative Learning
    Ji, Yuan
    Jia, Xu
    Lu, Huchuan
    Ruan, Xiang
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 853 - 861
  • [23] Background Suppression Network for Weakly-Supervised Temporal Action Localization
    Lee, Pilhyeon
    Uh, Youngjung
    Byun, Hyeran
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11320 - 11327
  • [24] Adversarial Learning for Weakly-Supervised Social Network Alignment
    Li, Chaozhuo
    Wang, Senzhang
    Wang, Yukun
    Yu, Philip
    Liang, Yanbo
    Liu, Yun
    Li, Zhoujun
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 996 - 1003
  • [25] Entropy guided attention network for weakly-supervised action localization
    Cheng, Yi
    Sun, Ying
    Fan, Hehe
    Zhuo, Tao
    Lim, Joo-Hwee
    Kankanhalli, Mohan
    PATTERN RECOGNITION, 2022, 129
  • [26] Feature Matching Network for Weakly-Supervised Temporal Action Localization
    Dou, Peng
    Zhou, Wei
    Liao, Zhongke
    Hu, Haifeng
    PATTERN RECOGNITION AND COMPUTER VISION, PT IV, 2021, 13022 : 459 - 471
  • [27] Action Coherence Network for Weakly-Supervised Temporal Action Localization
    Zhai, Yuanhao
    Wang, Le
    Tang, Wei
    Zhang, Qilin
    Zheng, Nanning
    Hua, Gang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1857 - 1870
  • [28] Weakly supervised foreground learning for weakly supervised localization and detection
    Zhang, Chen -Lin
    Li, Yin
    Wu, Jianxin
    PATTERN RECOGNITION, 2023, 137
  • [29] Weakly-supervised Disentanglement Network for Video Fingerspelling Detection
    Jiang, Ziqi
    Zhang, Shengyu
    Yao, Siyuan
    Zhang, Wenqiao
    Zhang, Sihan
    Li, Juncheng
    Zhao, Zhou
    Wu, Fei
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5446 - 5455
  • [30] Temporal RPN Learning for Weakly-Supervised Temporal Action Localization
    Huang, Jing
    Kong, Ming
    Chen, Luyuan
    Liang, Tian
    Zhu, Qiang
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222