TBNet: A Two-Stream Boundary-Aware Network for Generic Image Manipulation Localization

Cited by: 18
Authors
Gao, Zan [1, 2]
Sun, Chao [1 ]
Cheng, Zhiyong [1 ]
Guan, Weili [3 ]
Liu, Anan [4 ]
Wang, Meng [5 ]
Affiliations
[1] Qilu Univ Technol, Shandong Acad Sci, Shandong Artificial Intelligence Inst, Jinan 250316, Shandong, Peoples R China
[2] Tianjin Univ Technol, Key Lab Comp Vis & Syst, Minist Educ, Tianjin 300384, Peoples R China
[3] Monash Univ, Fac Informat Technol, Clayton Campus, Clayton, Vic 3800, Australia
[4] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[5] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230009, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Splicing; Location awareness; Streaming media; Frequency-domain analysis; Task analysis; Feature extraction; Image color analysis; Adaptive cross-attention fusion; adaptive frequency selection; boundary artifact localization; generic image manipulation localization; two-stream boundary-aware; SPLICING FORGERY;
DOI
10.1109/TKDE.2022.3187091
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Locating tampered regions in images is a common research topic in machine learning and computer vision. Although many image manipulation localization algorithms have been proposed, most of them focus only on RGB images in different color spaces, and frequency information, which may contain tampering clues, is often ignored. Moreover, splicing and copy-move are two frequently used manipulation operations, but because their characteristics differ considerably, methods have typically been designed to detect either splicing or copy-move individually, which makes them difficult to apply widely in practice. To address these issues, this work proposes a novel end-to-end two-stream boundary-aware network (TBNet) for generic image manipulation localization, in which the RGB stream, the frequency stream, and boundary artifact localization are explored in a unified framework. Specifically, an adaptive frequency selection module (AFS) is first designed to adaptively select the appropriate frequencies to mine inconsistent statistics and eliminate the interference of redundant statistics. Then, an adaptive cross-attention fusion module (ACF) is proposed to adaptively fuse the RGB features and the frequency features. Finally, a boundary artifact location network (BAL) is designed to locate boundary artifacts; its parameters are jointly updated using the outputs of the ACF, and its results are further fed into the decoder. Thus, the parameters of the RGB stream, the frequency stream, and the boundary artifact location network are jointly optimized, and their latent complementary relationships are fully mined.
Extensive experiments on six public benchmarks for image manipulation localization, namely CASIA1.0, COVER, Carvalho, In-The-Wild, NIST-16, and IMD-2020, demonstrate that TBNet substantially outperforms state-of-the-art generic image manipulation localization methods in terms of MCC, F1, and AUC while remaining robust to various attacks. Compared with DeepLabV3+ on the CASIA1.0, COVER, Carvalho, In-The-Wild, and NIST-16 datasets, the improvements in MCC/F1 reach 11%/11.1%, 8.2%/10.3%, 10.2%/11.6%, 8.9%/6.2%, and 13.3%/16.0%, respectively. Moreover, on IMD-2020, the AUC improvement reaches 14.7%.
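The abstract reports results in terms of MCC and F1 but does not say how they are computed. As a minimal sketch, the standard pixel-level definitions can be written as below. It assumes binary masks (1 = tampered, 0 = authentic) given as nested lists; the function names are hypothetical, and the paper's own evaluation protocol (thresholding, per-image averaging) is not described in this record and may differ.

```python
import math


def confusion_counts(pred, truth):
    """Count (TP, FP, FN, TN) over two equally sized binary masks."""
    tp = fp = fn = tn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1
            elif p:
                fp += 1
            elif t:
                fn += 1
            else:
                tn += 1
    return tp, fp, fn, tn


def f1_score(tp, fp, fn, tn):
    """F1 = 2TP / (2TP + FP + FN); returns 0.0 when undefined."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def mcc_score(tp, fp, fn, tn):
    """Matthews correlation coefficient; returns 0.0 when undefined."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Toy masks (illustrative only, not data from the paper).
truth = [[0, 0, 1, 1],
         [0, 0, 1, 1]]
pred = [[0, 1, 1, 1],
        [0, 0, 1, 0]]

counts = confusion_counts(pred, truth)
print(counts, f1_score(*counts), mcc_score(*counts))
```

Unlike F1, MCC also takes true negatives into account, which matters when the tampered region covers only a small part of the image.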
Pages: 7541 - 7556
Number of pages: 16
Related papers
50 in total
  • [41] Two-stream Attentive CNNs for Image Retrieval
    Yang, Fei
    Li, Jia
    Wei, Shikui
    Zheng, Qinjie
    Liu, Ting
    Zhao, Yao
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1513 - 1521
  • [42] Multiview Spatial-Spectral Two-Stream Network for Hyperspectral Image Unmixing
    Qi, Lin
    Chen, Zhenwei
    Gao, Feng
    Dong, Junyu
    Gao, Xinbo
    Du, Qian
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [43] Robust Detection of Image Operator Chain With Two-Stream Convolutional Neural Network
    Liao, Xin
    Li, Kaide
    Zhu, Xinshan
    Liu, K. J. Ray
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2020, 14 (05) : 955 - 968
  • [44] Enhancing Partially Spoofed Audio Localization with Boundary-aware Attention Mechanism
    Zhong, Jiafeng
    Li, Bin
    Yi, Jiangyan
    INTERSPEECH 2024, 2024, : 4838 - 4842
  • [45] Alternate guidance network for boundary-aware camouflaged object detection
    Yu, Jinhao
    Chen, Shuhan
    Lu, Lu
    Chen, Zeyu
    Xu, Xiuqi
    Hu, Xuelong
    Zhu, Jinrong
    MACHINE VISION AND APPLICATIONS, 2023, 34 (04)
  • [46] A Boundary-aware Distillation Network for Compressed Video Semantic Segmentation
    Lu, Hongchao
    Deng, Zhidong
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5354 - 5359
  • [47] Attentive Feedback Network for Boundary-Aware Salient Object Detection
    Feng, Mengyang
    Lu, Huchuan
    Ding, Errui
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1623 - 1632
  • [48] Alternate guidance network for boundary-aware camouflaged object detection
    Jinhao Yu
    Shuhan Chen
    Lu Lu
    Zeyu Chen
    Xiuqi Xu
    Xuelong Hu
    Jinrong Zhu
    Machine Vision and Applications, 2023, 34
  • [49] Adaptive Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization
    Zhai, Yuanhao
    Wang, Le
    Tang, Wei
    Zhang, Qilin
    Zheng, Nanning
    Doermann, David
    Yuan, Junsong
    Hua, Gang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 4136 - 4151
  • [50] Two-stream boundary constraints and relativistic generation adversarial network for building contour regularization
    Yin, Jichong
    Wu, Fang
    Zhai, Renjian
    Qiu, Yue
    Gong, Xianyong
    Xing, Ruixing
    Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2024, 53 (07) : 1444 - 1457