TBNet: A Two-Stream Boundary-Aware Network for Generic Image Manipulation Localization

被引:18
|
作者
Gao, Zan [1 ,2 ]
Sun, Chao [1 ]
Cheng, Zhiyong [1 ]
Guan, Weili [3 ]
Liu, Anan [4 ]
Wang, Meng [5 ]
机构
[1] Qilu Univ Technol, Shandong Acad Sci, Shandong Artificial Intelligence Inst, Jinan 250316, Shandong, Peoples R China
[2] Tianjin Univ Technol, Key Lab Comp Vis & Syst, Minist Educ, Tianjin 300384, Peoples R China
[3] Monash Univ, Fac Informat Technol, Clayton Campus, Clayton, Vic 3800, Australia
[4] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[5] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230009, Peoples R China
基金
中国国家自然科学基金;
关键词
Splicing; Location awareness; Streaming media; Frequency-domain analysis; Task analysis; Feature extraction; Image color analysis; Adaptive cross-attention fusion; adaptive frequency selection; boundary artifact localization; generic image manipulation localization; two-stream boundary-aware; SPLICING FORGERY;
D O I
10.1109/TKDE.2022.3187091
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Finding tampered regions in images is a common research topic in machine learning and computer vision. Although many image manipulation location algorithms have been proposed, most of them only focus on RGB images with different color spaces, and the frequency information that contains the potential tampering clues is often ignored. Moreover, among the manipulation operations, splicing and copy-move are two frequently used methods, but as their characteristics are quite different, specific methods have been individually designed for detecting the operations of either splicing or copy-move, and it is very difficult to widely apply these methods in practice. To solve these issues, in this work, a novel end-to-end two-stream boundary-aware network (abbreviated as TBNet) is proposed for generic image manipulation localization where the RGB stream, the frequency stream, and the boundary artifact location are explored in a unified framework. Specifically, we first design an adaptive frequency selection module (AFS) to adaptively select the appropriate frequency to mine inconsistent statistics and eliminate the interference of redundant statistics. Then, an adaptive cross-attention fusion module (ACF) is proposed to adaptively fuse the RGB feature and the frequency feature. Finally, the boundary artifact location network (BAL) is designed to locate the boundary artifacts for which the parameters are jointly updated by the outputs of the ACF, and its results are further fed into the decoder. Thus, the parameters of the RGB stream, the frequency stream, and the boundary artifact location network are jointly optimized, and their latent complementary relationships are fully mined. The results of the extensive experiments performed on six public benchmarks of the image manipulation localization task, namely, CASIA1.0, COVER, Carvalho, In-The-Wild, NIST-16, and IMD-2020, demonstrate that the proposed TBNet can substantially outperform state-of-the-art generic image manipulation localization methods in terms of MCC, F1, and AUC while maintaining robustness with respect to various attacks. Compared with DeepLabV3+ on the CASIA1.0, COVER, Carvalho, In-The-Wild, and NIST-16 datasets, the improvements in MCC/F1 reach 11%/11.1%, 8.2%/10.3%, 10.2%/11.6%, 8.9%/6.2%, and 13.3%/16.0%, respectively. Moreover, when IMD2020 is utilized, its AUC improvement can achieve 14.7%.
引用
收藏
页码:7541 / 7556
页数:16
相关论文
共 50 条
  • [31] StfNet: A Two-Stream Convolutional Neural Network for Spatiotemporal Image Fusion
    Liu, Xun
    Deng, Chenwei
    Chanussot, Jocelyn
    Hong, Danfeng
    Zhao, Baojun
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (09): : 6552 - 6564
  • [32] Boundary-Aware Bilateral Fusion Network for Cloud Detection
    Zhao, Chao
    Zhang, Xiang
    Kuang, Nailiang
    Luo, Hangzai
    Zhong, Sheng
    Fan, Jianping
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [33] A Quaternion Two-Stream R-CNN Network for Pixel-Level Color Image Splicing Localization
    CHEN Beijing
    JU Xingwang
    GAO Ye
    WANG Jinwei
    Chinese Journal of Electronics, 2021, 30 (06) : 1069 - 1079
  • [34] A Quaternion Two-Stream R-CNN Network for Pixel-Level Color Image Splicing Localization
    Chen Beijing
    Ju Xingwang
    Gao Ye
    Wang Jinwei
    CHINESE JOURNAL OF ELECTRONICS, 2021, 30 (06) : 1069 - 1079
  • [35] Boundary-aware registration network for 4D-CT lung image with sliding motion
    Duan, Luwen
    Cao, Yuzhu
    Wang, Ziyu
    Liu, Desen
    Fu, Tianxiao
    Yuan, Gang
    Zheng, Jian
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 86
  • [36] Two-Stream Edge-Aware Network for Infrared and Visible Image Fusion With Multi-Level Wavelet Decomposition
    Wang, Haozhe
    Shu, Chang
    Li, Xiaofeng
    Fu, Yu
    Fu, Zhizhong
    Yin, Xiaofeng
    IEEE ACCESS, 2024, 12 : 22190 - 22204
  • [37] Driver Distraction Recognition with Pose-aware Two-stream Convolutional Neural Network
    Tao, Chenghao
    Ma, Sheqiang
    Proceedings of SPIE - The International Society for Optical Engineering, 2023, 12790
  • [38] Attention-Based Multi-Kernelized and Boundary-Aware Network for image semantic segmentation
    Zhou, Xuanchen
    Wu, Gengshen
    Sun, Xin
    Hu, Pengpeng
    Liu, Yi
    NEUROCOMPUTING, 2024, 597
  • [39] Robust Detection of Image Operator Chain with Two-Stream Convolutional Neural Network
    Liao, Xin
    Li, Kaide
    Zhu, Xinshan
    Liu, K. J. Ray
    IEEE Journal on Selected Topics in Signal Processing, 2020, 5 (955-968): : 955 - 968
  • [40] Video classification by fusing two-stream image template classification and pretrained network
    Zebhi, Saeedeh
    AlModarresi, Seyed M. T.
    Abootalebi, Vahid
    JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (05)