CTNet: hybrid architecture based on CNN and transformer for image inpainting detection

被引:6
作者
Xiao, Fengjun [1 ]
Zhang, Zhuxi [2 ]
Yao, Ye [2 ]
机构
[1] Hangzhou Dianzi Univ, Zhejiang Informatizat Dev Inst, Xiasha Higher Educ Zone, Hangzhou 310018, Zhejiang, Peoples R China
[2] Hangzhou Dianzi Univ, Sch Cyberspace, Xiasha Higher Educ Zone, Hangzhou 310018, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Image inpainting detection; Deep neural network; Hybrid CNN-Transformer encoder; High-pass filter; DIFFUSION; NETWORK;
D O I
10.1007/s00530-023-01184-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Digital image inpainting technology has increasingly gained popularity as a result of the development of image processing and machine vision. However, digital image inpainting can be used not only to repair damaged photographs, but also to remove specific people or distort the semantic content of images. To address the issue of image inpainting forgeries, a hybrid CNN-Transformer Network (CTNet), which is composed of the hybrid CNN-Transformer encoder, the feature enhancement module, and the decoder module, is proposed for image inpainting detection and localization. Different from existing inpainting detection methods that rely on hand-crafted attention mechanisms, the hybrid CNN-Transformer encoder employs CNN as a feature extractor to build feature maps tokenized as the input patches of the Transformer encoder. The hybrid structure exploits the innate global self-attention mechanisms of Transformer and can effectively capture the long-term dependency of the image. Since inpainting traces mainly exist in the high-frequency components of digital images, the feature enhancement module performs feature extraction in the frequency domain. The decoder regularizes the upsampling process of the predicted masks with the assistance of high-frequency features. We investigate the generalization capacity of our CTNet on datasets generated by ten commonly used inpainting methods. The experimental results show that the proposed model can detect a variety of unknown inpainting operations after being trained on the datasets generated by a single inpainting method.
引用
收藏
页码:3819 / 3832
页数:14
相关论文
共 50 条
  • [1] CTNet: hybrid architecture based on CNN and transformer for image inpainting detection
    Fengjun Xiao
    Zhuxi Zhang
    Ye Yao
    Multimedia Systems, 2023, 29 (6) : 3819 - 3832
  • [2] A CNN-Transformer Hybrid Model Based on CSWin Transformer for UAV Image Object Detection
    Lu, Wanjie
    Lan, Chaozhen
    Niu, Chaoyang
    Liu, Wei
    Lyu, Liang
    Shi, Qunshan
    Wang, Shiju
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 1211 - 1231
  • [3] HyFormer: a hybrid transformer-CNN architecture for retinal OCT image segmentation
    Jiang, Qingxin
    Fan, Ying
    Li, Menghan
    Fang, Sheng
    Zhu, Weifang
    Xiang, Dehui
    Peng, Tao
    Chen, Xinjian
    Xu, Xun
    Shi, Fei
    BIOMEDICAL OPTICS EXPRESS, 2024, 15 (11): : 6156 - 6170
  • [4] Rethinking Image Deblurring via CNN-Transformer Multiscale Hybrid Architecture
    Zhao, Qian
    Yang, Hao
    Zhou, Dongming
    Cao, Jinde
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [5] Encoder-decoder-based CNN model for detection of object removal by image inpainting
    Kumar, Nitish
    Meenpal, Toshanlal
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (04)
  • [6] Image Inpainting Detection Based on High-Pass Filter Attention Network
    Xiao, Can
    Li, Feng
    Zhang, Dengyong
    Huang, Pu
    Ding, Xiangling
    Sheng, Victor S.
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2022, 43 (03): : 1145 - 1154
  • [7] HCformer: Hybrid CNN-Transformer for LDCT Image Denoising
    Yuan, Jinli
    Zhou, Feng
    Guo, Zhitao
    Li, Xiaozeng
    Yu, Hengyong
    JOURNAL OF DIGITAL IMAGING, 2023, 36 (05) : 2290 - 2305
  • [8] Land Cover Classification of UAV Remote Sensing Based on Transformer-CNN Hybrid Architecture
    Lu, Tingyu
    Wan, Luhe
    Qi, Shaoqun
    Gao, Meixiang
    SENSORS, 2023, 23 (11)
  • [9] TransCNN: Hybrid CNN and transformer mechanism for surveillance anomaly detection
    Ullah, Waseem
    Hussain, Tanveer
    Ullah, Fath U. Min
    Lee, Mi Young
    Baik, Sung Wook
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [10] Hybrid CNN-Transformer Feature Fusion for Single Image Deraining
    Chen, Xiang
    Pan, Jinshan
    Lu, Jiyang
    Fan, Zhentao
    Li, Hao
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 378 - 386