FP-Net: frequency-perception network with adversarial training for image manipulation localization

Cited by: 3
Authors
Gao, Jintong [1]
Huang, Yongping [2]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, 2699 Qianjin Rd, Changchun 130012, Peoples R China
[2] Jilin Univ, Coll Comp Sci & Technol, 2699 Qianjin Rd, Changchun 130012, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Image manipulation localization; Frequency perception; Attention mechanism; Adversarial learning; POINT CLOUDS; MPEG;
DOI
10.1007/s11042-023-17914-1
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Localizing the forged regions of digitally tampered images is one of the key research tasks in visual recognition. Although many algorithms address image manipulation localization, most approaches focus only on semantic information in the spatial domain and ignore the frequency inconsistency between authentic and tampered regions. In addition, the generality and robustness of the models are severely affected by the differing noise distributions of the training and test sets. To address these issues, we propose the frequency-perception network (FP-Net) with adversarial training for image manipulation localization. Our method not only captures representation information for identifying boundary artifacts in the spatial domain but also separates low- and high-frequency information in the frequency domain to acquire tampering cues. Specifically, the frequency separation sensing module enlarges the local sensing range by separating multi-scale frequency-domain features. It accurately identifies high-frequency noise features in the manipulated region and distinguishes them from low-frequency information. The global frequency attention module uses multiple sampling and convolution operations to interactively learn multi-scale feature information and integrate dual-domain frequency content to identify the physical locations of tampering. Adversarial training constructs hard adversarial training samples from adversarial attacks, which avoids interference from unevenly distributed redundant noise. Extensive experimental results show that the proposed method performs significantly better than mainstream approaches on five common standard datasets.
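The abstract's idea of separating low- and high-frequency content can be illustrated with a minimal sketch. This is not the paper's frequency separation sensing module, which operates on multi-scale learned features; it is a plain Fourier-domain split of a grayscale image, and the function name `split_frequencies` and the `radius` cutoff are assumptions made for illustration only.

```python
import numpy as np

def split_frequencies(image, radius):
    """Split a 2-D grayscale image into low- and high-frequency parts.

    Hypothetical helper: a circular mask of the given radius around the
    zero-frequency bin selects the low band; everything else is the high band.
    """
    h, w = image.shape
    # Move the zero-frequency component to the centre of the spectrum.
    spectrum = np.fft.fftshift(np.fft.fft2(image))
    yy, xx = np.ogrid[:h, :w]
    dist = np.sqrt((yy - h // 2) ** 2 + (xx - w // 2) ** 2)
    low_mask = dist <= radius
    # Invert each band separately; the two masks are complementary,
    # so the two parts sum back to the original image.
    low = np.fft.ifft2(np.fft.ifftshift(spectrum * low_mask)).real
    high = np.fft.ifft2(np.fft.ifftshift(spectrum * ~low_mask)).real
    return low, high

rng = np.random.default_rng(0)
image = rng.random((64, 64))
low, high = split_frequencies(image, radius=8)
```

In this sketch the high-frequency part carries fine noise and edge detail, which is the kind of signal the abstract says differs between authentic and tampered regions; a learned model would consume such bands as features rather than threshold them directly.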
Pages: 62721-62739
Page count: 19