WFormer: A Transformer-Based Soft Fusion Model for Robust Image Watermarking

Cited by: 3

Authors:

Luo, Ting [1]

Wu, Jun [2]

He, Zhouyan [1]

Xu, Haiyong [2]

Jiang, Gangyi [2]

Chang, Chin-Chen [3]

Affiliations:

[1] Ningbo Univ, Coll Sci & Technol, Ningbo 315212, Peoples R China

[2] Ningbo Univ, Fac Informat Sci & Engn, Ningbo 315211, Peoples R China

[3] Feng Chia Univ, Dept Informat Engn & Comp Sci, Taichung 40724, Taiwan

Source:

IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE | 2024

Funding:

National Natural Science Foundation of China;

Keywords:

Watermarking; Feature extraction; Transformers; Decoding; Convolution; Noise; Robustness; transformer; soft fusion; cross-attention;

DOI:

10.1109/TETCI.2024.3386916

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Most deep neural network (DNN) based image watermarking models employ an encoder-noise-decoder structure, in which the watermark is simply duplicated for expansion and then directly fused with image features to produce the encoded image. However, simple duplication makes the watermark overly redundant, and communication between the cover image and the watermark, which lie in different domains, is lacking during image feature extraction and direct fusion, which degrades watermarking performance. To address these drawbacks, this paper proposes a Transformer-based soft fusion model for robust image watermarking, namely WFormer. Specifically, to expand the watermark effectively, a watermark preprocess module (WPM) is designed with Transformers to extract valid and expanded watermark features by computing its self-attention. Then, to replace direct fusion, a soft fusion module (SFM) is deployed to integrate Transformers into the fusion of the image with the watermark by mining their long-range correlations. Precisely, self-attention is computed to extract their own latent features, while cross-attention is learned to bridge their gap and embed the watermark effectively. In addition, a feature enhancement module (FEM) builds communication between the cover image and the watermark by capturing their cross-feature dependencies, which tunes image features in accordance with watermark features for better fusion. Experimental results show that the proposed WFormer outperforms existing state-of-the-art watermarking models in terms of invisibility, robustness, and embedding capacity. Furthermore, ablation results confirm the effectiveness of the WPM, the FEM, and the SFM.
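
The abstract's distinction between self-attention and cross-attention can be illustrated with a minimal sketch. This is not the WFormer architecture: it assumes single-head scaled dot-product attention with no learned projections, and the names `attention`, `soft_fuse`, `image_tokens`, and `watermark_tokens` are hypothetical, introduced only for illustration. In this sketch the image tokens act as queries over the watermark tokens, and the result is added back to the image tokens as a residual.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention: each query gets a weighted mix of values."""
    scale = math.sqrt(len(keys[0]))
    mixed = []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in keys]
        weights = softmax(scores)
        mixed.append([
            sum(w * v[d] for w, v in zip(weights, values))
            for d in range(len(values[0]))
        ])
    return mixed


def soft_fuse(image_tokens, watermark_tokens):
    """Hypothetical fusion step (an assumption, not the paper's SFM):
    image tokens query the watermark tokens, then a residual add."""
    cross = attention(image_tokens, watermark_tokens, watermark_tokens)
    return [[x + c for x, c in zip(tok, mix)]
            for tok, mix in zip(image_tokens, cross)]


# Toy inputs: three 2-D image tokens and two 2-D watermark tokens.
image_tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
watermark_tokens = [[1.0, -1.0], [-1.0, 1.0]]

# Self-attention: a sequence attends to itself.
self_attended = attention(watermark_tokens, watermark_tokens, watermark_tokens)

# Cross-attention: one sequence (image) attends to the other (watermark).
fused = soft_fuse(image_tokens, watermark_tokens)
```

Self-attention passes the same sequence as queries, keys, and values, whereas cross-attention draws queries from one sequence and keys and values from the other, which is what lets features from two different domains influence each other.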

Citation

Pages: 4179-4196

Page count: 18
