WFormer: A Transformer-Based Soft Fusion Model for Robust Image Watermarking

Cited by: 3
Authors
Luo, Ting [1 ]
Wu, Jun [2 ]
He, Zhouyan [1 ]
Xu, Haiyong [2 ]
Jiang, Gangyi [2 ]
Chang, Chin-Chen [3 ]
Affiliations
[1] Ningbo Univ, Coll Sci & Technol, Ningbo 315212, Peoples R China
[2] Ningbo Univ, Fac Informat Sci & Engn, Ningbo 315211, Peoples R China
[3] Feng Chia Univ, Dept Informat Engn & Comp Sci, Taichung 40724, Taiwan
Funding
National Natural Science Foundation of China;
Keywords
Watermarking; Feature extraction; Transformers; Decoding; Convolution; Noise; Robustness; transformer; soft fusion; cross-attention;
DOI
10.1109/TETCI.2024.3386916
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Most deep neural network (DNN) based image watermarking models employ an encoder-noise-decoder structure, in which the watermark is simply duplicated for expansion and then directly fused with image features to produce the encoded image. However, simple duplication makes the watermark over-redundant, and communication between the cover image and the watermark, which lie in different domains, is lacking during image feature extraction and direct fusion, which degrades watermarking performance. To address these drawbacks, this paper proposes a Transformer-based soft fusion model for robust image watermarking, namely WFormer. Specifically, to expand the watermark effectively, a watermark preprocess module (WPM) is designed with Transformers to extract valid and expanded watermark features by computing the watermark's self-attention. Then, to replace direct fusion, a soft fusion module (SFM) integrates Transformers into the fusion of the image with the watermark by mining their long-range correlations. More precisely, self-attention is computed to extract the latent features of each, while cross-attention is learned to bridge the gap between them and embed the watermark effectively. In addition, a feature enhancement module (FEM) builds communication between the cover image and the watermark by capturing their cross-feature dependencies, tuning image features in accordance with watermark features for better fusion. Experimental results show that the proposed WFormer outperforms existing state-of-the-art watermarking models in terms of invisibility, robustness, and embedding capacity. Furthermore, ablation results demonstrate the effectiveness of the WPM, the FEM, and the SFM.
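The self-attention and cross-attention operations mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the WFormer implementation: the token counts, the feature width `d_model`, and the omission of learned query/key/value projections are all assumptions made for illustration, and the record does not describe the actual layer design.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the row maximum for numerical stability.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(queries, keys, values):
    """Scaled dot-product attention: softmax(Q K^T / sqrt(d)) V."""
    d = queries.shape[-1]
    scores = queries @ keys.T / np.sqrt(d)
    weights = softmax(scores, axis=-1)
    return weights @ values, weights

rng = np.random.default_rng(0)
d_model = 16  # hypothetical feature width

# Hypothetical token sets: 64 image patch features, 30 watermark features.
image_tokens = rng.standard_normal((64, d_model))
watermark_tokens = rng.standard_normal((30, d_model))

# Self-attention: each token set attends to itself.
image_self, _ = attention(image_tokens, image_tokens, image_tokens)
watermark_self, _ = attention(watermark_tokens, watermark_tokens, watermark_tokens)

# Cross-attention: image tokens (queries) attend to watermark tokens
# (keys and values), so each image position gathers watermark information.
fused, cross_weights = attention(image_tokens, watermark_tokens, watermark_tokens)

print(fused.shape, cross_weights.shape)
```

In this sketch each row of `cross_weights` is a probability distribution over watermark tokens, which is what lets every image position draw on the whole watermark rather than on one duplicated copy.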
Pages: 1-18
Page count: 18