Screen shooting resistant watermarking based on cross attention

Authors
Lianshan Liu [1]
Peng Xu [1]
Qianwen Xue [2]
Affiliations
[1] Shandong University of Science and Technology, College of Computer Science and Engineering
[2] Qingdao Maternal & Child Health and Family Planning Service Center
Keywords
Robust watermarking; Screen-shooting; Deep learning; Cross attention
DOI
10.1038/s41598-025-00912-8
Abstract
With the spread of digital imaging devices, photographing sensitive information displayed on screens with mobile phones and cameras has become a prominent channel for data leaks. To trace the source of such leaks, Screen-Shooting Resistant Watermarking (SSRW) has attracted considerable attention. Most existing solutions embed watermarks with Convolutional Neural Networks (CNNs). However, because CNNs have a limited receptive field, they extract local features well but cannot capture the image as a whole. This paper presents a new screen-shooting resistant watermarking scheme that embeds the watermark with multi-head attention and cross-attention, which replace the encoder in the end-to-end architecture. Specifically, we segment the image and the watermark into smaller patches and apply positional embedding. We then compute attention scores through multi-head attention layers and generate the encoded image through concatenation. This approach improves the model's ability to capture global image context, thereby improving performance. In addition, we use an enhanced U-Net structure in place of the decoder of the end-to-end architecture. Experimental results show that the proposed method not only achieves more than 95% accuracy in different capture scenarios but also outperforms current state-of-the-art (SOTA) methods in robustness and invisibility. The watermarked images reach average PSNR and SSIM values of 41.90 dB and 0.99, indicating high visual quality.
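The embedding step described in the abstract (patch splitting, positional embedding, multi-head attention, concatenation) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The patch sizes, the model width `d_model`, the random projection weights, and the choice of image tokens as queries with watermark tokens as keys and values are all assumptions made for illustration, since the abstract does not specify them.

```python
import numpy as np

def patchify(img, p):
    """Split an (H, W, C) array into (N, p*p*C) flattened non-overlapping patches."""
    h, w, c = img.shape
    assert h % p == 0 and w % p == 0, "image size must be divisible by patch size"
    x = img.reshape(h // p, p, w // p, p, c).transpose(0, 2, 1, 3, 4)
    return x.reshape(-1, p * p * c)

def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def multi_head_cross_attention(q_tokens, kv_tokens, params, num_heads):
    """Cross-attention: queries come from one token set, keys/values from another.

    Assumption for this sketch: q_tokens are image tokens and kv_tokens are
    watermark tokens. Returns the attended tokens and the attention scores.
    """
    wq, wk, wv, wo = params
    n, d = q_tokens.shape
    m = kv_tokens.shape[0]
    dh = d // num_heads  # per-head width

    def split_heads(x, rows):
        # (rows, d) -> (num_heads, rows, dh)
        return x.reshape(rows, num_heads, dh).transpose(1, 0, 2)

    q = split_heads(q_tokens @ wq, n)
    k = split_heads(kv_tokens @ wk, m)
    v = split_heads(kv_tokens @ wv, m)
    scores = softmax(q @ k.transpose(0, 2, 1) / np.sqrt(dh))  # (heads, n, m)
    merged = (scores @ v).transpose(1, 0, 2).reshape(n, d)    # merge heads
    return merged @ wo, scores

# Toy data with hypothetical sizes: a 32x32 RGB image and an 8x8 binary watermark.
rng = np.random.default_rng(0)
image = rng.random((32, 32, 3))
watermark = rng.integers(0, 2, size=(8, 8, 1)).astype(float)

d_model, num_heads = 16, 4
img_patches = patchify(image, 8)     # 16 patches of 8*8*3 = 192 values
wm_patches = patchify(watermark, 2)  # 16 patches of 2*2*1 = 4 values

# Linear patch projection plus a positional embedding (random stand-ins for learned weights).
img_tokens = (img_patches @ rng.normal(size=(192, d_model))) * 0.1 \
    + rng.normal(size=(16, d_model)) * 0.02
wm_tokens = (wm_patches @ rng.normal(size=(4, d_model))) * 0.1 \
    + rng.normal(size=(16, d_model)) * 0.02

params = tuple(rng.normal(size=(d_model, d_model)) * 0.1 for _ in range(4))
attended, scores = multi_head_cross_attention(img_tokens, wm_tokens, params, num_heads)

# Concatenate image tokens with the watermark-aware tokens; a real encoder
# would map this fused representation back to pixel space.
fused = np.concatenate([img_tokens, attended], axis=-1)
```

Because every image token attends to every watermark token, each output token can draw on the whole watermark rather than a local neighborhood, which is the global-context property the abstract contrasts with the limited receptive field of CNNs.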