DAMS: Document Image Steganography with Dual Attention Multi-scale Encoder-Decoder Architecture

被引:0
|
作者
Li, Kaijiang [1 ]
Qin, Yi [1 ]
Wang, Peisen [1 ]
Guo, Chunyi [1 ]
Wang, Junqi [2 ]
Jia, Ruiyang [1 ]
Jiang, Wenfeng [3 ]
机构
[1] Zhengzhou Univ, Zhengzhou, Peoples R China
[2] Zhengzhou Univ Aeronaut, Zhengzhou, Peoples R China
[3] China Mobile Grp Henan Co Ltd, Zhengzhou, Peoples R China
来源
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT II | 2025年 / 15032卷
关键词
Steganography; Document image; Channel attention; Transformer; Multi-scale feature fusion;
D O I
10.1007/978-981-97-8490-5_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the research field of steganography, advances in deep learning techniques have significantly improved the ability to embed secret messages into scene images. However, for document images with significant differences in color and background distributions, it is still a major challenge to ensure the invisibility of hidden information without interfering with the text-reading experience. To address this challenge, we propose an end-to-end framework designed specifically for document images, namely, the Dual Attention Multi-scale Encoder-Decoder Architecture (DAMS). The DAMS framework takes into full consideration of the pixel distributions and value deviations caused during the formation of document images. To balance the information embedding and extraction processes, the encoder and decoder adopt the same Channel Attention Network (CAN) module. In addition, we introduce a Self-Attention Fusion network (SAF), which can perform multi-scale text region feature extraction and fusion. The self-attention mechanism significantly enhances the perceptual capability of text region features, thereby improving the effectiveness of secret information embedding. Extensive experiments demonstrate that DAMS achieves state-of-the-art results, with an average accuracy rate of 99.99% and a PSNR of 40.52 dB under noise-free conditions, and an average accuracy rate of 99.32% and a PSNR of 38.24 dB under combined noise interference. The code will be released.
引用
收藏
页码:118 / 131
页数:14
相关论文
共 50 条
  • [41] Caries-segnet: multi-scale cascaded hybrid spatial channel attention encoder-decoder for semantic segmentation of dental caries
    Priya, Jayaraman
    Raja, Subramanian Kanaga Suba
    BIOMEDICAL ENGINEERING-BIOMEDIZINISCHE TECHNIK, 2025,
  • [42] Stall warning for compressors based on wavelet features and multi-scale convolutional recurrent encoder-decoder
    Zhou, Xiaoping
    Wang, Lufeng
    Yu, Liang
    Wang, Yang
    Wang, Ran
    Dong, Guangming
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2025, 225
  • [43] Encoder-decoder Network with Self-attention Module for Image Restoration
    Jin, Qing
    Yu, Qi
    Liu, Jiying
    Tan, Xintong
    THIRTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2021), 2022, 12083
  • [44] Roadway Crack Segmentation Based on an Encoder-decoder Deep Network with Multi-scale Convolutional Blocks
    Sun, Mengyuan
    Guo, Runhua
    Zhu, Jinhui
    Fan, Wenhui
    2020 10TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2020, : 869 - 874
  • [45] Multi-level Encoder-Decoder Architectures for Image Restoration
    Mastan, Indra Deep
    Raman, Shanmuganathan
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1728 - 1737
  • [46] Multi-Supervised Encoder-Decoder for Image Forgery Localization
    Yu, Chunfang
    Zhou, Jizhe
    Li, Qin
    ELECTRONICS, 2021, 10 (18)
  • [47] ATTENTION-BASED ENCODER-DECODER NETWORK FOR SINGLE IMAGE DEHAZING
    Gao, Shunan
    Zhu, Jinghua
    Xi, Heran
    2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2021,
  • [48] Uni MS-PS: A multi-scale encoder-decoder transformer for universal photometric stereo
    Hardy, Clement
    Queau, Yvain
    Tschumperle, David
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 248
  • [49] A feature enhancement network based on image partitioning in a multi-branch encoder-decoder architecture
    Wang, Yuefei
    Zhang, Yutong
    Zhang, Li
    Wan, Yuxuan
    Chen, Zhixuan
    Xu, Yuquan
    Cao, Ruixin
    Zhao, Liangyan
    Yang, Yixi
    Yu, Xi
    KNOWLEDGE-BASED SYSTEMS, 2025, 311
  • [50] Image Segmentation using Encoder-Decoder Architecture and Region Consistency Activation
    Naik, Dinesh
    Jaidhar, C. D.
    2016 11TH INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2016, : 724 - 729