DAMS: Document Image Steganography with Dual Attention Multi-scale Encoder-Decoder Architecture

Cited by: 0
Authors
Li, Kaijiang [1]
Qin, Yi [1]
Wang, Peisen [1]
Guo, Chunyi [1]
Wang, Junqi [2]
Jia, Ruiyang [1]
Jiang, Wenfeng [3]
Affiliations
[1] Zhengzhou Univ, Zhengzhou, Peoples R China
[2] Zhengzhou Univ Aeronaut, Zhengzhou, Peoples R China
[3] China Mobile Grp Henan Co Ltd, Zhengzhou, Peoples R China
Source
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT II | 2025, Vol. 15032
Keywords
Steganography; Document image; Channel attention; Transformer; Multi-scale feature fusion
DOI
10.1007/978-981-97-8490-5_9
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In steganography research, advances in deep learning have significantly improved the ability to embed secret messages into scene images. However, for document images, which differ significantly from scene images in color and background distribution, ensuring the invisibility of hidden information without interfering with the text-reading experience remains a major challenge. To address this challenge, we propose an end-to-end framework designed specifically for document images, namely the Dual Attention Multi-scale Encoder-Decoder Architecture (DAMS). The DAMS framework takes full account of the pixel distributions and value deviations that arise during the formation of document images. To balance the information embedding and extraction processes, the encoder and decoder adopt the same Channel Attention Network (CAN) module. In addition, we introduce a Self-Attention Fusion network (SAF), which performs multi-scale extraction and fusion of text-region features. The self-attention mechanism significantly enhances the perception of text-region features, thereby improving the effectiveness of secret information embedding. Extensive experiments demonstrate that DAMS achieves state-of-the-art results, with an average accuracy of 99.99% and a PSNR of 40.52 dB under noise-free conditions, and an average accuracy of 99.32% and a PSNR of 38.24 dB under combined noise interference. The code will be released.
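The two figures quoted in the abstract, PSNR and accuracy, can be illustrated with a minimal sketch. This is not the authors' evaluation code: the function names `psnr` and `bit_accuracy` are hypothetical, 8-bit pixel values (peak 255) are assumed, and "accuracy" is assumed here to mean the fraction of message bits recovered correctly, which the abstract does not state explicitly.

```python
import math


def psnr(cover, stego, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized images.

    Images are given as flat sequences of pixel values. An 8-bit range
    (max_value=255) is an assumption, not something the abstract specifies.
    """
    if len(cover) != len(stego) or len(cover) == 0:
        raise ValueError("images must be non-empty and the same size")
    # Mean squared error between cover and stego pixels.
    mse = sum((c - s) ** 2 for c, s in zip(cover, stego)) / len(cover)
    if mse == 0:
        return math.inf  # identical images: PSNR is unbounded
    return 10.0 * math.log10(max_value ** 2 / mse)


def bit_accuracy(sent_bits, recovered_bits):
    """Fraction of message bits that the decoder recovered correctly.

    Assumed definition of "accuracy"; the source does not define it.
    """
    if len(sent_bits) != len(recovered_bits) or len(sent_bits) == 0:
        raise ValueError("bit sequences must be non-empty and the same length")
    matches = sum(1 for a, b in zip(sent_bits, recovered_bits) if a == b)
    return matches / len(sent_bits)
```

A higher PSNR means the stego image deviates less from the cover image, so under these assumptions the reported drop from 40.52 dB to 38.24 dB corresponds to a larger mean squared error when the model is trained to withstand combined noise.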
Pages: 118-131
Page count: 14