Recurrent Convolutional Neural Networks for AMR Steganalysis Based on Pulse Position

被引:14
作者
Gong, Chen [1 ,2 ]
Yi, Xiaowei [1 ,2 ]
Zhao, Xianfeng [1 ,2 ]
Ma, Yi [3 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, State Key Lab Informat Secur, Beijing 100093, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing 100093, Peoples R China
[3] Beijing Informat Technol Inst, Beijing 100094, Peoples R China
来源
IH&MMSEC '19: PROCEEDINGS OF THE ACM WORKSHOP ON INFORMATION HIDING AND MULTIMEDIA SECURITY | 2019年
关键词
steganalysis; adaptive multi-rate; fixed codebook; pulse position; recurrent neural network; convolutional neural network; STEGANOGRAPHY;
D O I
10.1145/3335203.3335708
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
With the rapid development of stream multimedia, the adaptive multi-rate (AMR) audio steganography are emerging recently. However, the traditional steganalysis methods face great challenges in detecting short time speech at low embedding rates. To address this problem, we propose a steganalytic scheme by combining Recurrent Neural Network (RNN) and Convolutional Neural Network (CNN), SRCNet. AMR fixed codebook (FCB) steganography embed messages by modifying the pulse positions, which would destroy the FCB correlation. Firstly we analyzed the FCB correlations at different distances, and summarized these correlations into four categories. Furthermore, we utilizes RNN to extract higher level contextual representations of FCBs and CNN to fuse spatial-temporal features for the steganalysis. The proposed approach was evaluated on a public data-set. The experiment results validate that the proposed framework greatly outperforms the existing state-of-the-art methods. The correct detection rate of SRCNet has been improved above at least 10% when the sample is as short as 100ms at the 20% embedding rate. In particular, the network achieves the significant improvements for detecting the STCs based adaptive AMR steganography.
引用
收藏
页码:2 / 13
页数:12
相关论文
共 28 条
[1]  
[Anonymous], TECHNICAL REPORT
[2]  
[Anonymous], INT WORKSH DIG WAT
[3]  
[Anonymous], 2014, P 2 INT C LEARNING R
[4]  
[Anonymous], 2018, INT WORKSH DIG WAT
[5]  
[Anonymous], IEEE T INFORM FORENS
[6]   The Adaptive Multirate Wideband speech codec (AMR-WB) [J].
Bessette, B ;
Salami, R ;
Lefebvre, R ;
Jelínek, M ;
Rotola-Pukkila, J ;
Vainio, J ;
Mikkola, H ;
Järvinen, K .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (08) :620-636
[7]   An Approach to Information Hiding in Low Bit-rate Speech Stream [J].
Bo, Xiao ;
Yongfeng, Huang ;
Tang, Shanyu .
GLOBECOM 2008 - 2008 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, 2008,
[8]  
Chang C. C., 2011, ACM T INTEL SYST TEC, V2, P1, DOI DOI 10.1145/1961189.1961199
[9]   Audio Steganalysis With Convolutional Neural Network [J].
Chen, Bolin ;
Luo, Weiqi ;
Li, Haodong .
IH&MMSEC'17: PROCEEDINGS OF THE 2017 ACM WORKSHOP ON INFORMATION HIDING AND MULTIMEDIA SECURITY, 2017, :85-90
[10]  
Ding Qi, 2010, WIRELESS COMMUNICATI, P1