Robust Mel-Frequency Cepstral coefficients feature detection and dual-tree complex wavelet transform for digital audio watermarking

被引:31
作者
Yuan, Xiao-Chen [1 ]
Pun, Chi-Man [1 ]
Chen, C. L. Philip [1 ]
机构
[1] Univ Macau, Dept Comp & Informat Sci, Macau, Peoples R China
关键词
Mel-Frequency Cepstral Coefficients; Dual-Tree Complex Wavelet Transform (DT CWT); Time-Scale Modification (TSM); Pitch shifting; Stir-mark; SCHEME; BLIND; ALGORITHM; PATTERNS; DCT;
D O I
10.1016/j.ins.2014.11.040
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A novel digital audio watermarking scheme based on robust Mel-Frequency Cepstral coefficients feature detection and dual-tree complex wavelet transform is proposed in this paper, which is similar as patchwork based methods that several segments are extracted from the host audio clip for watermarking use. The robust Mel-Frequency Cepstral coefficients feature detection method is proposed to extract the feature segments which should be relocated when the host audio signal attacked by various distortions including both the common audio signal processing and the conventional geometric distortions. With the robust feature segments, the approximate shift invariant transform dual-tree complex wavelet transform based watermarking method is proposed to embed the watermark into the DT CWT real low-pass coefficients of each segment, using the spread spectrum techniques. The linear correlation is calculated to judge the existence of the watermark during the watermark detection. Experimental results show that the proposed digital audio watermarking scheme based on robust Mel-Frequency Cepstral coefficients feature detection and dual-tree complex wavelet transform can achieve high robustness against the common audio signal processing, such as low-pass filtering, MP3 compression, echo addition, volume change, and normalization; and geometric distortions, such as resample Time-Scale Modification (TSM), pitch invariant TSM, and tempo invariant pitch shifting. In addition, the proposed audio watermarking scheme is resilient to Stir-mark for Audio, and it performs much better comparing with the existing state-of-the art methods. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:159 / 179
页数:21
相关论文
共 45 条
[1]  
Abdalla M., 2010, Journal of Telecommuni- cations, V1, P16
[2]  
Beyerlein P., 2002, COMMUNICATION
[3]   Highly robust, secure, and perceptual-quality echo hiding scheme [J].
Chen, Oscal T. -C. ;
Wu, Wen-Chih .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (03) :629-638
[4]   A video watermarking scheme based on the dual-tree complex wavelet transform [J].
Coria, Lino E. ;
Pickering, Mark R. ;
Nasiopoulos, Panos ;
Ward, Rabab Kreidieh .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2008, 3 (03) :466-474
[5]  
Cox I. J., 2002, Digital watermarking, V53
[6]   Spread spectrum audio watermarking using frequency hopping and attack characterization [J].
Cvejic, N ;
Seppänen, T .
SIGNAL PROCESSING, 2004, 84 (01) :207-213
[7]   An Information-Geometric Approach to Real-Time Audio Segmentation [J].
Dessein, Arnaud ;
Cont, Arshia .
IEEE SIGNAL PROCESSING LETTERS, 2013, 20 (04) :331-334
[8]   Robust image watermarking using dihedral angle based on maximum-likelihood detector [J].
Hamghalam, Mohammad ;
Mirzakuchaki, Sattar ;
Ali Akhaee, Mohammad .
IET IMAGE PROCESSING, 2013, 7 (05) :451-463
[9]   Perceptual-based DWPT-DCT framework for selective blind audio watermarking [J].
Hu, Hwai-Tsu ;
Hsu, Ling-Yuan ;
Chou, Hsien-Hsin .
SIGNAL PROCESSING, 2014, 105 :316-327
[10]   A Video Watermarking Technique Based on Pseudo-3-D DCT and Quantization Index Modulation [J].
Huang, Hui-Yu ;
Yang, Cheng-Han ;
Hsu, Wen-Hsing .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2010, 5 (04) :625-637