A novel audio watermarking scheme using ensemble-based watermark detector and discrete wavelet transform

被引:21
作者
Pourhashemi, Seyed Mostafa [1 ]
Mosleh, Mohammad [1 ]
Erfani, Yousof [2 ]
机构
[1] Islamic Azad Univ, Dept Comp Engn, Dezful Branch, Dezful, Iran
[2] McMaster Univ, Dept Elect & Comp Engn, AEL Grp, Hamilton, ON, Canada
关键词
Audio watermarking; Discrete wavelet transform; Support vector machine; K-nearest neighbor; Machine learning; HIGH-CAPACITY; ROBUST; DCT; DECOMPOSITION; TRANSPARENT; SYNERGY;
D O I
10.1007/s00521-020-05389-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most existing extraction techniques in audio watermarking use conventional techniques in which some sets of special rules based on reverse embedding rules are used for watermark extraction and have many weaknesses, like very low robustness to destructive attacks. To overcome this problem, the use of machine learning-based methods has increased in recent years in this field. The disadvantage of these methods is the high reliance on a unique classifier and lack of proper efficiency when achieving high capacity, which is a major challenge in audio watermarking. The main purpose of this paper is to present a method that covers the weak points of conventional methods and simple intelligent methods and improves system performance using a synergistic combination of discrete wavelet transform (DWT) and ensemble-intelligent extraction approach by proposed combination of trained machine learning classifiers. For the embedding operation in the proposed method, the DWT and the difference in energy levels obtained through DWT coefficients are used. In the extraction section, three methods are used in parallel: (a) the trained support vector machine (SVM) classifier with RBF kernel, (b) trained SVM classifier with quadratic kernel and (c) the trained K-nearest neighbor classifier; finally, the majority function is used to vote and make a final decision to create an intelligent-based watermark detector. A training set is required to train the classifiers, whose bit sequence is generated by a proposed 5-bit linear-feedback shift register. The results of various experiments indicate that this ensemble method has achieved the appropriate imperceptibility and high capacity, along with higher robustness compared to conventional techniques and individual learning classifiers.
引用
收藏
页码:6161 / 6181
页数:21
相关论文
共 28 条
[1]  
Alpaydin E., 2009, Introduction to machine learning
[2]  
[Anonymous], 2011, Int. J. Phys. Sci.
[3]   Techniques for data hiding [J].
Bender, W ;
Gruhl, D ;
Morimoto, N ;
Lu, A .
IBM SYSTEMS JOURNAL, 1996, 35 (3-4) :313-336
[4]   A phenomenological model of the synapse between the inner hair cell and auditory nerve: Implications of limited neurotransmitter release sites [J].
Bruce, Ian C. ;
Erfani, Yousof ;
Zilany, Muhammad S. A. .
HEARING RESEARCH, 2018, 360 :40-54
[5]   Dual Quantum Audio Watermarking Schemes Based on Quantum Discrete Cosine Transform [J].
Chen, Kehan ;
Yan, Fei ;
Iliyasu, Abdullah M. ;
Zhao, Jianping .
INTERNATIONAL JOURNAL OF THEORETICAL PHYSICS, 2019, 58 (02) :502-521
[6]   Wavelet-domain audio watermarking using optimal modification on low-frequency amplitude [J].
Chen, Shuo-Tsung ;
Hsu, Chih-Yu ;
Huang, Hunag-Nan .
IET SIGNAL PROCESSING, 2015, 9 (02) :166-176
[7]   Audio Watermarking Using Spikegram and a Two-Dictionary Approach [J].
Erfani, Yousof ;
Pichevar, Ramin ;
Rouat, Jean .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2017, 12 (04) :840-852
[8]   Supplementary Schemes to Enhance the Performance of DWT-RDM-Based Blind Audio Watermarking [J].
Hu, Hwai-Tsu ;
Hsu, Ling-Yuan .
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2017, 36 (05) :1890-1911
[9]   Efficient and robust frame-synchronized blind audio watermarking by featuring multilevel DWT and DCT [J].
Hu, Hwai-Tsu ;
Chang, Jieh-Ren .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2017, 20 (01) :805-816
[10]   Robust, transparent and high-capacity audio watermarking in DCT domain [J].
Hu, Hwai-Tsu ;
Hsu, Ling-Yuan .
SIGNAL PROCESSING, 2015, 109 :226-235