Robust Segments Detector for De-Synchronization Resilient Audio Watermarking

Cited by: 39

Authors:

Pun, Chi-Man [1]

Yuan, Xiao-Chen [1]

Affiliations:

[1] University of Macau, Department of Computer and Information Science, Macau 999078, People's Republic of China

Source:

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2013 / Vol. 21 / No. 11

Keywords:

Robust audio segments extractor (RASE); stationary wavelet transform (SWT); synchronization geometric distortions; time-scale modification (TSM); pitch shifting; DWT

DOI:

10.1109/TASL.2013.2279312

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

A robust feature point detector for invariant audio watermarking is proposed in this paper. The audio segments centered at the detected feature points are extracted for both watermark embedding and extraction. These feature points are invariant to various attacks and change little as long as high auditory quality is maintained. In addition, high robustness and inaudibility are achieved by embedding the watermark into the approximation coefficients of the Stationary Wavelet Transform (SWT) domain, which is shift invariant. The spread spectrum communication technique is adopted to embed the watermark. Experimental results show that the proposed Robust Audio Segments Extractor (RASE) and the watermarking scheme are robust against common audio signal processing, such as low-pass filtering, MP3 compression, echo addition, volume change, and normalization, and against the distortions introduced in the StirMark Benchmark for Audio. At the same time, they are robust against synchronization geometric distortions, such as resample time-scale modification (TSM) with scaling factors up to ±50%, pitch-invariant TSM by ±50%, and tempo-invariant pitch shifting by ±50%. Overall, the joint RASE and SWT approach resists a wide range of attacks and performs much better than the existing state-of-the-art methods.
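The embedding idea in the abstract (a spread-spectrum watermark added to shift-invariant SWT approximation coefficients) can be illustrated with a minimal sketch. This is not the paper's implementation: the one-level Haar filter, the circular boundary handling, the strength `alpha`, the integer `key`, and all function names are assumptions chosen for illustration, and the RASE feature-point detection step is not shown because the record does not describe it.

```python
import numpy as np

SQRT2 = np.sqrt(2.0)

def haar_swt1(x):
    """One-level undecimated (stationary) Haar transform, circular boundary."""
    nxt = np.roll(x, -1)                      # x[n+1]
    approx = (x + nxt) / SQRT2                # approximation coefficients
    detail = (x - nxt) / SQRT2                # detail coefficients
    return approx, detail

def haar_iswt1(approx, detail):
    """Inverse of haar_swt1: average the two redundant estimates of x[n]."""
    from_n = (approx + detail) / SQRT2                 # x[n] from pair n
    from_prev = np.roll(approx - detail, 1) / SQRT2    # x[n] from pair n-1
    return 0.5 * (from_n + from_prev)

def pn_sequence(length, key):
    """Pseudo-random +/-1 spreading sequence derived from a secret key."""
    rng = np.random.default_rng(key)
    return rng.choice(np.array([-1.0, 1.0]), size=length)

def embed_bit(segment, bit, key, alpha=0.5):
    """Add alpha * bit * PN to the approximation coefficients (bit is +1 or -1)."""
    approx, detail = haar_swt1(segment)
    approx = approx + alpha * bit * pn_sequence(len(segment), key)
    return haar_iswt1(approx, detail)

def detect_bit(segment, key):
    """Correlate approximation coefficients with the PN sequence; return the sign."""
    approx, _ = haar_swt1(segment)
    corr = np.dot(approx, pn_sequence(len(segment), key))
    return 1 if corr >= 0 else -1
```

In this sketch a segment would be the samples around one detected feature point; calling `detect_bit(embed_bit(segment, 1, key=42), key=42)` correlates the marked segment against the same keyed sequence. The undecimated transform is used because a circular shift of the input only shifts the coefficients, which is the shift-invariance property the abstract relies on.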


Pages: 2412-2424

Page count: 13

