Insertion, deletion codes with feature-based embedding: A new paradigm for watermark synchronization with applications to speech watermarking

Cited by: 36

Authors:

Coumou, David J. [1,2]

Sharma, Gaurav [1,3]

Affiliations:

[1] Univ Rochester, Dept Elect & Comp Engn, Rochester, NY 14623 USA

[2] MKS Instruments Inc, Rochester, NY 14623 USA

[3] Univ Rochester, Med Ctr, Dept Biostat & Computat Biol, Rochester, NY 14642 USA

Source:

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY | 2008 / Vol. 3 / No. 2

Keywords:

feature-based watermarking; insertion deletion codes; pitch watermarking; speech watermarking; watermark synchronization;

DOI:

10.1109/TIFS.2008.920728

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods];

Discipline classification code:

081202;

Abstract:

A framework is proposed for synchronization in feature-based data embedding systems that is tolerant of errors in estimated features. The method combines feature-based embedding with codes capable of simultaneous synchronization and error correction, thereby allowing recovery from both the desynchronization caused by feature estimation discrepancies between the embedder and receiver and the alterations in estimated symbols arising from other channel perturbations. A speech watermark is presented that constitutes a realization of the framework for 1-D signals. The speech watermark employs pitch modification for data embedding and Davey and MacKay's insertion, deletion, and substitution (IDS) codes for synchronization and error recovery. Experimental results demonstrate that the system allows watermark data recovery despite feature desynchronization. The performance of the speech watermark is optimized by estimating the channel parameters required for the IDS decoding at the receiver via the expectation-maximization algorithm. In addition, acceptable watermark power levels (i.e., the range of pitch modification that is perceptually tolerable) are determined from psychophysical tests. The proposed watermark demonstrates robustness to low-bit-rate speech coding channels (Global System for Mobile Communications at 13 kb/s and AMR at 5.1 kb/s), which have posed a serious challenge for prior speech watermarks. Thus, the watermark presented in this paper not only highlights the utility of the proposed framework but also represents a significant advance in speech watermarking. Issues in extending the proposed framework to 2-D and 3-D signals and different application scenarios are identified.
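The desynchronization described in the abstract can be pictured with a toy binary insertion, deletion, and substitution channel. The following Python sketch is only an illustration of that channel type under simplifying assumptions; it is not the authors' implementation, and it contains no decoder. The function name `ids_channel` and the parameters `p_ins`, `p_del`, and `p_sub` are hypothetical names chosen for this example.

```python
import random


def ids_channel(bits, p_ins, p_del, p_sub, rng):
    """Pass a list of 0/1 bits through a toy IDS channel.

    At each step one of three events occurs (all probabilities are
    illustrative assumptions, not values from the paper):
      - with probability p_ins a random bit is inserted and the current
        input bit is not consumed;
      - with probability p_del the current input bit is dropped;
      - otherwise the bit is transmitted, and flipped with probability p_sub.
    """
    if not (0.0 <= p_ins < 1.0 and 0.0 <= p_del <= 1.0 and p_ins + p_del <= 1.0):
        raise ValueError("need 0 <= p_ins < 1 and p_ins + p_del <= 1")
    if not 0.0 <= p_sub <= 1.0:
        raise ValueError("need 0 <= p_sub <= 1")

    out = []
    i = 0
    while i < len(bits):
        u = rng.random()
        if u < p_ins:
            # Insertion: emit a random bit, stay on the same input bit.
            out.append(rng.randint(0, 1))
        elif u < p_ins + p_del:
            # Deletion: skip the input bit without emitting anything.
            i += 1
        else:
            # Transmission, possibly with a substitution (bit flip).
            b = bits[i]
            if rng.random() < p_sub:
                b ^= 1
            out.append(b)
            i += 1
    return out


if __name__ == "__main__":
    rng = random.Random(0)
    sent = [rng.randint(0, 1) for _ in range(40)]
    received = ids_channel(sent, p_ins=0.05, p_del=0.05, p_sub=0.02, rng=rng)
    # The received length generally differs from the sent length, which is
    # why a plain substitution-error code cannot be applied position by position.
    print(len(sent), len(received))
```

Because insertions and deletions shift every later symbol, the receiver cannot tell which received position corresponds to which transmitted position; this is the kind of synchronization problem that the abstract says the IDS codes address.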


Pages: 153-165

Page count: 13
