Speech watermarking based tamper detection and recovery scheme with high tolerable tamper rate

被引:0
作者
Shengbei Wang
Weitao Yuan
Zhen Zhang
Lin Wang
机构
[1] Tiangong University,Tianjin Key Laboratory of Autonomous Intelligence Technology and Systems
[2] Techfantasy. Co. Ltd.,undefined
来源
Multimedia Tools and Applications | 2024年 / 83卷
关键词
Speech watermarking; Tamper detection; Tamper recovery; Speech authentication;
D O I
暂无
中图分类号
学科分类号
摘要
Speech watermarking has been widely used for tamper detection and recovery. In this paper, we propose a speech watermarking based tamper detection and recovery method after analyzing the characteristics of continuous and discontinuous tamper. The authentication watermarks and recovery watermarks are embedded into the original speech using align embedding and misalign embedding strategies, respectively. In particular, the misalign embedding strategy which distributes the recovery watermarks repeatedly and widely can effectively prevent the speech segment and its recovery watermarks from being tampered simultaneously, which significantly increases the tolerable tamper rate (TTR) of the proposed method. Several experiments concerning inaudibility, recovery rate, sound quality of recovered speech, and recovery percentage were carried out to evaluate the proposed method. The obtained results suggested that the proposed method had good inaudibility. Moreover, it could tolerate high tamper rate (around 50%) and provide satisfactory recovery rate (100%) and speech quality (PESQ≥ 3.0 ODG and LSD ≤ 1.0 dB) under continuous tamper (for N ≥ 6). Similarly, it could recovery most of the speech after discontinuous tamper even under high tamper rate. These results verified the effectiveness of the proposed method.
引用
收藏
页码:6711 / 6729
页数:18
相关论文
共 106 条
[1]  
Galajit K(2019)Semi-fragile speech watermarking based on singular-spectrum analysis with cnn-based parameter estimation for tampering detection APSIPA Trans Signal Inf Process 8 1-13
[2]  
Karnjana J(2009)Adjacent-block based statistical detection method for self-embedding watermarking techniques Signal Process 89 1557-1566
[3]  
Unoki M(2012)Using information theoretic distance measures for solving the permutation problem of blind source separation of speech signals EURASIP J Audio Speech Music Process 2012 14-180,408
[4]  
Aimmanee P(2019)Hybrid blind audio watermarking for proprietary protection, tamper proofing, and self-recovery IEEE Access 7 180,395-238
[5]  
He HJ(2008)Evaluation of objective quality measures for speech enhancement IEEE Trans Audio Speech Lang Process 16 229-242
[6]  
Zhang JS(2016)Twenty years of digital audio watermarking - a comprehensive review Signal Process 128 222-2378
[7]  
Chen F(2018)Robust image-in-audio watermarking technique based on dct-svd transform EURASIP J Audio Speech Music Process 2018 1-74
[8]  
Hoffmann E(2018)Robust image-in-audio watermarking technique based on DCT-SVD transform EURASIP J Audio Speech Music Process 2018 16-166
[9]  
Kolossa D(2013)Robust svd-based audio watermarking scheme with differential evolution optimization IEEE Trans Audio Speech Lang Process 21 2368-166
[10]  
Köhler B(2017)Exposing speech tampering via spectral phase analysis Digit Signal Process 60 63-12,504