Simultaneous speaker identification and watermarking

被引：0

作者：

Basant S. Abd El-Wahab

Heba A. El-khobby

Mustafa M. Abd Elnaby

Fathi E. Abd El-Samie

机构：

[1] Tanta University,Department of Electronics and Electrical Communications Engineering, Faculty of Engineering

[2] Menoufia University,Department of Electronics and Electrical Communications, Faculty of Electronic Engineering

来源：

International Journal of Speech Technology | 2021年 / 24卷

关键词：

Biometric systems; Speech watermarking; Empirical mode decomposition; Mel frequency cepstral coefficients; Speech enhancement; Speaker identification;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Biometric template protection of speech signals and information hiding in speech signals are two challenging issues. To resolve such limitations and increase the level of security, our objective is to build multi-level security systems based on speech signals. So, speech watermarking is used simultaneously with automatic speaker identification. The speech watermarking is performed to embed images into the speech signals that are used for speaker identification. The watermark is extracted for authentication, and then the effect of watermark removal on the performance of the speaker identification system in the presence of degradations is studied. This paper presents an approach for speech watermarking based on empirical mode decomposition (EMD) in different transform domains and singular value decomposition (SVD). The speech signal is decomposed in different transform domains with EMD to yield zero-mean components called intrinsic mode functions (IMFs). The watermark is inserted into one of these IMF components with SVD. A comparison between different transform domains for implementing the proposed watermarking scheme on different IMFs is presented. The log-likelihood ratio (LLR), correlation coefficient (Cr), signal-to-noise ratio (SNR), and spectral distortion (SD) are used as metrics for the comparison. According to the simulation results, we find that the watermark embedding in the discrete sine transform domain provides higher SNR and Cr values and lower SD and LLR values. The proposed approach is robust to different attacks.

引用

页码：205 / 218

页数：13

共 50 条

[31] Real-Time Speaker Identification Using Speaker Model Distance
Zeinali, Hossein
Sameti, Hossein
Hadian, Hossein
2015 23RD IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2015, : 643 - 647
[32] IMPROVING SPEAKER IDENTIFICATION FOR SHARED DEVICES BY ADAPTING EMBEDDINGS TO SPEAKER SUBSETS
Tan, Zhenning
Yang, Yuguang
Han, Eunjung
Stolcke, Andreas
2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 1124 - 1131
[33] Improved speaker identification in wireless environment
Vuppala, Anil Kumar
Rao, K. Sreenivasa
Chakrabarti, Saswat
INTERNATIONAL JOURNAL OF SIGNAL AND IMAGING SYSTEMS ENGINEERING, 2013, 6 (03) : 130 - 137
[34] Survey of Automated Speaker Identification Methods
Sidorov, Maxim
Schmitt, Alexander
Zablotskiy, Sergey
Minker, Wolfgang
NINTH INTERNATIONAL CONFERENCE ON INTELLIGENT ENVIRONMENTS (IE 2013), 2013, : 236 - 239
[35] NMF Based System for Speaker Identification
Costantini, Giovanni
Cesarini, Valerio
Paolizzo, Fabio
2021 IEEE INTERNATIONAL WORKSHOP ON METROLOGY FOR INDUSTRY 4.0 & IOT (IEEE METROIND4.0 & IOT), 2021, : 620 - 624
[36] On the use of Distributed DCT in Speaker Identification
Sahidullah, Md.
Saha, Goutam
2009 ANNUAL IEEE INDIA CONFERENCE (INDICON 2009), 2009, : 245 - 248
[37] Generalized dimensions applied to speaker identification
Hou, LM
Wang, SZ
BIOMETRIC TECHNOLOGY FOR HUMAN IDENTIFICATION, 2004, 5404 : 555 - 560
[38] Fusion features for robust speaker identification
Ben Fredj, Ines
Zouhir, Youssef
Ouni, Kais
INTERNATIONAL JOURNAL OF SIGNAL AND IMAGING SYSTEMS ENGINEERING, 2018, 11 (02) : 65 - 72
[39] SPEAKER IDENTIFICATION WITH DISTANT MICROPHONE SPEECH
Jin, Qin
Li, Runxin
Yang, Qian
Laskowski, Kornel
Schultz, Tanja
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4518 - 4521
[40] Speaker identification using cepstral analysis
Nazar, MN
ISCON 2002: IEEE STUDENTS CONFERENCE ON EMERGING TECHNOLOGIES, PROCEEDINGS, 2002, : 139 - 143

← 1 2 3 4 5 →