Simultaneous speaker identification and watermarking

Authors
Basant S. Abd El-Wahab
Heba A. El-khobby
Mustafa M. Abd Elnaby
Fathi E. Abd El-Samie
Affiliations
[1] Tanta University, Department of Electronics and Electrical Communications Engineering, Faculty of Engineering
[2] Menoufia University, Department of Electronics and Electrical Communications, Faculty of Electronic Engineering
Source
International Journal of Speech Technology, 2021, Vol. 24
Keywords
Biometric systems; Speech watermarking; Empirical mode decomposition; Mel frequency cepstral coefficients; Speech enhancement; Speaker identification
Abstract
Biometric template protection of speech signals and information hiding in speech signals are two challenging issues. To address them and increase the level of security, our objective is to build multi-level security systems based on speech signals. To this end, speech watermarking is used simultaneously with automatic speaker identification. Speech watermarking embeds images into the speech signals that are used for speaker identification. The watermark is extracted for authentication, and the effect of watermark removal on the performance of the speaker identification system in the presence of degradations is then studied. This paper presents an approach for speech watermarking based on empirical mode decomposition (EMD) in different transform domains and singular value decomposition (SVD). The speech signal is decomposed in different transform domains with EMD to yield zero-mean components called intrinsic mode functions (IMFs). The watermark is inserted into one of these IMFs with SVD. A comparison between different transform domains for implementing the proposed watermarking scheme on different IMFs is presented. The log-likelihood ratio (LLR), correlation coefficient (Cr), signal-to-noise ratio (SNR), and spectral distortion (SD) are used as metrics for the comparison. According to the simulation results, watermark embedding in the discrete sine transform domain provides higher SNR and Cr values and lower SD and LLR values than the other transform domains considered. The proposed approach is robust to different attacks.
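The abstract does not spell out the embedding equations, so the following is only a minimal sketch of one commonly used additive SVD watermarking scheme, together with the SNR and Cr metrics named above. Everything in it is an assumption made for illustration: the function names, the embedding strength `k`, the square reshaping of the host component, and the random array standing in for an IMF. The EMD step and the transform-domain step are omitted entirely.

```python
import numpy as np


def embed_svd(host, mark, k=0.01):
    """Embed `mark` into the singular values of `host` (both n x n arrays).

    Illustrative additive scheme, not necessarily the one used in the paper.
    """
    U, s, Vt = np.linalg.svd(host)
    D = np.diag(s) + k * mark            # perturb the singular-value matrix
    Uw, sw, Vwt = np.linalg.svd(D)       # second SVD of the perturbed matrix
    marked = U @ np.diag(sw) @ Vt        # rebuild the host with new singular values
    key = (Uw, Vwt, s)                   # side information needed for extraction
    return marked, key


def extract_svd(marked, key, k=0.01):
    """Recover the watermark from `marked` using the stored key (non-blind)."""
    Uw, Vwt, s = key
    s_star = np.linalg.svd(marked, compute_uv=False)
    D_star = Uw @ np.diag(s_star) @ Vwt
    return (D_star - np.diag(s)) / k


def snr_db(reference, processed):
    """Signal-to-noise ratio in dB between a reference and a processed signal."""
    noise = reference - processed
    return 10.0 * np.log10(np.sum(reference ** 2) / np.sum(noise ** 2))


def correlation(a, b):
    """Correlation coefficient (Cr) between two arrays of equal size."""
    return np.corrcoef(a.ravel(), b.ravel())[0, 1]


def demo(n=32, k=0.01, seed=0):
    rng = np.random.default_rng(seed)
    # Stand-in for one IMF of a speech segment, reshaped to an n x n matrix.
    component = rng.standard_normal(n * n).reshape(n, n)
    # Stand-in for a binary watermark image.
    mark = (rng.random((n, n)) > 0.5).astype(float)
    marked, key = embed_svd(component, mark, k)
    recovered = extract_svd(marked, key, k)
    return snr_db(component, marked), correlation(mark, recovered)
```

Calling `demo()` returns the SNR of the watermarked component against the original and the Cr between the embedded and extracted watermarks when no attack is applied. A smaller `k` would be expected to raise the SNR at the cost of robustness, which is the usual trade-off in schemes of this kind.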
Pages: 205-218
Number of pages: 13