Simultaneous speaker identification and watermarking

被引:2
|
作者
Abd El-Wahab, Basant S. [1 ]
El-khobby, Heba A. [1 ]
Abd Elnaby, Mustafa M. [1 ]
Abd El-Samie, Fathi E. [2 ]
机构
[1] Tanta Univ, Fac Engn, Dept Elect & Elect Commun Engn, Tanta, Egypt
[2] Menoufia Univ, Fac Elect Engn, Dept Elect & Elect Commun, Al Minufiyah, Egypt
关键词
Biometric systems; Speech watermarking; Empirical mode decomposition; Mel frequency cepstral coefficients; Speech enhancement; Speaker identification;
D O I
10.1007/s10772-019-09658-x
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Biometric template protection of speech signals and information hiding in speech signals are two challenging issues. To resolve such limitations and increase the level of security, our objective is to build multi-level security systems based on speech signals. So, speech watermarking is used simultaneously with automatic speaker identification. The speech watermarking is performed to embed images into the speech signals that are used for speaker identification. The watermark is extracted for authentication, and then the effect of watermark removal on the performance of the speaker identification system in the presence of degradations is studied. This paper presents an approach for speech watermarking based on empirical mode decomposition (EMD) in different transform domains and singular value decomposition (SVD). The speech signal is decomposed in different transform domains with EMD to yield zero-mean components called intrinsic mode functions (IMFs). The watermark is inserted into one of these IMF components with SVD. A comparison between different transform domains for implementing the proposed watermarking scheme on different IMFs is presented. The log-likelihood ratio (LLR), correlation coefficient (C-r), signal-to-noise ratio (SNR), and spectral distortion (SD) are used as metrics for the comparison. According to the simulation results, we find that the watermark embedding in the discrete sine transform domain provides higher SNR and C-r values and lower SD and LLR values. The proposed approach is robust to different attacks.
引用
收藏
页码:205 / 218
页数:14
相关论文
共 50 条
  • [1] Simultaneous speaker identification and watermarking
    Basant S. Abd El-Wahab
    Heba A. El-khobby
    Mustafa M. Abd Elnaby
    Fathi E. Abd El-Samie
    International Journal of Speech Technology, 2021, 24 : 205 - 218
  • [2] Sensitivity of automatic speaker identification to SVD digital audio watermarking
    El-Samie, Fathi
    Shafik, Amira
    El-Sayed, Hala
    Elhalafawy, Said
    Diab, Salaheldin
    Sallam, Bassiouny
    Faragallah, Osama
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2015, 18 (04) : 565 - 581
  • [3] Speech fragment decoding techniques for simultaneous speaker identification and speech recognition
    Barker, Jon
    Ma, Ning
    Coy, Andre
    Cooke, Martin
    COMPUTER SPEECH AND LANGUAGE, 2010, 24 (01) : 94 - 111
  • [4] Speaker Re-identification with Speaker Dependent Speech Enhancement
    Shi, Yanpei
    Huang, Qiang
    Hain, Thomas
    INTERSPEECH 2020, 2020, : 1530 - 1534
  • [5] Speech Enhancement for Speaker Identification
    Mahesh, R.
    2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018,
  • [6] Spectral Restoration Based Speech Enhancement for Robust Speaker Identification
    Saleem, Nasir
    Tareen, Tayyaba Gul
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2018, 5 (01): : 34 - 39
  • [7] New algorithms for improved speaker identification
    Fang, Eric
    Gowdy, John N.
    INTERNATIONAL JOURNAL OF BIOMETRICS, 2013, 5 (3-4) : 360 - 369
  • [8] Speaker verification security improvement by means of speech watermarking
    Faundez-Zanuy, Marcos
    Hagmueller, Martin
    Kubin, Gernot
    SPEECH COMMUNICATION, 2006, 48 (12) : 1608 - 1619
  • [9] Hierarchical speaker identification using speaker clustering
    Sun, B
    Liu, WJ
    Zhong, QH
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 299 - 304
  • [10] EMARATI SPEAKER IDENTIFICATION
    Shahin, Ismail
    Ba-Hutair, Mohammed Nasser
    2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 488 - 493