Combination of autocorrelation-based features and projection measure technique for speaker identification

被引:11
|
作者
Yuo, KH [1 ]
Hwang, TH [1 ]
Wang, HC [1 ]
机构
[1] Natl Tsing Hua Univ, Dept Elect Engn, Hsinchu 300, Taiwan
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2005年 / 13卷 / 04期
关键词
channel-normalization; relative autocorrelation sequence; projection measure; speaker identification;
D O I
10.1109/TSA.2005.848893
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a robust approach for speaker identification when the speech signal is corrupted by additive noise and channel distortion. Robust features are derived by assuming that the corrupting noise is stationary and the channel effect is fixed during an utterance. A two-step temporal filtering procedure on the autocorrelation sequence is proposed to minimize the effect of additive and convolutional noises. The first step applies a temporal filtering procedure in autocorrelation domain to remove the additive noise, and the second step is to perform the mean subtraction on the filtered autocorrelation sequence in logarithmic spectrum domain to remove the channel effect. No prior knowledge of noise characteristic is necessary. The additive noise can be a colored noise. Then the proposed robust feature is combined with the projection measure technique to gain further improvement in recognition accuracy. Experimental results show that the proposed method can significantly improve the performance of speaker identification task in noisy environment.
引用
收藏
页码:565 / 574
页数:10
相关论文
共 13 条
  • [1] Speaker identification based on combination of MFCC and UMRT based features
    Antony, Anett
    Gopikakumari, R.
    8TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATIONS (ICACC-2018), 2018, 143 : 250 - 257
  • [2] A Histogram Based Speaker Identification Technique
    Sleit, Azzam
    Serhan, Sami
    Nemir, Loai
    2008 FIRST INTERNATIONAL CONFERENCE ON THE APPLICATIONS OF DIGITAL INFORMATION AND WEB TECHNOLOGIES, VOLS 1 AND 2, 2008, : 391 - 395
  • [3] Speaker Identification Based on Curvlet Transform Technique
    AbuAladas, Feras E.
    Zeki, Akram M.
    Al-Ani, Muzhir Shaban
    Messikh, Az-Eddine
    2017 INTERNATIONAL CONFERENCE ON COMPUTING, ENGINEERING, AND DESIGN (ICCED), 2017,
  • [4] Speaker Identification Approach Based On Time Domain Extracted Features
    Lupu, Eugen
    Emerich, Simina
    PROCEEDINGS ELMAR-2010, 2010, : 355 - 358
  • [5] Speaker Identification based on Robust AM-FM Features
    Deshpande, Mangesh S.
    Holambe, Raghunath S.
    2009 SECOND INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN ENGINEERING AND TECHNOLOGY (ICETET 2009), 2009, : 62 - +
  • [6] Weighted Distortion Measure on Standard Deviation for VQ-Based Speaker Identification
    Luo, Xiao-ting
    Ji, Li-xin
    Li, Shao-mei
    2010 2ND INTERNATIONAL CONFERENCE ON E-BUSINESS AND INFORMATION SYSTEM SECURITY (EBISS 2010), 2010, : 422 - 425
  • [7] Speaker identification based on the frame linear predictive coding spectrum technique
    Wu, Jian-Da
    Lin, Bing-Fu
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (04) : 8056 - 8063
  • [8] A speech indexing system for recorded audio source based on speaker identification technique
    Pan, ZB
    Kotam, K
    Ohmi, T
    ADVANCES IN INTELLIGENT SYSTEMS: THEORY AND APPLICATIONS, 2000, 59 : 239 - 243
  • [9] Wavelet Packet Based Mel Frequency Cepstral Features for Text Independent Speaker Identification
    Srivastava, Smriti
    Bhardwaj, Saurabh
    Bhandari, Abhishek
    Gupta, Krit
    Bahl, Hitesh
    Gupta, J. R. P.
    INTELLIGENT INFORMATICS, 2013, 182 : 237 - 247
  • [10] HILBERT ENVELOPE BASED FEATURES FOR ROBUST SPEAKER IDENTIFICATION UNDER REVERBERANT MISMATCHED CONDITIONS
    Sadjadi, Seyed Omid
    Hansen, John H. L.
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5448 - 5451