NOISE AND SPEAKER COMPENSATION IN THE LOG FILTER BANK DOMAIN

被引:0
|
作者
Joshi, Vikas [1 ]
Bilgi, Raghavendra [1 ]
Umesh, S. [1 ]
Garcia, L. [2 ]
Benitez, C. [2 ]
机构
[1] Indian Inst Technol, Dept Elect Engn, Madras 600036, Tamil Nadu, India
[2] Univ Granada, Dept Signal Theory Telemat & Commun, E-18071 Granada, Spain
关键词
Speaker Normalization; Noise Compensation; VTS; TVTLN; Noise and Speaker compensation;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a method to compensate for noise and speaker-variability directly in the Log filter-bank (FB) domain, so that MFCC features are robust to noise and speaker-variations. For noise-compensation, we use Vector Taylor Series (VTS) approach in the Log FB domain, and speaker-normalization is also done in the Log FB domain using Linear Vocal tract length (VTLN) matrices. For VTLN, optimal selection of warp-factor is done in Log FB domain using canonical GMM model, avoiding the two-pass approach needed by a HMM model. Further, this can be efficiently implemented using sufficient statistics obtained from the GMM and the FB-VTLN-matrices. The warp-factor selection using GMM can also be done in cepstral domain by applying DCT matrices without the usual approximations associated with conventional linear-VTLN. The elegance of the proposed approach is that given the speech data, we obtain directly MFCC features that are robust to noise and speaker-variations. The proposed approach, show a significant relative improvement of 31% over baseline on Aurora-4 task.
引用
收藏
页码:4709 / 4712
页数:4
相关论文
共 50 条
  • [41] LDI filter bank for ADC frequency domain analysis
    Rebai, C
    Dallet, D
    Marchegay, P
    ICES 2002: 9TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS, VOLS I-111, CONFERENCE PROCEEDINGS, 2002, : 907 - 910
  • [42] Multiple operating points in a CMOS log-domain filter
    Fox, RM
    Nagarajan, M
    ISCAS '99: PROCEEDINGS OF THE 1999 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL 2: ANALOG AND DIGITAL CIRCUITS, 1999, : 689 - 692
  • [43] Log-domain synthesis of nth order universal filter
    Shah, Nisar Ahmad
    Khanday, Farooq Ahmad
    ANALOG INTEGRATED CIRCUITS AND SIGNAL PROCESSING, 2009, 59 (03) : 309 - 315
  • [44] Multiple operating points in a CMOS log-domain filter
    Fox, RM
    Nagarajan, M
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-ANALOG AND DIGITAL SIGNAL PROCESSING, 1999, 46 (06): : 705 - 710
  • [45] Approximation of arbitrary complex filter responses and their realisation in log domain
    Teplechuk, M. A.
    Sewell, J. I.
    IEE PROCEEDINGS-CIRCUITS DEVICES AND SYSTEMS, 2006, 153 (06): : 583 - 590
  • [46] Log-domain synthesis of nth order universal filter
    Nisar Ahmad Shah
    Farooq Ahmad Khanday
    Analog Integrated Circuits and Signal Processing, 2009, 59 : 309 - 315
  • [47] An analysis of matching in the Tau Cell log-domain filter
    Hamilton, Tara Julia
    Jin, Craig
    van Schaik, Andre
    2006 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, PROCEEDINGS, 2006, : 421 - +
  • [48] Log-domain all pass filter based on integrators
    N. A. Shah
    S. Z. Iqbal
    Nusrat Parveen
    Analog Integrated Circuits and Signal Processing, 2011, 67 : 85 - 88
  • [49] Syllabically companding log domain filter using dynamic biasing
    Frey, DR
    Tsividis, YP
    ELECTRONICS LETTERS, 1997, 33 (18) : 1506 - 1507
  • [50] A Log-domain Bandpass Filter for Implementation of Wavelet Transform
    Huang Qingxiu
    He Yigang
    PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON TEST AUTOMATION AND INSTRUMENTATION, VOL 4, 2008, : 2080 - 2083