Channel Compensation in the Generalised Vector Taylor Series Approach to Robust ASR

Cited by: 4
Authors
Loweimi, Erfan [1]
Barker, Jon [1]
Hain, Thomas [1]
Affiliation
[1] Univ Sheffield, Speech & Hearing Res Grp SPandH, Sheffield, S Yorkshire, England
Source
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION | 2017
Keywords
robust speech recognition; generalised Vector Taylor Series; channel noise estimation
DOI
10.21437/Interspeech.2017-211
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Vector Taylor Series (VTS) is a powerful technique for robust ASR but, in its standard form, it can only be applied to log-filter bank and MFCC features. In earlier work, we presented a generalised VTS (gVTS) that extends the applicability of VTS to front-ends which employ a power transformation non-linearity. gVTS was shown to provide performance improvements in both clean and additive noise conditions. This paper makes two novel contributions. First, while the previous gVTS formulation assumed that noise was purely additive, we now derive gVTS formulae for the case of speech in the presence of both additive noise and channel distortion. Second, we propose a novel iterative method for estimating the channel distortion which utilises gVTS itself and converges after a few iterations. Since the new gVTS blindly assumes the existence of both additive noise and channel effects, it is important not to introduce extra distortion when either are absent. Experimental results on the LVCSR Aurora-4 database show that the new formulation passes this test. In the presence of channel noise only, it provides relative WER reductions of up to 30% and 26% compared with the previous gVTS and multi-style training with cepstral mean normalisation, respectively.
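The relationship between the log-domain and power-transformation front-ends mentioned in the abstract can be illustrated with a small sketch. It assumes the common environment model in which filter bank energies combine as Y = X*H + N (speech scaled by a channel gain plus uncorrelated additive noise); the function names `power_mismatch` and `log_mismatch` and the example values are hypothetical, and this is not claimed to be the exact formulation derived in the paper.

```python
import math

def power_mismatch(x, n, h, alpha):
    """Noisy feature under the power compression z -> z**alpha.

    x, n, h are the compressed clean speech, additive noise and channel
    gain. Assumes energies combine as Y = X*H + N (an assumption of this
    sketch, not taken from the record).
    """
    inv = 1.0 / alpha
    return ((x ** inv) * (h ** inv) + n ** inv) ** alpha

def log_mismatch(x, n, h):
    """Standard log-domain mismatch: y = x + h + log(1 + exp(n - x - h))."""
    return x + h + math.log1p(math.exp(n - x - h))

# Hypothetical energies: speech X, noise N, channel gain H, so Y = X*H + N.
X, N, H = 4.0, 1.0, 0.5
alpha = 0.1
Y = X * H + N

y_pow = power_mismatch(X ** alpha, N ** alpha, H ** alpha, alpha)
y_log = log_mismatch(math.log(X), math.log(N), math.log(H))

# With no additive noise (n = 0) and a unit channel gain (h = 1),
# the model should return the clean feature unchanged.
x_clean = X ** alpha
y_clean = power_mismatch(x_clean, 0.0, 1.0, alpha)
```

Both functions describe the same underlying energy relation in different feature domains, and the last lines check the property the abstract stresses: a model that always assumes noise and channel effects should introduce no distortion when both are absent.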
Pages: 2466-2470
Number of pages: 5
Related papers
50 in total
  • [1] Use of Generalised Nonlinearity in Vector Taylor Series Noise Compensation for Robust Speech Recognition
    Loweimi, Erfan
    Barker, Jon
    Hain, Thomas
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3798 - 3802
  • [2] CVX-Optimized Beamforming and Vector Taylor Series Compensation with German ASR Employing Star-Shaped Microphone Array
    Morales-Cordovilla, Juan A.
    Pessentheiner, Hannes
    Hagmueller, Martin
    Gonzalez, Jose A.
    Kubin, Gernot
    ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, IBERSPEECH 2014, 2014, 8854 : 148 - 157
  • [3] Feature compensation algorithm based on vector Taylor series for speaker recognition
    Wu, Haiyang
    Yang, Feiran
    Zhou, Lin
    Wu, Zhenyang
    Shengxue Xuebao/Acta Acustica, 2013, 38 (01): : 105 - 112
  • [4] NOISE ADAPTIVE TRAINING USING A VECTOR TAYLOR SERIES APPROACH FOR NOISE ROBUST AUTOMATIC SPEECH RECOGNITION
    Kalinli, Ozlem
    Seltzer, Michael L.
    Acero, Alex
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3825 - 3828
  • [5] ON NOISE ESTIMATION FOR ROBUST SPEECH RECOGNITION USING VECTOR TAYLOR SERIES
    Zhao, Yong
    Juang, Biing-Hwang
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4290 - 4293
  • [6] SECOND ORDER VECTOR TAYLOR SERIES BASED ROBUST SPEECH RECOGNITION
    Bu, Suliang
    Qian, Yanmin
    Sim, Khe Chai
    You, Yongbin
    Yu, Kai
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [7] Some theorems on the Taylor series of generalised gaps
    Gergen, JJ
    COMPTES RENDUS HEBDOMADAIRES DES SEANCES DE L ACADEMIE DES SCIENCES, 1927, 184 : 1040 - 1043
  • [8] A NOISE ROBUST I-VECTOR EXTRACTOR USING VECTOR TAYLOR SERIES FOR SPEAKER RECOGNITION
    Lei, Yun
    Burget, Lukas
    Scheffer, Nicolas
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6788 - 6791
  • [9] Robust ASR using support vector machines
    Solera-Urena, R.
    Martin-Iglesias, D.
    Gallardo-Antolin, A.
    Pelaez-Moreno, C.
    Diaz-de-Maria, F.
    SPEECH COMMUNICATION, 2007, 49 (04) : 253 - 267
  • [10] Support Vector Machines for Noise Robust ASR
    Gales, M. J. F.
    Ragni, A.
    AlDamarki, H.
    Gautier, C.
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 205 - 210