Dual-Channel VTS Feature Compensation with Improved Posterior Estimation

Cited by: 0
Authors
Lopez-Espejo, Ivan [1]
Peinado, Antonio M. [2]
Gomez, Angel M. [2]
Gonzalez, Jose A. [3]
Prieto-Calero, Santiago [1]
Affiliations
[1] VeriDas Das Nano, Tajonar, Spain
[2] Univ Granada, Dept Signal Theory Telemat & Commun, Granada, Spain
[3] Univ Malaga, Dept Languages & Comp Sci, Malaga, Spain
Source
2018 26th European Signal Processing Conference (EUSIPCO) | 2018
Keywords
VTS feature compensation; Posterior probability; Robust ASR; Dual-channel; Mobile device;
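DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline classification codes
0808 ; 0809 ;
Abstract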
The use of dual microphones is a powerful tool for noise-robust automatic speech recognition (ASR). In particular, it allows the reformulation of classical techniques such as vector Taylor series (VTS) feature compensation. In this work, we address a critical issue of VTS compensation, namely posterior computation, and propose an alternative way to estimate these probabilities more accurately when VTS is applied to enhance noisy speech captured by dual-microphone mobile devices. Our proposal models the conditional dependence of a noisy secondary channel given a primary one, and it outperforms not only single-channel VTS feature compensation but also a previous dual-channel VTS approach based on a stacked formulation. This is confirmed by recognition experiments on two different dual-channel extensions of the Aurora-2 corpus. These extensions emulate the use of a dual-microphone smartphone in close- and far-talk conditions, and our proposal obtains relevant improvements in the latter case.
Pages: 2065-2069
Number of pages: 5