Developing the ETSI Aurora advanced distributed speech recognition front-end & What next?

Cited by: 0
Authors
Pearce, D [1]
Affiliation
[1] Motorola Labs, Basingstoke, Hants, England
Source
ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS | 2001
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Theory of artificial intelligence];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The ETSI STQ-Aurora DSR working group are developing the standard for the Advanced DSR front-end. One of the main goals of the advanced front-end is improved robustness to noise compared to the existing ETSI DSR standard for the Mel-Cepstrum front-end. The purpose of the paper is firstly to inform the wider speech research community about this activity and then to promote discussion on what further needs there are for DSR front-end standards. The scope of the DSR standard is described, together with the set of performance requirements that Aurora has specified for the Advanced Front-end. An important part of this is the evaluation and characterisation of the performance of candidate front-ends on noisy databases, and an overview of these is given. As the competition to select the best proposal draws to a close (submission deadline 28th Nov 2001), an interesting question is "what next?".
Pages: 131-134
Number of pages: 4