Developing the ETSI Aurora advanced distributed speech recognition front-end & What next?

Cited by: 0
Authors
Pearce, D [1]
Affiliation
[1] Motorola Labs, Basingstoke, Hants, England
Source
ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS | 2001
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The ETSI STQ-Aurora DSR working group are developing the standard for the Advanced DSR front-end. One of the main goals of the advanced front-end is improved robustness to noise compared to the existing ETSI DSR standard for the Mel-Cepstrum front-end. The purpose of the paper is firstly to inform the wider speech research community about this activity and then to promote discussion on what further needs there are for DSR front-end standards. The scope of the DSR standard is described, along with the set of performance requirements that Aurora has specified for the Advanced Front-end. An important part of this is the evaluation and characterisation of the performance of candidate front-ends on noisy databases, and an overview of these is given. As the competition to select the best proposal draws to a close (submission deadline 28th Nov 2001), an interesting question is "what next?".
Pages: 131-134
Number of pages: 4