Investigation into a Mel subspace based front-end processing for robust speech recognition

被引:1
|
作者
Selouani, SA [1 ]
O'Shaughnessy, D [1 ]
机构
[1] Univ Moncton, Moncton, NB E1A 3E9, Canada
关键词
speech recognition; neural networks; genetic algorithms; noise reduction;
D O I
10.1109/ISSPIT.2004.1433718
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the issue of noise reduction applied to robust large- vocabulary continuous-speech recognition (CSR). We investigate strategies based on the subspace filtering that has been proven very effective in the area of speech enhancement. We compare original hybrid techniques that combine the Karhonen-Loeve Transform (KLT), Multilayer Perceptron (MLP) and Genetic Algorithms (GAs) in order to get less-variant Mel-frequency parameters. The advantages of these methods include that they do not require estimation of either noise or speech spectra. To evaluate the effecteveness of these methods, an extensive set of recognition experiments are carried out in a severe interfering car noise environmentfor a wide range of SNRs varying from 16 dB to -4 dB using a noisy version of the TIMIT database.
引用
收藏
页码:187 / 190
页数:4
相关论文
共 50 条
  • [1] Robust Front-End based on MVA processing for Arabic Speech Recognition
    Techini, Elhem
    Sakka, Zied
    Bouhlel, MedSalim
    2017 INTERNATIONAL CONFERENCE ON ENGINEERING & MIS (ICEMIS), 2017,
  • [2] Robust Front-End Processing For Emotion Recognition In Noisy Speech
    Pandharipande, Meghna
    Chakraborty, Rupayan
    Panda, Ashish
    Kopparapu, Sunil Kumar
    2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 324 - 328
  • [3] ROBUST FRONT-END PROCESSING FOR SPEECH RECOGNITION IN NOISY CONDITIONS
    Das, Biswajit
    Panda, Ashish
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5235 - 5239
  • [4] Investigation of Speech Separation as a Front-End for Noise Robust Speech Recognition
    Narayanan, Arun
    Wang, DeLiang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (04) : 826 - 835
  • [5] A robust front-end for telephone speech recognition
    Cho, HY
    Chi, SM
    Oh, YH
    PRICAI'98: TOPICS IN ARTIFICIAL INTELLIGENCE, 1998, 1531 : 636 - 644
  • [6] A biological front-end processing for speech recognition
    Ferrandez, JM
    del Valle, D
    Rodellar, V
    Gomez, P
    BIOLOGICAL AND ARTIFICIAL COMPUTATION: FROM NEUROSCIENCE TO TECHNOLOGY, 1997, 1240 : 1058 - 1067
  • [7] The speech recognition based on the bark wavelet front-end processing
    Zhang, XY
    Jiao, ZP
    Zhao, ZF
    FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PT 2, PROCEEDINGS, 2005, 3614 : 302 - 305
  • [8] Investigation of Monaural Front-End Processing for Robust Speech Recognition Without Retraining or Joint-Training
    Du, Zhihao
    Zhang, Xueliang
    Han, Jiqing
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 249 - 254
  • [9] A comparison of front-end configurations for robust speech recognition
    Milner, B
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 797 - 800
  • [10] Auditory masking based acoustic front-end for robust speech recognition
    Paliwal, KK
    Lilly, BT
    IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997, : 165 - 168