Auditory pathway model and its VLSI implementation for robust speech recognition in real-world noisy environment

Cited by: 0
Authors
Lee, SY [1]
Kim, CM [1]
Won, YG [1]
Park, HM [1]
Affiliations
[1] Korea Adv Inst Sci & Technol, Brain Sci Res Ctr, Dept BioSyst, Taejon 305701, South Korea
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A robust speech recognition system is reported, based on mathematical models of the auditory pathway and their VLSI implementations. The auditory model consists of three components: nonlinear feature extraction at the cochlea, binaural processing at the superior olivary complex, and top-down attention through the backward path. Feature extraction is based on a cochlear filter bank and time-frequency masking, which is modeled with lateral inhibition in both the time and frequency domains. Unlike popular binaural processing models based on simple interaural time delay and interaural intensity difference, our model incorporates hundreds of time delays for noisy reverberated signals. The top-down (TD) attention comes from familiarity and/or importance of the sound, and a simple but efficient TD attention model has been developed based on the error backpropagation algorithm. These auditory models require intensive computation, and special hardware has been developed for real-time applications. Experimental results demonstrate much better recognition performance in real-world noisy environments.
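The abstract describes time-frequency masking modeled as lateral inhibition along both the time and frequency axes. The following is a minimal sketch of that general idea only, not the authors' model: the function name `lateral_inhibition`, the nearest-neighbor kernel, the weights of 0.25, and the edge handling are all assumptions made for illustration. Each filter-bank output is reduced by a weighted sum of its immediate neighbors in frequency and in time, then half-wave rectified.

```python
def lateral_inhibition(spec, freq_weight=0.25, time_weight=0.25):
    """Illustrative lateral inhibition on a channels-by-frames energy grid.

    spec is a list of rows (one row per frequency channel, one column per
    time frame) of non-negative values. The kernel and weights here are
    assumptions for this sketch, not values taken from the paper.
    """
    n_channels = len(spec)
    n_frames = len(spec[0])

    def at(c, t):
        # Repeat edge values so border cells have neighbors on every side.
        c = min(max(c, 0), n_channels - 1)
        t = min(max(t, 0), n_frames - 1)
        return spec[c][t]

    out = []
    for c in range(n_channels):
        row = []
        for t in range(n_frames):
            # Inhibition from adjacent channels (frequency axis)
            # and from adjacent frames (time axis).
            inhibition = (freq_weight * (at(c - 1, t) + at(c + 1, t))
                          + time_weight * (at(c, t - 1) + at(c, t + 1)))
            # Half-wave rectification keeps the output non-negative.
            row.append(max(spec[c][t] - inhibition, 0.0))
        out.append(row)
    return out
```

With these assumed weights, a flat region of the grid is suppressed to zero while an isolated peak passes through unchanged, which is the contrast-enhancing behavior that lateral inhibition is generally used for.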
Pages: 1728-1733
Number of pages: 6
Related papers
50 in total
  • [1] Auditory model for robust speech recognition in real world noisy environments
    Kim, DS
    Lee, SY
    Kil, RM
    Zhu, XL
    ELECTRONICS LETTERS, 1997, 33 (01) : 12 - 13
  • [2] Auditory processing of speech signals for robust speech recognition in real-world noisy environments
    Kim, DS
    Lee, SY
    Kil, RM
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (01) : 55 - 69
  • [3] Voice Command II: A DSP implementation of robust speech recognition in real-world noisy environments
    Lee, SY
    Kim, DS
    Ahn, KH
    Jeong, JH
    Kim, H
    Park, SY
    Kim, LY
    Lee, JS
    Lee, HY
    PROGRESS IN CONNECTIONIST-BASED INFORMATION SYSTEMS, VOLS 1 AND 2, 1998, : 1051 - 1054
  • [4] VLSI Architecture for Robust Speech Recognition Systems and its Implementation on a Verification Platform
    Yoshizawa, Shingo
    Hayasaka, Noboru
    Wada, Naoya
    Miyanaga, Yoshikazu
    JOURNAL OF ROBOTICS AND MECHATRONICS, 2005, 17 (04) : 447 - 455
  • [5] An auditory model for robust speech recognition
    Luo, Xuewen
    Soon, Ing Yann
    Yeo, Chai Kiat
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1105 - 1109
  • [6] A digital chip for robust speech recognition in noisy environment
    Kim, CM
    Lee, SY
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 1089 - 1092
  • [7] A model of dynamic auditory perception and its application to robust speech recognition
    Strope, B
    Alwan, A
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 37 - 40
  • [8] Robust emotional speech recognition based on binaural model and emotional auditory mask in noisy environments
    Bashirpour, Meysam
    Geravanchizadeh, Masoud
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2018
  • [9] Robust emotional speech recognition based on binaural model and emotional auditory mask in noisy environments
    Meysam Bashirpour
    Masoud Geravanchizadeh
    EURASIP Journal on Audio, Speech, and Music Processing, 2018
  • [10] Speech recognition with wavelet spectral subtraction in real noisy environment
    Denda, N
    Nishiura, T
    Kawahara, H
    Irino, T
    2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 638 - 641