Noise reduction based on microphone array and post-filtering for robust speech recognition

被引:0
|
作者
Li, Junfeng [1 ]
Akagi, Masato [2 ]
Suzuki, Yoiti [1 ]
机构
[1] Tohoku Univ, Res Inst Elect Commun, 2-1-1 Katahira, Sendai, Miyagi 980, Japan
[2] JAIST, Sch Informat Sci, Ishikawa, Japan
来源
2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4 | 2006年
关键词
microphone array; post-filter; speech recognition;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes a noise reduction system using microphone array and post-filtering to improve the recognition accuracy and robustness of hands-free speech recognition systems in adverse environments. In this research, we suppose that undesired noises are of localized and non-localized noise components. To deal with localized noise, we propose a hybrid noise estimation technique and a robust and accurate speech absence probability estimator to calculate the spectra of localized noise, which is further reduced by spectral subtraction. To deal with non-localized noise, we propose a hybrid post-filter with an assumption of a diffuse noise field. Speech recognition results show that the proposed noise reduction algorithm outperforms the other traditional algorithms in the tested noisy conditions.
引用
收藏
页码:680 / +
页数:2
相关论文
共 50 条
  • [31] An improved noise reduction algorithm for speech signals using a microphone array
    Van Binh Truong
    Due Minh Nguyen
    Quang Hieu Dang
    2014 IEEE FIFTH INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS (ICCE), 2014, : 472 - 477
  • [32] Multichannel post-filtering in nonstationary noise environments
    Cohen, I
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (05) : 1149 - 1160
  • [33] Microphone array system for speech recognition
    Kiyohara, K
    Kaneda, Y
    Takahashi, S
    Nomura, H
    Kojima, J
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS, 1997, : 215 - 218
  • [34] MICROPHONE ARRAY POST-FILTER BASED ON AUDITORY FILTERING
    Li, Peng
    Liao, Fengchai
    Cheng, Ning
    Xu, Bo
    Liu, Wenju
    2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 374 - 377
  • [35] Robust Feature Combination for Speech Recognition Using Linear Microphone Array in a Car
    Obuchi, Yasunari
    Hataoka, Nobuo
    IN-VEHICLE CORPUS AND SIGNAL PROCESSING FOR DRIVER BEHAVIOR, 2009, : 187 - +
  • [36] Signal subspace speech enhancement with perceptual post-filtering
    Klein, M
    Kabal, P
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 537 - 540
  • [37] Improved Kalman filter-based speech enhancement with perceptual post-filtering
    Wei, JQ
    Du, LM
    Yan, ZL
    Hui, Z
    CHINESE JOURNAL OF ELECTRONICS, 2004, 13 (02): : 300 - 304
  • [38] Microphone array based speech recognition with different talker-array positions
    Omologo, M
    Matassoni, M
    Svaizer, P
    Giuliani, D
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 227 - 230
  • [39] Signal-to-noise ratio adaptive post-filtering method for intelligibility enhancement of telephone speech
    Jokinen, Emma
    Yrttiaho, Santeri
    Pulakka, Hannu
    Vainio, Martti
    Alku, Paavo
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 132 (06): : 3990 - 4001
  • [40] STATISTICAL MODIFICATION BASED POST-FILTERING TECHNIQUE FOR HMM-BASED SPEECH SYNTHESIS
    Wen, Zhengqi
    Tao, Jianhua
    Che, Hao
    2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, : 146 - 149