LEARNING SPECTRAL MAPPING FOR SPEECH DEREVERBERATION

被引:0
|
作者
Han, Kun [1 ,2 ]
Wang, Yuxuan [1 ,2 ]
Wang, DeLiang [1 ,2 ]
机构
[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
[2] Ohio State Univ, Ctr Cognit & Brain Sci, Columbus, OH 43210 USA
关键词
Speech Dereverberation; Deep Neural Networks; Spectral Mapping; REVERBERANT SPEECH; INTELLIGIBILITY; ALGORITHM;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Reverberation distorts human speech and usually has negative effects on speech intelligibility, especially for hearing-impaired listeners. It also causes performance degradation in automatic speech recognition and speaker identification systems. Therefore, the dereverberation problem must be dealt with in daily listening environments. We propose to use deep neural networks (DNNs) to learn a spectral mapping from the reverberant speech to the anechoic speech. The trained DNN produces the estimated spectral representation of the corresponding anechoic speech. We demonstrate that distortion caused by reverberation is substantially attenuated by the DNN whose outputs can be resynthesized to the dereverebrated speech signal. The proposed approach is simple, and our systematic evaluation shows promising dereverberation results, which are significantly better than those of related systems.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] A Forced Spectral Diversity Algorithm for Speech Dereverberation in the Presence of Near-Common Zeros
    Lin, Xiang
    Khong, Andy W. H.
    Naylor, Patrick A.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (03): : 888 - 899
  • [32] Dual microphone speech dereverberation
    Abutalebi, Hamid Reza
    Faghani, Farhad
    2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3, 2007, : 847 - 850
  • [33] JOINT ACOUSTIC AND SPECTRAL MODELING FOR SPEECH DEREVERBERATION USING NON-NEGATIVE REPRESENTATIONS
    Mohammadiha, Nasser
    Smaragdis, Paris
    Doclo, Simon
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4410 - 4414
  • [34] Speech Recognition by Denoising and Dereverberation Based on Spectral Subtraction in a Real Noisy Reverberant Environment
    Odani, Kyohei
    Wang, Longbiao
    Kai, Atsuhiko
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1250 - 1253
  • [35] On phase recovery and preserving early reflections for deep-learning speech dereverberation
    Luo, Xiaoxue
    Ke, Yuxuan
    Li, Xiaodong
    Zheng, Chengshi
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2024, 155 (01): : 436 - 451
  • [36] Learning Complex Spectral Mapping for Speech Enhancement with Improved Cross-corpus Generalization
    Pandey, Ashutosh
    Wang, DeLiang
    INTERSPEECH 2020, 2020, : 4511 - 4515
  • [37] Learning Complex Spectral Mapping With Gated Convolutional Recurrent Networks for Monaural Speech Enhancement
    Tan, Ke
    Wang, DeLiang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 380 - 390
  • [38] Multimicrophone Speech Dereverberation: Experimental Validation
    Koen Eneman
    Marc Moonen
    EURASIP Journal on Audio, Speech, and Music Processing, 2007
  • [39] Overfitting-resistant speech dereverberation
    Yoshioka, Takuya
    Nakatani, Tomohiro
    Hikichi, Takafumi
    Miyoshi, Masato
    2007 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 2007, : 101 - 104
  • [40] Towards speech recognition oriented dereverberation
    Jinachitra, P
    Prieto, RE
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 437 - 440