Single-Channel Speech Dereverberation in Noisy Environment for Non-Orthogonal Signals

Cited by: 0
Authors
Fahim, Abdullah [1]
Samarasinghe, Prasanga N. [1]
Abhayapala, Thushara D. [1]
Affiliations
[1] Australian National University, Research School of Engineering, Canberra, ACT, Australia
Funding
Australian Research Council;
Keywords
SPECTRAL SUBTRACTION; REVERBERANT; INTELLIGIBILITY; SUPPRESSION; QUALITY;
DOI
10.3813/AAA.919270
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
Reverberation reduces speech quality, limits the performance of automatic speech recognition systems, and impairs the performance of hearing aids. Spectral enhancement (SE) is a popular method for suppressing late reverberation and background noise. However, conventional SE-based approaches assume orthogonality between the desired and undesired signal components. This assumption does not hold in most practical cases because speech signals have limited time-domain support and are only short-time stationary, and its violation degrades estimation accuracy. To circumvent this issue, Lu et al. relaxed the orthogonality assumption by proposing a geometric approach to spectral subtraction (GSS) and evaluated their algorithm against different kinds of background noise. In our work, we analyze the model by means of a simplified GSS transfer function to gain insight into the algorithm. We conduct a series of experiments to validate GSS and explore its limitations in diverse realistic scenarios with both reverberation and background noise, using a complete end-to-end system for speech dereverberation and noise suppression. We also analyze the performance of GSS on the experimental data of the 2014 REVERB challenge and compare it with conventional approaches (spectral subtraction, the Wiener filter, the minimum mean square error short-time spectral amplitude estimator, and the log spectral amplitude estimator) as well as with contemporary methods from the 2014 REVERB challenge.
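The orthogonality assumption mentioned in the abstract can be illustrated with a small numerical sketch. It is not the authors' GSS implementation. It only shows that, for a single short frame, the exact identity |Y|^2 = |X|^2 + |D|^2 + 2 Re(X D*) contains a cross term that conventional power spectral subtraction sets to zero. The function name `cross_term_ratio`, the frame length of 512 samples, the Hann window, and the white-noise stand-ins for the desired and undesired signals are all assumptions made for illustration.

```python
import numpy as np


def cross_term_ratio(x, d, n_fft=512):
    """Per-bin ratio of the cross term 2*Re(X D*) to |X|^2 + |D|^2 for one frame.

    x, d: hypothetical desired and undesired time-domain signals
          (at least n_fft samples each).
    """
    win = np.hanning(n_fft)
    X = np.fft.rfft(win * x[:n_fft])
    D = np.fft.rfft(win * d[:n_fft])
    Y = X + D  # the short-time spectrum of the mixture is additive

    cross = 2.0 * np.real(X * np.conj(D))
    power_sum = np.abs(X) ** 2 + np.abs(D) ** 2

    # Exact identity; the orthogonality assumption drops `cross`.
    assert np.allclose(np.abs(Y) ** 2, power_sum + cross)

    # Small constant avoids division by zero in empty bins.
    return cross / (power_sum + 1e-12)


# Two independent white-noise frames stand in for speech and interference.
rng = np.random.default_rng(0)
x = rng.standard_normal(512)
d = rng.standard_normal(512)

ratio = cross_term_ratio(x, d)
print("mean |cross term| relative to |X|^2 + |D|^2:", float(np.mean(np.abs(ratio))))
```

Even though the two signals are statistically independent, the per-frame cross term is generally far from zero, because orthogonality holds only in expectation and not for a single finite-length frame. A method that keeps this term, as the geometric approach is described as doing, avoids the resulting bias.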
Pages: 1041-1055
Page count: 15