Two-Stage Temporal Processing for Single-Channel Speech Enhancement

被引:4
|
作者
Samui, Sunzan [1 ]
Chakrabarti, Indrajit [1 ]
Ghosh, Soumya Kanti [1 ]
机构
[1] Indian Inst Technol, Kharagpur, W Bengal, India
关键词
Speech enhancement; noise-reduction; noise estimation; temporal processing; ALGORITHMS;
D O I
10.21437/Interspeech.2016-307
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Most of the conventional speech enhancement methods operating in the spectral domain often suffer from spurious artifact called musical noise. Moreover, these methods also incur an extra overhead time for noise power spectral density estimation. In this paper, a speech enhancement framework is proposed by cascading two temporal processing stages. The first stage performs excitation source based temporal processing that involves identifying and boosting the excitation source based speech specific features present at the gross and fine temporal levels, whereas the second stage provides noise reduction by estimating standard deviation of noise in time-domain by using a robust estimator. The proposed noise reduction stage is quite simply implementable and computationally less complex as it does not require noise estimation in spectral domain as a pre-processing phase. The experimental results have established that the proposed scheme produces on an average 60-65 % improvement in the speech quality (PESQ scores) and intelligibility (STOI scores) at 0 and -5 dB input SNR when compared to existing standard approaches.
引用
收藏
页码:3723 / 3727
页数:5
相关论文
共 50 条
  • [1] A two-stage method for single-channel speech enhancement
    Hamid, ME
    Fukabayashi, T
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2006, E89A (04) : 1058 - 1068
  • [2] Two-Stage Single-Channel Speech Enhancement with Multi-Frame Filtering
    Lin, Shaoxiong
    Zhang, Wangyou
    Qian, Yanmin
    APPLIED SCIENCES-BASEL, 2023, 13 (08):
  • [3] Supervised Single-Channel Speech Dereverberation and Denoising Using a Two-Stage Processing
    Zhang, Long
    Ehen, Jiaxu
    Luo, You
    Fu, Jiafei
    Ye, Zhongfu
    2017 2ND INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2017), 2017, : 818 - 822
  • [4] Phase Processing for Single-Channel Speech Enhancement
    Gerkmann, Timo
    Krawczyk-Becker, Martin
    Le Roux, Jonathan
    IEEE SIGNAL PROCESSING MAGAZINE, 2015, 32 (02) : 55 - 66
  • [5] TWO-STAGE DATA-DRIVEN SINGLE CHANNEL SPEECH ENHANCEMENT WITH CEPSTRAL ANALYSIS PRE-PROCESSING
    Rao, Yu
    Vahanesa, Chetan
    Reddy, Chandan K. A.
    Panahi, Issa M. S.
    2015 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2015, : 702 - 706
  • [6] Two-stage UNet with channel and temporal-frequency attention for multi-channel speech enhancement
    Xu, Shiyun
    Cao, Yinghan
    Zhang, Zehua
    Wang, Mingjiang
    SPEECH COMMUNICATION, 2025, 166
  • [7] Weak Speech Recovery for Single-Channel Speech Enhancement
    Wong, Arthur
    Ming, Kok
    Low, Siow Yong
    2012 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT AND ADVANCED SYSTEMS (ICIAS), VOLS 1-2, 2012, : 627 - 631
  • [8] Supervised single-channel speech dereverberation and denoising using a two-stage model based sparse representation
    Zhang Long
    Xu Xu
    Chen Huang
    Chen Jiaxu
    Ye Zhongfu
    SPEECH COMMUNICATION, 2018, 97 : 1 - 8
  • [9] Single-Channel Speech Enhancement Techniques for Distant Speech Recognition
    Ashwini, Jaya
    Kumaraswamy, Ramaswamy
    JOURNAL OF INTELLIGENT SYSTEMS, 2013, 22 (02) : 81 - 93
  • [10] A TWO-STAGE SINGLE-CHANNEL SPEAKER-DEPENDENT SPEECH SEPARATION APPROACH FOR CHIME-5 CHALLENGE
    Sun, Lei
    Du, Jun
    Gao, Tian
    Fang, Yi
    Ma, Feng
    Pan, Jia
    Lee, Chin-Hui
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6650 - 6654