Single-Channel Speech Enhancement Based on Improved Frame-Iterative Spectral Subtraction in the Modulation Domain

Cited by: 0
Authors
Li, Chao [1 ]
Jiang, Ting [1 ]
Wu, Sheng [1 ]
Affiliation
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China
Funding
National Natural Science Foundation of China; Major Program of the National Natural Science Foundation of China;
Keywords
short-time modulation domain; single-channel speech enhancement; modulation improved frame iterative spectral subtraction; low SNRs; MEAN-SQUARE ERROR; NOISE; MAGNITUDE;
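DOI
Not available
Chinese Library Classification number
TN [Electronic technology, communication technology];
Discipline classification code
0809;
Abstract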
Short-time modulation (STM) domain spectral subtraction has been successfully applied to single-channel speech enhancement to address the musical noise introduced by classical spectral subtraction. However, because voice activity detection (VAD) is inaccurate, residual musical noise remains and enhancement performance still needs improvement, especially in low signal-to-noise ratio (SNR) scenarios. To address this issue, an improved frame-iterative spectral subtraction method in the STM domain (IMModSSub) is proposed. Specifically, exploiting inter-frame correlation, noise subtraction is applied directly to each frame of the noisy signal in the STM domain. The noisy signal is then classified into speech and silence frames based on a predefined segmental SNR threshold. Using these classification results, a corresponding mask function is developed for the noisy speech after noise subtraction. Finally, exploiting the increased sparsity of the speech signal in the modulation domain, the orthogonal matching pursuit (OMP) technique is applied to the speech frames to improve speech quality and intelligibility. The method is evaluated with three types of noise: white noise, pink noise, and hfchannel noise. The results show that the proposed method outperforms some established baselines at lower SNRs (-5 to +5 dB).
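The abstract names orthogonal matching pursuit but does not give the dictionary, sparsity level, or stopping rule used in the paper. The following is a minimal, generic sketch of OMP in Python with NumPy, not the authors' implementation. The function name `omp`, the orthonormal dictionary `atoms`, and the sparsity level of 3 are assumptions chosen only for illustration.

```python
import numpy as np


def omp(dictionary, signal, n_nonzero):
    """Generic orthogonal matching pursuit (illustrative, not from the paper).

    Approximates `signal` using at most `n_nonzero` columns (atoms) of
    `dictionary` and returns the sparse coefficient vector.
    """
    signal = np.asarray(signal, dtype=float)
    residual = signal.copy()
    support = []
    solution = np.zeros(0)
    coeffs = np.zeros(dictionary.shape[1])
    for _ in range(n_nonzero):
        # Select the atom most correlated with the current residual.
        correlations = np.abs(dictionary.T @ residual)
        if support:
            correlations[support] = 0.0  # never pick the same atom twice
        support.append(int(np.argmax(correlations)))
        # Jointly re-fit all selected atoms by least squares.
        sub = dictionary[:, support]
        solution, *_ = np.linalg.lstsq(sub, signal, rcond=None)
        residual = signal - sub @ solution
        if np.linalg.norm(residual) < 1e-10:
            break
    coeffs[support] = solution
    return coeffs


# Assumed toy setup: an orthonormal dictionary and a 3-sparse coefficient vector.
rng = np.random.default_rng(0)
atoms, _ = np.linalg.qr(rng.standard_normal((32, 32)))
true_coeffs = np.zeros(32)
true_coeffs[[3, 10, 20]] = [2.0, -1.5, 0.7]
frame = atoms @ true_coeffs

recovered = omp(atoms, frame, n_nonzero=3)
print(np.allclose(recovered, true_coeffs))
```

The toy dictionary is orthonormal so that recovery is exact. With the overcomplete dictionaries typically used for sparse speech representations, recovery depends on the coherence of the atoms and on the chosen sparsity level.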
Pages: 100-115
Page count: 16