Single-Channel Speech Enhancement Based on Improved Frame-Iterative Spectral Subtraction in the Modulation Domain

被引：0

作者：

Li, Chao ^{[1
]}

Jiang, Ting ^{[1
]}

Wu, Sheng ^{[1
]}

机构：

[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China

来源：

CHINA COMMUNICATIONS | 2021年 / 18卷 / 09期

基金：

中国国家自然科学基金; 国家自然科学基金重大项目;

关键词：

short-time modulation domain; single-channel speech enhancement; modulation improved frame iterative spectral subtraction; low SNRs; MEAN-SQUARE ERROR; NOISE; MAGNITUDE;

D O I：

暂无

中图分类号：

TN [电子技术、通信技术];

学科分类号：

0809 ;

摘要：

Aiming at the problem of music noise introduced by classical spectral subtraction, a short-time modulation domain (STM) spectral subtraction method has been successfully applied for single-channel speech enhancement. However, due to the inaccurate voice activity detection (VAD), the residual music noise and enhanced performance still need to be further improved, especially in the low signal to noise ratio (SNR) scenarios. To address this issue, an improved frame iterative spectral subtraction in the STM domain (IMModSSub) is proposed. More specifically, with the inter-frame correlation, the noise subtraction is directly applied to handle the noisy signal for each frame in the STM domain. Then, the noisy signal is classified into speech or silence frames based on a predefined threshold of segmented SNR. With these classification results, a corresponding mask function is developed for noisy speech after noise subtraction. Finally, exploiting the increased sparsity of speech signal in the modulation domain, the orthogonal matching pursuit (OMP) technique is employed to the speech frames for improving the speech quality and intelligibility. The effectiveness of the proposed method is evaluated with three types of noise, including white noise, pink noise, and hfchannel noise. The obtained results show that the proposed method outperforms some established baselines at lower SNRs (5 to +5 dB).

引用

页码：100 / 115

页数：16

共 50 条

[11] A GENERALIZED LOG-SPECTRAL AMPLITUDE ESTIMATOR FOR SINGLE-CHANNEL SPEECH ENHANCEMENT
Chinaev, Aleksej
Haeb-Umbach, Reinhold
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 4980 - 4984
[12] Single-channel speech enhancement with correlated spectral components: Limits-potential
Mowlaee, Pejman
Stahl, Johannes K. W.
SPEECH COMMUNICATION, 2020, 121 (121) : 58 - 69
[13] Iterative Closed-Loop Phase-Aware Single-Channel Speech Enhancement
Mowlaee, Pejman
Saeidi, Rahim
IEEE SIGNAL PROCESSING LETTERS, 2013, 20 (12) : 1235 - 1239
[14] Two-Stage Single-Channel Speech Enhancement with Multi-Frame Filtering
Lin, Shaoxiong
Zhang, Wangyou
Qian, Yanmin
APPLIED SCIENCES-BASEL, 2023, 13 (08):
[15] SINGLE CHANNEL SPEECH ENHANCEMENT IN THE MODULATION DOMAIN: NEW INSIGHTS IN THE MODULATION CHANNEL SELECTION FRAMEWORK
Boldt, Jesper B.
Bertelsen, Andreas T.
Gran, Fredrik
Jorgensen, Soren
Dau, Torsten
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5748 - 5752
[16] Phase Based Single-Channel Speech Enhancement Using Phase Ratio
Singh, Sachin
Mutawa, A. M.
Gupta, Monika
Tripathy, Manoj
Anand, R. S.
2017 6TH INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS IN ELECTRICAL ENGINEERING - RECENT ADVANCES (CERA), 2017, : 393 - 396
[17] Single-channel speech enhancement based on joint constrained dictionary learning
Linhui Sun
Yunyi Bu
Pingan Li
Zihao Wu
EURASIP Journal on Audio, Speech, and Music Processing, 2021
[18] Single-channel speech enhancement based on joint constrained dictionary learning
Sun, Linhui
Bu, Yunyi
Li, Pingan
Wu, Zihao
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2021, 2021 (01)
[19] Enhancement of Single-Channel Periodic Signals in the Time-Domain
Jensen, Jesper Rindom
Benesty, Jacob
Christensen, Mads Graesboll
Jensen, Soren Holdt
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (07): : 1948 - 1963
[20] Single-Channel Speech Enhancement Based on Adaptive Low-Rank Matrix Decomposition
Li, Chao
Jiang, Ting
Wu, Sheng
Xie, Jianxiao
IEEE ACCESS, 2020, 8 : 37066 - 37076

← 1 2 3 4 5 →