Single-Channel Speech Enhancement Based on Improved Frame-Iterative Spectral Subtraction in the Modulation Domain

Cited by: 0

Authors:

Li, Chao [1]

Jiang, Ting [1]

Wu, Sheng [1]

Affiliation:

[1] Beijing University of Posts and Telecommunications, School of Information and Communication Engineering, Beijing 100876, People's Republic of China

Source:

CHINA COMMUNICATIONS | 2021, Vol. 18, No. 9

Funding:

National Natural Science Foundation of China; Major Program of the National Natural Science Foundation of China

Keywords:

short-time modulation domain; single-channel speech enhancement; modulation improved frame iterative spectral subtraction; low SNRs; MEAN-SQUARE ERROR; NOISE; MAGNITUDE;

DOI:

Not available

Chinese Library Classification:

TN [Electronic technology, communication technology];

Subject classification code:

0809;

Abstract:

To address the musical noise introduced by classical spectral subtraction, a short-time modulation (STM) domain spectral subtraction method has been successfully applied to single-channel speech enhancement. However, because voice activity detection (VAD) is inaccurate, residual musical noise remains and enhancement performance still needs to be improved, especially in low signal-to-noise ratio (SNR) scenarios. To address this issue, an improved frame-iterative spectral subtraction in the STM domain (IMModSSub) is proposed. More specifically, exploiting inter-frame correlation, noise subtraction is applied directly to each frame of the noisy signal in the STM domain. Then, each frame of the noisy signal is classified as speech or silence based on a predefined threshold on the segmental SNR. Using these classification results, a corresponding mask function is developed for the noisy speech after noise subtraction. Finally, exploiting the increased sparsity of the speech signal in the modulation domain, the orthogonal matching pursuit (OMP) technique is applied to the speech frames to improve speech quality and intelligibility. The effectiveness of the proposed method is evaluated with three types of noise: white noise, pink noise, and HF channel noise. The results show that the proposed method outperforms some established baselines at lower SNRs (-5 to +5 dB).
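
The subtract, classify, and mask steps described in the abstract can be illustrated with a minimal sketch. This is not the authors' IMModSSub implementation: it works on a generic magnitude spectrogram rather than the STM domain, it omits the frame iteration and the OMP stage, and the function names, the spectral floor, the 3 dB threshold, and the silence gain are all assumptions chosen for illustration.

```python
import numpy as np

def spectral_subtract(noisy_mag, noise_mag, floor=0.01):
    """Subtract a noise magnitude estimate from every frame.

    noisy_mag: array of shape (frames, bins); noise_mag: array of shape (bins,).
    The floor (an assumed value) keeps magnitudes from going negative.
    """
    cleaned = noisy_mag - noise_mag
    return np.maximum(cleaned, floor * noisy_mag)

def segmental_snr_db(noisy_mag, noise_mag, eps=1e-12):
    """Rough per-frame SNR in dB: frame power over noise-estimate power."""
    noisy_pow = np.sum(noisy_mag ** 2, axis=1)
    noise_pow = np.sum(noise_mag ** 2)
    return 10.0 * np.log10((noisy_pow + eps) / (noise_pow + eps))

def classify_frames(noisy_mag, noise_mag, threshold_db=3.0):
    """Label a frame as speech (True) when its SNR exceeds the threshold.

    The 3 dB threshold is a hypothetical value, not taken from the paper.
    """
    return segmental_snr_db(noisy_mag, noise_mag) > threshold_db

def apply_frame_mask(enhanced_mag, is_speech, silence_gain=0.1):
    """Keep speech frames unchanged and attenuate silence frames."""
    gains = np.where(is_speech, 1.0, silence_gain)
    return enhanced_mag * gains[:, None]

if __name__ == "__main__":
    noise = np.ones(8)                                  # flat noise estimate
    noisy = np.vstack([np.ones((2, 8)),                 # two noise-only frames
                       5.0 * np.ones((2, 8))])          # two louder frames
    speech_flags = classify_frames(noisy, noise)
    enhanced = apply_frame_mask(spectral_subtract(noisy, noise), speech_flags)
    print(speech_flags)
    print(enhanced[:, 0])
```

In this sketch the mask is a simple per-frame gain. The abstract only states that a mask function is derived from the speech/silence classification, so the actual form used in the paper may differ.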


Pages: 100-115

Page count: 16
