Single-channel speech enhancement by subspace affinity minimization

被引：3

作者：

Tran, Dung N. ^{[1
]}

Koishida, Kazuhito ^{[1
]}

机构：

[1] Microsoft Corp, Redmond, WA 98052 USA

来源：

INTERSPEECH 2020 | 2020年

关键词：

speech enhancement; noise reduction; deep neural network; convolutional neural network; regression; subspace affinity;

D O I：

10.21437/Interspeech.2020-2982

中图分类号：

R36 [病理学]; R76 [耳鼻咽喉科学];

学科分类号：

100104 ; 100213 ;

摘要：

In data-driven speech enhancement frameworks, learning informative representations is crucial to obtain a high-quality estimate of the target speech. State-of-the-art speech enhancement methods based on deep neural networks (DNN) commonly learn a single embedding from the noisy input to predict clean speech. This compressed representation inevitably contains both noise and speech information leading to speech distortion and poor noise reduction performance. To alleviate this issue, we proposed to learn from the noisy input separate embeddings for speech and noise and introduced a subspace affinity loss function to prevent information leaking between the two representations. We rigorously proved that minimizing this loss function yields maximally uncorrelated speech and noise representations, which can block information leaking. We empirically showed that our proposed framework outperforms traditional and state-of-the-art speech enhancement methods in various unseen nonstationary noise environments. Our results suggest that learning uncorrelated speech and noise embeddings can improve noise reduction and reduces speech distortion in speech enhancement applications.

引用

页码：2447 / 2451

页数：5

共 50 条

[21] SINGLE-CHANNEL SPEECH ENHANCEMENT WITH SEQUENTIALLY TRAINED DNN SYSTEM
Sun, Yang
Xian, Yang
Wang, Wenwu
Naqvi, Syed Mohsen
2019 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2019,
[22] Deep Learning Models for Single-Channel Speech Enhancement on Drones
Mukhutdinov, Dmitrii
Alex, Ashish
Cavallaro, Andrea
Wang, Lin
IEEE ACCESS, 2023, 11 : 22993 - 23007
[23] Modified Amplitude Spectral Estimator for Single-Channel Speech Enhancement
Zhai, Zhenhui
Ou, Shifeng
Gao, Ying
PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN MECHANICAL ENGINEERING AND INDUSTRIAL INFORMATICS (AMEII 2016), 2016, 73 : 1115 - 1120
[24] GAUSSIAN DENSITY GUIDED DEEP NEURAL NETWORK FOR SINGLE-CHANNEL SPEECH ENHANCEMENT
Chai, Li
Du, Jun
Wang, Yan-nan
2017 IEEE 27TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2017,
[25] Single-channel multiple regression for in-car speech enhancement
Li, WF
Itou, K
Takeda, K
Itakura, F
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (03) : 1032 - 1039
[26] A SPECTRAL CONVERSION BASED SINGLE-CHANNEL SINGLE-MICROPHONE SPEECH ENHANCEMENT
Huy-Khoi Do
Quang Vinh Thai
FOURTH INTERNATIONAL CONFERENCE ON COMPUTER AND ELECTRICAL ENGINEERING (ICCEE 2011), 2011, : 583 - +
[27] A COMPUTATIONALLY-EFFICIENT SINGLE-CHANNEL SPEECH ENHANCEMENT ALGORITHM FOR MONAURAL HEARING AIDS
Ayllon, David
Gil-Pita, Roberto
Utrilla-Manso, Manuel
Rosa-Zurera, Manuel
2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 2050 - 2054
[28] Glance and gaze: A collaborative learning framework for single-channel speech enhancement
Li, Andong
Zheng, Chengshi
Zhang, Lu
Li, Xiaodong
APPLIED ACOUSTICS, 2022, 187
[29] Phase Estimation in Single-Channel Speech Enhancement: Limits-Potential
Mowlaee, Pejman
Kulmer, Josef
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (08) : 1283 - 1294
[30] A Single-channel Speech Enhancement Approach Based on Perceptual Masking Deep Neural Network
Han W.
Zhang X.-W.
Min G.
Zhang Q.-Y.
Zhang, Xiong-Wei (xwzhang9898@163.com), 2017, Science Press (43): : 248 - 258

← 1 2 3 4 5 →