Single-channel speech enhancement by subspace affinity minimization

Cited by: 3
Authors
Tran, Dung N. [1]
Koishida, Kazuhito [1]
Affiliations
[1] Microsoft Corp, Redmond, WA 98052 USA
Source
INTERSPEECH 2020 | 2020
Keywords
speech enhancement; noise reduction; deep neural network; convolutional neural network; regression; subspace affinity
DOI
10.21437/Interspeech.2020-2982
Chinese Library Classification (CLC) number
R36 [Pathology]; R76 [Otorhinolaryngology]
Discipline classification code
100104; 100213
Abstract
In data-driven speech enhancement frameworks, learning informative representations is crucial to obtaining a high-quality estimate of the target speech. State-of-the-art speech enhancement methods based on deep neural networks (DNNs) commonly learn a single embedding from the noisy input to predict clean speech. This compressed representation inevitably contains both noise and speech information, leading to speech distortion and poor noise reduction performance. To alleviate this issue, we proposed to learn separate embeddings for speech and noise from the noisy input and introduced a subspace affinity loss function to prevent information leaking between the two representations. We rigorously proved that minimizing this loss function yields maximally uncorrelated speech and noise representations, which can block information leaking. We empirically showed that our proposed framework outperforms traditional and state-of-the-art speech enhancement methods in various unseen nonstationary noise environments. Our results suggest that learning uncorrelated speech and noise embeddings can improve noise reduction and reduce speech distortion in speech enhancement applications.
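The abstract does not state the formula for the subspace affinity loss, so the following is only an illustrative sketch under a stated assumption: it uses the common principal-angle definition of affinity between two subspaces, namely the squared Frobenius norm of U^T V for orthonormal bases U and V, divided by the smaller subspace dimension. The function names `orthonormal_basis` and `subspace_affinity` are hypothetical, and the paper's actual loss may differ.

```python
import math


def orthonormal_basis(vectors, tol=1e-12):
    """Return an orthonormal basis for the span of `vectors` (modified Gram-Schmidt)."""
    basis = []
    for v in vectors:
        w = list(v)
        for b in basis:
            # Remove the component of w that lies along the basis vector b.
            proj = sum(wi * bi for wi, bi in zip(w, b))
            w = [wi - proj * bi for wi, bi in zip(w, b)]
        norm = math.sqrt(sum(wi * wi for wi in w))
        if norm > tol:  # skip vectors that are (numerically) linearly dependent
            basis.append([wi / norm for wi in w])
    return basis


def subspace_affinity(speech_vectors, noise_vectors):
    """Squared, normalized affinity between two spans (assumed definition).

    Computes ||U^T V||_F^2 / min(dim U, dim V), which lies in [0, 1]:
    0 when the subspaces are orthogonal, 1 when one contains the other.
    """
    u = orthonormal_basis(speech_vectors)
    v = orthonormal_basis(noise_vectors)
    if not u or not v:
        raise ValueError("each input must span at least one dimension")
    total = sum(
        sum(a * b for a, b in zip(ui, vj)) ** 2
        for ui in u
        for vj in v
    )
    return total / min(len(u), len(v))


if __name__ == "__main__":
    speech = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]  # toy "speech" embedding vectors
    noise = [[0.0, 0.0, 1.0]]                    # toy "noise" embedding vector
    print(subspace_affinity(speech, noise))
```

Under this assumed definition, driving the value toward zero pushes the two embedding subspaces toward orthogonality, which matches the abstract's goal of uncorrelated speech and noise representations. A trainable version would need a differentiable implementation in a deep learning framework.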
Pages: 2447 - 2451
Page count: 5
Related papers
50 in total
  • [21] SINGLE-CHANNEL SPEECH ENHANCEMENT WITH SEQUENTIALLY TRAINED DNN SYSTEM
    Sun, Yang
    Xian, Yang
    Wang, Wenwu
    Naqvi, Syed Mohsen
    2019 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2019,
  • [22] Deep Learning Models for Single-Channel Speech Enhancement on Drones
    Mukhutdinov, Dmitrii
    Alex, Ashish
    Cavallaro, Andrea
    Wang, Lin
    IEEE ACCESS, 2023, 11 : 22993 - 23007
  • [23] Modified Amplitude Spectral Estimator for Single-Channel Speech Enhancement
    Zhai, Zhenhui
    Ou, Shifeng
    Gao, Ying
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN MECHANICAL ENGINEERING AND INDUSTRIAL INFORMATICS (AMEII 2016), 2016, 73 : 1115 - 1120
  • [24] GAUSSIAN DENSITY GUIDED DEEP NEURAL NETWORK FOR SINGLE-CHANNEL SPEECH ENHANCEMENT
    Chai, Li
    Du, Jun
    Wang, Yan-nan
    2017 IEEE 27TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2017,
  • [25] Single-channel multiple regression for in-car speech enhancement
    Li, WF
    Itou, K
    Takeda, K
    Itakura, F
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (03) : 1032 - 1039
  • [26] A SPECTRAL CONVERSION BASED SINGLE-CHANNEL SINGLE-MICROPHONE SPEECH ENHANCEMENT
    Huy-Khoi Do
    Quang Vinh Thai
    FOURTH INTERNATIONAL CONFERENCE ON COMPUTER AND ELECTRICAL ENGINEERING (ICCEE 2011), 2011, : 583 - +
  • [27] A COMPUTATIONALLY-EFFICIENT SINGLE-CHANNEL SPEECH ENHANCEMENT ALGORITHM FOR MONAURAL HEARING AIDS
    Ayllon, David
    Gil-Pita, Roberto
    Utrilla-Manso, Manuel
    Rosa-Zurera, Manuel
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 2050 - 2054
  • [28] Glance and gaze: A collaborative learning framework for single-channel speech enhancement
    Li, Andong
    Zheng, Chengshi
    Zhang, Lu
    Li, Xiaodong
    APPLIED ACOUSTICS, 2022, 187
  • [29] Phase Estimation in Single-Channel Speech Enhancement: Limits-Potential
    Mowlaee, Pejman
    Kulmer, Josef
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (08) : 1283 - 1294
  • [30] A Single-channel Speech Enhancement Approach Based on Perceptual Masking Deep Neural Network
    Han W.
    Zhang X.-W.
    Min G.
    Zhang Q.-Y.
    Zhang, Xiong-Wei (xwzhang9898@163.com), 2017, Science Press (43): 248 - 258