NOISE-ADAPTIVE DEEP NEURAL NETWORK FOR SINGLE-CHANNEL SPEECH ENHANCEMENT

Citations: 0

Authors:

Chung, Hanwook [1]

Kim, Taesup [2]

Plourde, Eric [3]

Champagne, Benoit [1]

Affiliations:

[1] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ, Canada

[2] Univ Montreal, MILA, Montreal, PQ, Canada

[3] Sherbrooke Univ, Dept Elect & Comp Engn, Sherbrooke, PQ, Canada

Source:

2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP) | 2018

Funding:

Natural Sciences and Engineering Research Council of Canada;

Keywords:

Single-channel speech enhancement; deep neural network; classification; RECOGNITION; NMF;

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We introduce a noise-adaptive feed-forward deep neural network (DNN) for single-channel speech enhancement. The goal is to better exploit individual noise characteristics while training a spectral mapping DNN. To this end, we employ noise-dependent adaptation vectors, which are obtained based on the output of an auxiliary noise classification DNN, to adjust the weights and biases of the spectral mapping DNN. The parameters of the spectral mapping DNN, noise classification DNN and adaptation vectors are estimated jointly during the training stage. During the enhancement stage, we combine a classical unsupervised speech enhancement algorithm with the proposed DNN-based approach to further improve the enhanced speech quality. Experiments show that the proposed method provides better enhancement performance than the selected benchmark algorithms.
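The adaptation idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact form of the adaptation, so everything specific here is an assumption: the classifier posteriors are used to mix per-class vectors, and the mixed vectors scale and shift one hidden layer (which is equivalent to rescaling the rows of the weight matrix and adjusting the bias). The names `adapted_layer`, `scale_vecs`, and `shift_vecs` are hypothetical, and the joint training and the combination with an unsupervised enhancer are not shown.

```python
import math
import random


def softmax(z):
    """Turn classifier logits into noise-class posteriors."""
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]


def affine(W, b, x):
    """Compute W x + b for a weight matrix given as a list of rows."""
    return [sum(w * xj for w, xj in zip(row, x)) + bi for row, bi in zip(W, b)]


def relu(v):
    return [max(0.0, a) for a in v]


def adapted_layer(W, b, x, posteriors, scale_vecs, shift_vecs):
    """One hidden layer adjusted by noise-dependent vectors (assumed form).

    scale_vecs[k] and shift_vecs[k] are the hypothetical adaptation vectors
    for noise class k; they are mixed using the classifier posteriors.
    """
    n_out = len(b)
    scale = [sum(p * s[i] for p, s in zip(posteriors, scale_vecs)) for i in range(n_out)]
    shift = [sum(p * t[i] for p, t in zip(posteriors, shift_vecs)) for i in range(n_out)]
    pre = affine(W, b, x)
    # Same as using weights scale[i] * W[i] and bias scale[i] * b[i] + shift[i].
    return relu([s * a + t for s, a, t in zip(scale, pre, shift)])


if __name__ == "__main__":
    rng = random.Random(0)
    n_in, n_hidden, n_classes = 4, 3, 2  # toy sizes, chosen for illustration
    W = [[rng.uniform(-1, 1) for _ in range(n_in)] for _ in range(n_hidden)]
    b = [0.0] * n_hidden
    x = [rng.uniform(0, 1) for _ in range(n_in)]  # stand-in for a noisy spectral frame
    posteriors = softmax([1.5, -0.5])  # stand-in for the noise classifier output
    scale_vecs = [[1.0] * n_hidden, [0.5] * n_hidden]
    shift_vecs = [[0.0] * n_hidden, [0.1] * n_hidden]
    print(adapted_layer(W, b, x, posteriors, scale_vecs, shift_vecs))
```

With all scale vectors set to one and all shift vectors set to zero, the layer reduces to an ordinary feed-forward layer, which makes the effect of the adaptation easy to isolate.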


Pages: 6
