Unsupervised Speech Denoising Method based on Deep Neural Network

Cited by: 0

Authors:

Chen, Xiaohan [1]

Affiliations:

[1] SUNY Stony Brook, Coll Engn & Appl Sci, Stony Brook, NY 11794 USA

Source:

2018 11TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2 | 2018

Keywords:

Unsupervised Speech Denoising Method; Deep Neural Network; speech feature extraction; RECOGNITION;

DOI:

10.1109/ISCID.2018.10159

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

To address the problem of speech feature extraction in noisy environments, this paper proposes an unsupervised speech denoising method based on a deep neural network. The deep neural network is used to estimate the noisy speech, and the speech signal received in real time is transformed into the S-domain. The power spectrum characteristics are analyzed, and a restricted Boltzmann machine is used for unsupervised training and tuning on the real-time speech information. The model trained in the offline stage is then decoded to obtain an estimate of the log-power spectrum of the speech. Finally, a speech waveform file for subjective listening is synthesized using the phase of the noisy speech, which improves noise robustness in speech feature extraction.
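The final step of the abstract, combining an estimated log-power spectrum with the phase of the noisy speech to synthesize a waveform, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the function names, the frame length and hop size, and the use of an ordinary short-time Fourier transform in place of the S-domain transform the abstract mentions. The deep neural network and restricted Boltzmann machine that would produce the estimate are not reproduced; the usage example simply feeds back the log-power spectrum of the noisy signal as a stand-in.

```python
import numpy as np


def stft(x, frame_len=512, hop=128):
    """Short-time Fourier transform using a periodic Hann window."""
    window = np.hanning(frame_len + 1)[:-1]
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack(
        [x[i * hop:i * hop + frame_len] * window for i in range(n_frames)]
    )
    return np.fft.rfft(frames, axis=1)


def istft(spec, frame_len=512, hop=128):
    """Weighted overlap-add inverse of stft()."""
    window = np.hanning(frame_len + 1)[:-1]
    n_frames = spec.shape[0]
    out = np.zeros(frame_len + hop * (n_frames - 1))
    norm = np.zeros_like(out)
    frames = np.fft.irfft(spec, n=frame_len, axis=1)
    for i in range(n_frames):
        start = i * hop
        out[start:start + frame_len] += frames[i] * window
        norm[start:start + frame_len] += window ** 2
    # Guard against division by zero at the signal edges.
    return out / np.maximum(norm, 1e-8)


def resynthesize(noisy, est_log_power, frame_len=512, hop=128):
    """Combine an estimated log-power spectrum with the noisy phase.

    est_log_power is assumed to hold the natural log of the power
    spectrum, one row per frame, as some denoising model would output.
    """
    noisy_phase = np.angle(stft(noisy, frame_len, hop))
    magnitude = np.sqrt(np.exp(est_log_power))
    return istft(magnitude * np.exp(1j * noisy_phase), frame_len, hop)


# Usage with a stand-in estimator: the "estimate" is just the noisy
# log-power spectrum, so the output should reproduce the input signal.
rng = np.random.default_rng(0)
noisy = rng.standard_normal(16000)
log_power = np.log(np.abs(stft(noisy)) ** 2 + 1e-12)
waveform = resynthesize(noisy, log_power)
```

Reusing the noisy phase is a common simplification in spectral denoising: only the magnitude is estimated by the model, and the waveform is recovered by an inverse transform with overlap-add.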


Pages: 254-258

Page count: 5
