DNN-BASED SPEECH ENHANCEMENT USING MBE MODEL

Cited by: 0
Authors
Huang, Qizheng [1]
Bao, Changchun [1]
Wang, Xianyun [1]
Xiang, Yang [1]
Affiliations
[1] Beijing University of Technology, Speech and Audio Signal Processing Laboratory, Faculty of Information Technology, Beijing 100124, People's Republic of China
Source
2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC) | 2018
Funding
National Natural Science Foundation of China
Keywords
Speech enhancement; MBE model; DNN; acoustic features; analysis-with-synthesis; noise estimation
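DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract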
This paper presents a novel deep neural network (DNN) based speech enhancement method that uses the multi-band excitation (MBE) model. The proposed system has two stages: a training stage and an enhancement stage. In the training stage, two DNNs with different targets are trained: one on the harmonic magnitudes of clean speech and one on the band difference function of clean speech. Both DNNs take the log-power spectra (LPS) of noisy speech as input. In the enhancement stage, the enhanced speech is obtained by MBE speech synthesis from the DNN outputs and a pitch period estimated online. With the proposed method, the parameters of the MBE model can be estimated accurately, so that high-quality enhanced speech is synthesized while the noise between the harmonics is effectively eliminated. Experiments show that the proposed method outperforms the reference methods in both speech quality and intelligibility.
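The abstract names log-power spectra (LPS) of noisy speech as the input feature for both DNNs. The following is a minimal sketch of how such a feature could be computed, not the authors' implementation: the function name `log_power_spectra`, the 512-sample frame, the 256-sample hop, the Hann window, the natural logarithm, and the `eps` floor are all assumptions made for illustration, since this record does not state the paper's settings.

```python
import numpy as np


def log_power_spectra(signal, frame_len=512, hop=256, eps=1e-12):
    """Return log-power spectra of a 1-D signal, one row per frame.

    frame_len, hop, the Hann window and the eps floor are illustrative
    assumptions; the paper's actual analysis settings are not given here.
    """
    signal = np.asarray(signal, dtype=np.float64)
    if signal.size < frame_len:
        # Zero-pad short inputs so that at least one full frame exists.
        signal = np.pad(signal, (0, frame_len - signal.size))
    n_frames = 1 + (signal.size - frame_len) // hop
    window = np.hanning(frame_len)
    # Index matrix: row i selects the samples belonging to frame i.
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * window
    # One-sided spectrum of each windowed frame.
    spectrum = np.fft.rfft(frames, axis=1)
    power = np.abs(spectrum) ** 2
    # eps keeps the logarithm finite for silent bins.
    return np.log(power + eps)


if __name__ == "__main__":
    # Synthetic "noisy speech": a 200 Hz tone plus white noise at 16 kHz.
    rng = np.random.default_rng(0)
    fs = 16000
    t = np.arange(fs) / fs
    noisy = np.sin(2 * np.pi * 200 * t) + 0.1 * rng.standard_normal(fs)
    lps = log_power_spectra(noisy)
    print(lps.shape)
```

Each row of the returned array is one frame's feature vector. In a setup like the one the abstract describes, such vectors would be fed to the two networks that predict the harmonic magnitudes and the band difference function.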
Pages: 196-200
Page count: 5