Speech Magnitude Spectrum Reconstruction from MFCCs Using Deep Neural Network

被引:11
作者
Jiang Wenbin [1 ]
Liu Peilin [1 ]
Wen Fei [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Air Force Engn Univ, Air Control & Nav Inst, Xian 710000, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep neural network (DNN); Mel-frequency cepstral coefficients (MFCCs); Spectrum reconstruction; Speech reconstruction; REPRESENTATIONS; RECOGNITION;
D O I
10.1049/cje.2017.09.018
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This work proposes a Deep neural network (DNN) based method for reconstructing speech magnitude spectrum from Mel-frequency cepstral coefficients (MFCCs). We train a DNN using MFCC vectors as input and the corresponding speech magnitude spectrum as desired output. Exploiting the strong inference power of DNN, the proposed method has the capability to accurately estimate the speech magnitude spectrum even from truncated MFCC vectors. Experiments on TIMIT corpus demonstrate that the proposed method achieves significantly better performance compared with traditional methods.
引用
收藏
页码:393 / 398
页数:6
相关论文
共 25 条
[1]  
[Anonymous], 2013, COMPUT REV
[2]  
[Anonymous], 2010, P PYTH SCI C
[3]  
[Anonymous], 2005, 2122005 202 ETSI ES
[4]  
[Anonymous], 2001, REC ITU T P 862
[5]  
[Anonymous], 2014, P 2014 IEEE INT C MU
[6]  
[Anonymous], 2015, THCHS 30 FREE CHINES
[7]   Learning Deep Architectures for AI [J].
Bengio, Yoshua .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127
[8]   Low Bit-Rate Speech Coding Through Quantization of Mel-Frequency Cepstral Coefficients [J].
Boucheron, Laura E. ;
De Leon, Phillip L. ;
Sandoval, Steven .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (02) :610-619
[9]   COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES [J].
DAVIS, SB ;
MERMELSTEIN, P .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04) :357-366
[10]  
Garofolo J. S., 1998, GETTING STARTED DARP, P107