Neural network-based artificial bandwidth expansion of speech

Cited by: 40
Authors
Kontio, Juho [1]
Laaksonen, Laura
Alku, Paavo
Affiliations
[1] Bugbear Entertainment, Helsinki 00510, Finland
[2] Nokia Res Ctr, Nokia Grp, FI-00045 Helsinki, Finland
[3] Aalto Univ, TKK, Lab Acoust & Audio Signal Proc, FI-02015 Espoo, Finland
Source
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2007 / Vol. 15 / No. 03
Keywords
artificial bandwidth expansion; genetic algorithm; neural network; neuroevolution; speech processing;
DOI
10.1109/TASL.2006.885934
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
The limited bandwidth of 0.3-3.4 kHz in current telephone systems reduces both the quality and the intelligibility of speech. Artificial bandwidth expansion is a method that expands the bandwidth of the narrowband speech signal at the receiving end of the transmission link by adding new frequency components to the higher frequencies, i.e., up to 8 kHz. In this paper, a new method for artificial bandwidth expansion, termed Neuroevolution Artificial Bandwidth Expansion (NEABE), is proposed. The method uses spectral folding to create the initial spectral components above the telephone band. The spectral envelope is then shaped in the frequency domain, based on a set of parameters given by a neural network. Subjective listening tests were used to evaluate the performance of the proposed algorithm, and the results showed that NEABE speech was preferred over narrowband speech in about 80% of the test cases.
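The two signal-processing steps named in the abstract, spectral folding followed by frequency-domain shaping of the new band, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the narrowband signal is sampled at 8 kHz, realizes folding as zero insertion (which mirrors the 0-4 kHz spectrum into 4-8 kHz), and replaces the neural-network output with a hypothetical list of per-subband gains. The function names `spectral_fold` and `shape_highband` and the equal-width subband layout are likewise assumptions made for illustration.

```python
import numpy as np


def spectral_fold(narrowband):
    """Upsample by 2 via zero insertion.

    Inserting a zero after every sample doubles the sampling rate and
    mirrors the baseband spectrum about the old Nyquist frequency, so
    content at f also appears at (fs_old - f).
    """
    narrowband = np.asarray(narrowband, dtype=float)
    wide = np.zeros(2 * len(narrowband), dtype=float)
    wide[::2] = narrowband
    return wide


def shape_highband(wide, fs_wide, gains, cutoff_hz=4000.0):
    """Scale the folded band with piecewise-constant subband gains.

    `gains` stands in for the parameters a neural network would supply
    (hypothetical here); the band from `cutoff_hz` to fs_wide / 2 is
    split into len(gains) equal-width subbands.
    """
    gains = np.asarray(gains, dtype=float)
    spectrum = np.fft.rfft(wide)
    freqs = np.fft.rfftfreq(len(wide), d=1.0 / fs_wide)

    high = freqs >= cutoff_hz
    edges = np.linspace(cutoff_hz, fs_wide / 2.0, len(gains) + 1)
    # Map each high-band bin to the index of the subband containing it.
    band_idx = np.searchsorted(edges, freqs[high], side="right") - 1
    band_idx = np.clip(band_idx, 0, len(gains) - 1)

    spectrum[high] *= gains[band_idx]
    return np.fft.irfft(spectrum, n=len(wide))
```

For example, folding a 1 kHz tone sampled at 8 kHz yields a 16 kHz signal with components at 1 kHz and 7 kHz; calling `shape_highband` with small gains then attenuates only the mirrored 7 kHz component and leaves the original telephone band untouched.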
Pages: 873-881
Page count: 9