SUPER-WIDEBAND BANDWIDTH EXTENSION FOR SPEECH IN THE 3GPP EVS CODEC

被引:0
作者
Atti, Venkatraman [1 ]
Krishnan, Venkatesh [1 ]
Dewasurendra, Duminda [1 ]
Chebiyyam, Venkata [1 ]
Subasingha, Shaminda [1 ]
Sinder, Daniel J. [1 ]
Rajendran, Vivek [1 ]
Varga, Imre [1 ]
Gibbs, Jon [2 ]
Miao, Lei [2 ]
Grancharov, Volodya [3 ]
Pobloth, Harald [3 ]
机构
[1] Qualcomm Technol Inc, San Diego, CA 92121 USA
[2] Huawei Technol Co Ltd, Shenzhen, Peoples R China
[3] Ericsson AB, Stockholm, Sweden
来源
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP) | 2015年
关键词
3GPP EVS codec; low bitrate bandwidth extension; super-wideband; harmonic nonlinear extension; temporal envelope modulation; AUDIO CODING STANDARD;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper describes the time-domain bandwidth extension (TBE) framework employed to code wideband and super-wideband speech in the newly standardized 3GPP EVS codec. The TBE algorithm uses a nonlinear harmonic modeling technique that incorporates principles of time-domain envelope-modulated noise mixing. At 13.2 kbps, the super-wideband coding of speech uses as low as 1.55 kbps for encoding the spectral content from 6.4-14.4 kHz. Subjective evaluation results from ITU-T P.800 Mean Opinion Score (MOS) tests are provided, showing significantly improved quality compared to the other standardized SWB codecs under both clean speech and speech with background noise.
引用
收藏
页码:5927 / 5931
页数:5
相关论文
共 21 条
[1]  
3GPP, 2014, 26445 3GPP TS
[2]  
[Anonymous], 2010, CS0014D 3GPP2
[3]  
[Anonymous], 1996, P800 ITUT P
[4]  
[Anonymous], AUD ENG SOC CONV 112
[5]  
[Anonymous], 2007, Audio Signal Processing and Coding
[6]  
Atti V., 2015, IEEE ICASSP UNPUB
[7]   The Adaptive Multirate Wideband speech codec (AMR-WB) [J].
Bessette, B ;
Salami, R ;
Lefebvre, R ;
Jelínek, M ;
Rotola-Pukkila, J ;
Vainio, J ;
Mikkola, H ;
Järvinen, K .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (08) :620-636
[8]  
Bruhn S., 2015, IEEE ICASSP UNPUB
[9]  
Chu W.C., 2003, SPEECH CODING ALGORI
[10]  
Dietz M., 2015, IEEE ICASSP UNPUB