Single-Mode-Based Unified Speech and Audio Coding by Extending the Linear Prediction Domain Coding Mode

被引:5
作者
Beack, Seungkwon [1 ]
Seong, Jongmo [1 ]
Lee, Misuk [1 ]
Lee, Taejin [1 ]
机构
[1] ETRI, Broadcasting & Media Res Lab, Daejeon, South Korea
关键词
USAC; HE-AACv2; AMR-WB; STANDARD;
D O I
10.4218/etrij.17.0116.0397
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Unified speech and audio coding (USAC) is one of the latest coding technologies. It is based on a switchable coding structure, and has demonstrated the highest levels of performance for both speech and music contents. In this paper, we propose an extended version of USAC with a single-mode of operation-which does not require a switching system-by extending the linear prediction-coding mode. The main concept of this extension is the adoption of the advantages of frequency-domain coding schemes, such as windowing and transition control. Subjective test results indicate that the proposed scheme covers speech, music, and mixed streams with adequate levels of performance. The obtained quality levels are comparable with those of USAC.
引用
收藏
页码:310 / 318
页数:9
相关论文
共 14 条
[1]  
*3GPP, 2002, 26171 3GPP TS
[2]  
Aarts RM, 1999, J AUDIO ENG SOC, V47, P720
[3]  
[Anonymous], 2011, 2011N12232 ISOIEC JT
[4]  
[Anonymous], 2008, N9638 ISOIEC SC29 WG
[5]   The Adaptive Multirate Wideband speech codec (AMR-WB) [J].
Bessette, B ;
Salami, R ;
Lefebvre, R ;
Jelínek, M ;
Rotola-Pukkila, J ;
Vainio, J ;
Mikkola, H ;
Järvinen, K .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (08) :620-636
[6]   ADVANCES IN SPEECH AND AUDIO COMPRESSION [J].
GERSHO, A .
PROCEEDINGS OF THE IEEE, 1994, 82 (06) :900-918
[7]  
International Telecommunication Union, 2001, METH SUBJ ASS INT SO
[8]  
ISO/IEC, 2012, International Standard 23003-3:2012
[9]  
Jayme GAB, 2006, J AUDIO ENG SOC, V54, P571