Cascaded RLS-LMS prediction in MPEG-4 Lossless Audio Coding

被引:36
作者
Huang, Haibin [1 ]
Franti, Pasi [2 ]
Huang, Dongyan [1 ]
Rahardja, Susanto [1 ]
机构
[1] Inst Infocomm Res, Singapore 119613, Singapore
[2] Univ Joensuu, Dept Comp Sci, FIN-80101 Joensuu, Finland
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2008年 / 16卷 / 03期
关键词
adaptive prediction; least mean square; lossless audio; MPEG-4 Audio Lossless Coding; recursive least square;
D O I
10.1109/TASL.2007.911675
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper describes the cascaded recursive least square-least mean square (RLS-LMS) prediction, which is part of the recently published MPEG-4 Audio Lossless Coding international standard. The predictor consists of cascaded stages of simple linear predictors, with the prediction error at the output of one stage passed to the next stage as the input signal. A linear combiner adds up the intermediate estimates at the output of each prediction stage to give a final estimate of the RLS-LMS predictor. In the RLS-LMS predictor, the first prediction stage is a simple first-order predictor with a fixed coefficient value 1. The second prediction stage uses the recursive least square algorithm to adaptively update the predictor coefficients. The subsequent prediction stages use the normalized least mean square algorithm to update the predictor coefficients. The coefficients of the linear combiner are then updated using the sign-sign least mean square algorithm. For stereo audio signals, the RLS-LMS predictor uses both intrachannel prediction and interchannel prediction, which results in a 3% improvement in compression ratio over using only the intrachannel prediction. Through extensive tests, the MPEG-4 Audio Lossless coder using the RLS-LMS predictor has demonstrated a compression ratio that is on par with the best lossless audio coders in the field. In this paper, the structure of the RLS-LMS predictor is described in detail, and the optimal predictor configuration is studied through various experiments.
引用
收藏
页码:554 / 562
页数:9
相关论文
共 20 条
[1]  
*ALS, 2006, 1449632005AMD ISOIEC
[2]  
Bosi M, 1997, J AUDIO ENG SOC, V45, P789
[3]   VARIABLE-LENGTH BINARY ENCODINGS [J].
GILBERT, EN ;
MOORE, EF .
BELL SYSTEM TECHNICAL JOURNAL, 1959, 38 (04) :933-967
[4]  
HAYKIN S, 1999, ADAPTIVE FILTER THEO
[5]  
HUANG H, 2006, INT CONF ACOUST SPEE, P181
[6]  
*ISO IEC JTCI, 2002, ISOIECJTC1SC29WG11
[7]  
LIEBCHEN T, 2005, P 119 AES CONV NEW Y
[8]  
Rabiner L.R., 2010, Digital Processing of Speech Signals
[9]  
Reznik YA, 2004, 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS, P1024
[10]  
RICE RF, 1979, 7922 JPL