THE CONTEXT-TREE WEIGHTING METHOD - BASIC PROPERTIES

被引:518
作者
WILLEMS, FMJ
SHTARKOV, YM
TJALKENS, TJ
机构
[1] Eindhoven University of Technology, Electrical Engineering Department
[2] Eindhoven University of Technology, Electrical, Engineering Department
[3] Institute for Problems of Information Transmission, 101447, Moscow
关键词
SEQUENTIAL DATA COMPRESSION; UNIVERSAL SOURCE CODING; TREE SOURCES; MODELING PROCEDURE; CONTEXT TREE; ARITHMETIC CODING; CUMULATIVE REDUNDANCY BOUNDS;
D O I
10.1109/18.382012
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We describe a sequential universal data compression procedure for binary tree sources that performs the ''double mixture.'' Using a context tree, this method weights in an efficient recursive way the coding distributions corresponding to all bounded memory tree sources, and achieves a desirable coding distribution for tree sources with an unknown model and unknown parameters, Computational and storage complexity of the proposed procedure are both linear in the source sequence length, We derive a natural upper bound on the cumulative redundancy of our method for individual sequences. The three terms in this bound can be identified as coding, parameter, and model redundancy, The bound holds for all source sequence lengths, not only for asymptotically large lengths, The analysis that leads to this bound is based on standard techniques and turns out to be extremely simple. Our upper bound on the redundancy shows that the proposed context-tree weighting procedure is optimal in the sense that it achieves the Rissanen (1984) lower bound.
引用
收藏
页码:653 / 664
页数:12
相关论文
共 25 条
[1]  
Abramson N., Information Theory and Coding, pp. 61-62, (1963)
[2]  
Cover T.M., Enumerative source encoding, IEEE Trans. Inform. Theory, IT-19, pp. 73-77, (1973)
[3]  
Cover T.M., Thomas J.A., Elements of Information Theory, (1991)
[4]  
Jelinek F., Probabilistic Information Theory, pp. 476-489, (1968)
[5]  
Krichevsky R.E., Trofimov V.K., The performance of universal encoding, IEEE Trans. Inform. Theory, IT-27, pp. 199-207, (1981)
[6]  
Pasco R., Source coding algorithms for fast data compression, (1976)
[7]  
Rissanen J., Generalized Kraft inequality and arithmetic coding, IBM J. Res. Devel., 20, (1976)
[8]  
Rissanen J., A universal data compression system, IEEE Trans. Inform. Theory, IT-29, pp. 656-664, (1983)
[9]  
Rissanen J., Universal coding, information, prediction, and estimation, IEEE Trans. Inform. Theory, IT-30, pp. 629-636, (1984)
[10]  
Rissanen J., Complexity of strings in the class of Markov sources, IEEE Trans. Inform. Theory, IT-32, pp. 526-532, (1986)