Vector quantization using information theoretic concepts

Cited by: 33
Authors
Lehn-Schiøler T. [1 ]
Hegde A. [2 ]
Erdogmus D. [2 ]
Principe J.C. [2 ]
Affiliations
[1] Intell. Signal Proc. Info./Math. Mod, Technical University of Denmark
[2] Computational NeuroEngineering Lab., Electrical/Computer Eng. Department, University of Florida, Gainesville
Funding
National Science Foundation (USA)
Keywords
Information particles; Information theoretic learning; Parzen density estimate; Self-organizing map; Vector-quantization;
DOI
10.1007/s11047-004-9619-8
Abstract
The process of representing a large data set with a smaller number of vectors in the best possible way, also known as vector quantization, has been studied intensively in recent years. Very efficient algorithms such as the Kohonen self-organizing map (SOM) and the Linde-Buzo-Gray (LBG) algorithm have been devised. In this paper a physical approach to the problem is taken, and it is shown that by considering the processing elements as points moving in a potential field, an algorithm as efficient as those mentioned above can be derived. Unlike SOM and LBG, this algorithm has a clear physical interpretation and relies on minimization of a well-defined cost function. It is also shown how the potential field approach can be linked to information theory by use of the Parzen density estimator. In the light of information theory, it becomes clear that minimizing the free energy of the system is in fact equivalent to minimizing a divergence measure between the distribution of the data and the distribution of the processing elements; hence, the algorithm can be seen as a density matching method. © Springer 2005.
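The density matching idea in the abstract can be illustrated with a minimal sketch. The abstract does not name the divergence, the kernel, or the optimizer, so everything specific below is an assumption: Gaussian Parzen kernels with a fixed width `sigma`, the Cauchy-Schwarz divergence as the divergence measure, and plain gradient descent with a fixed step. The function names are hypothetical and not taken from the paper. Under these assumptions the cross term behaves like an attraction between data and processing elements, and the self term like a repulsion among the processing elements.

```python
import numpy as np


def gaussian_kernel_matrix(a, b, sigma):
    """Pairwise isotropic Gaussian kernel values between rows of a and rows of b."""
    dim = a.shape[1]
    sq_dist = ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=2)
    norm = (2.0 * np.pi * sigma**2) ** (-dim / 2.0)
    return norm * np.exp(-sq_dist / (2.0 * sigma**2))


def parzen_density(query, samples, sigma):
    """Parzen estimate of the density of `samples`, evaluated at each row of `query`."""
    return gaussian_kernel_matrix(query, samples, sigma).mean(axis=1)


def cs_divergence(data, pes, sigma):
    """Cauchy-Schwarz divergence between the Parzen estimates of data and PEs.

    Assumption: this particular divergence is chosen for illustration only.
    The integral of a product of two Gaussian kernels of width sigma is a
    Gaussian of width sqrt(2) * sigma evaluated at the pairwise differences.
    """
    s = np.sqrt(2.0) * sigma
    v_data = gaussian_kernel_matrix(data, data, s).mean()
    v_pes = gaussian_kernel_matrix(pes, pes, s).mean()
    v_cross = gaussian_kernel_matrix(data, pes, s).mean()
    return np.log(v_data) - 2.0 * np.log(v_cross) + np.log(v_pes)


def cs_gradient(data, pes, sigma):
    """Gradient of cs_divergence with respect to the PE positions."""
    s = np.sqrt(2.0) * sigma
    n, m = data.shape[0], pes.shape[0]
    k_cross = gaussian_kernel_matrix(pes, data, s)  # shape (m, n)
    k_self = gaussian_kernel_matrix(pes, pes, s)    # shape (m, m)
    # Row k holds sum_i K[k, i] * (x_i - w_k): the pull of the data on PE k.
    d_cross = (k_cross @ data - k_cross.sum(axis=1, keepdims=True) * pes) / (n * m * s**2)
    # Row k holds 2 * sum_l K[k, l] * (w_l - w_k): the interaction among PEs.
    d_self = 2.0 * (k_self @ pes - k_self.sum(axis=1, keepdims=True) * pes) / (m**2 * s**2)
    return -2.0 * d_cross / k_cross.mean() + d_self / k_self.mean()


def fit_processing_elements(data, n_pes, sigma, lr=0.1, steps=200, seed=0):
    """Move PEs by fixed-step gradient descent on the divergence (illustrative only)."""
    rng = np.random.default_rng(seed)
    # Assumption: initialise the PEs on randomly chosen data points.
    pes = data[rng.choice(len(data), size=n_pes, replace=False)]
    for _ in range(steps):
        pes = pes - lr * cs_gradient(data, pes, sigma)
    return pes
```

A call such as `fit_processing_elements(data, n_pes=16, sigma=0.5)` would return a `(16, d)` array of processing-element positions for a data array of shape `(N, d)`. The kernel width and step size here are placeholder values, and a fixed step does not guarantee that the divergence decreases at every iteration.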
Pages: 39-51
Number of pages: 12