Duality of Maximum Entropy and Minimum Divergence

被引:11
作者
Eguchi, Shinto [1 ,2 ]
Komori, Osamu [1 ]
Ohara, Atsumi [3 ]
机构
[1] Inst Stat Math, Tachikawa Tokyo 1908562, Japan
[2] Grad Univ Adv Studies, Tachikawa Tokyo 1908562, Japan
[3] Univ Fukui, Dept Elect & Elect Engn, Fukui 9108507, Japan
基金
日本科学技术振兴机构;
关键词
beta-divergence; dual connections; information geometry; MaxEnt; multivariate t-distribution; power exponential family; sufficiency; INFORMATION GEOMETRY; ALPHA-BETA; ROBUST; DISTRIBUTIONS; COEFFICIENT; FAMILIES;
D O I
10.3390/e16073552
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
We discuss a special class of generalized divergence measures by the use of generator functions. Any divergence measure in the class is separated into the difference between cross and diagonal entropy. The diagonal entropy measure in the class associates with a model of maximum entropy distributions; the divergence measure leads to statistical estimation via minimization, for arbitrarily giving a statistical model. The dualistic relationship between the maximum entropy model and the minimum divergence estimation is explored in the framework of information geometry. The model of maximum entropy distributions is characterized to be totally geodesic with respect to the linear connection associated with the divergence. A natural extension for the classical theory for the maximum likelihood method under the maximum entropy model in terms of the Boltzmann-Gibbs-Shannon entropy is given. We discuss the duality in detail for Tsallis entropy as a typical example.
引用
收藏
页码:3552 / 3572
页数:21
相关论文
共 46 条
[1]   Information Geometry of Positive Measures and Positive-Definite Matrices: Decomposable Dually Flat Structure [J].
Amari, Shun-ichi .
ENTROPY, 2014, 16 (04) :2131-2145
[2]  
[Anonymous], 2006, SUGAKU EXPO
[3]  
[Anonymous], 1978, WILEY SERIES PROBABI
[4]  
[Anonymous], 1985, LECT NOTES STAT
[5]  
[Anonymous], 1997, ANN FACULTE SCI TOUL
[6]  
[Anonymous], 1991, STAT SIGNAL PROCESSI
[7]  
[Anonymous], 2009, SPRINGER
[8]   Robust and efficient estimation by minimising a density power divergence [J].
Basu, A ;
Harris, IR ;
Hjort, NL ;
Jones, MC .
BIOMETRIKA, 1998, 85 (03) :549-559
[9]  
Berger AL, 1996, COMPUT LINGUIST, V22, P39
[10]  
Bregman L. M., 1967, USSR Comput Math Math Phys, V7, P200, DOI [10.1016/0041-5553(67)90040-7, DOI 10.1016/0041-5553(67)90040-7]