Discriminant analysis with Gaussian graphical tree models

Cited by: 2
Authors
Perez-de-la-Cruz, Gonzalo [1 ]
Eslava-Gomez, Guillermina [2 ]
Affiliations
[1] Univ Nacl Autonoma Mexico, Fac Sci, Dept Math, Grad Studies Math, Circuito Exterior, Mexico City 04510, DF, Mexico
[2] Univ Nacl Autonoma Mexico, Fac Sci, Dept Math, Circuito Exterior, Mexico City 04510, DF, Mexico
Keywords
Discriminant analysis; Error rates; Gaussian graphical tree models; Maximum likelihood estimation; Minimum weight spanning tree; Structure estimation; DISTRIBUTIONS;
DOI
10.1007/s10182-015-0256-6
Chinese Library Classification
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Discipline classification codes
020208 ; 070103 ; 0714 ;
Abstract
We consider Gaussian graphical tree models in discriminant analysis for two populations. Both the parameters and the structure of the graph are assumed to be unknown. For the estimation of the parameters maximum likelihood is used, and for the estimation of the structure of the tree graph we propose three methods; in these, the function to be optimized is the J-divergence for one and the empirical log-likelihood ratio for the two others. The main contribution of this paper is the introduction of these three computationally efficient methods. We show that the optimization problem of each proposed method is equivalent to one of finding a minimum weight spanning tree, which can be solved efficiently even if the number of variables is large. This property together with the existence of the maximum likelihood estimators for small group sample sizes is the main advantage of the proposed methods. A numerical comparison of the classification performance of discriminant analysis using these methods, as well as three other existing ones, is presented. This comparison is based on the estimated error rates of the corresponding plug-in allocation rules obtained from real and simulated data. Diagonal discriminant analysis is considered as a benchmark, as well as quadratic and linear discriminant analysis whenever the sample size is sufficient. The results show that discriminant analysis with Gaussian tree models, using these methods for selecting the graph structure, is competitive with diagonal discriminant analysis in high-dimensional settings.
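The reduction to a minimum weight spanning tree mentioned in the abstract can be illustrated with a small sketch. The paper's own edge weights (built from the J-divergence or the empirical log-likelihood ratio) are not stated in this record, so the example substitutes the classical Chow-Liu weight for Gaussian variables, minus the pairwise mutual information, purely as an assumption. The function names `chow_liu_weights` and `min_weight_spanning_tree` and the toy correlation matrix are hypothetical, and every off-diagonal correlation is assumed to satisfy |r| < 1.

```python
import math


def chow_liu_weights(corr):
    """Assumed edge weights: w_ij = 0.5 * log(1 - r_ij^2).

    This is minus the mutual information of a bivariate Gaussian pair,
    so a minimum weight tree maximizes the total mutual information.
    It stands in for the paper's weights, which are not given here.
    """
    p = len(corr)
    return {
        (i, j): 0.5 * math.log(1.0 - corr[i][j] ** 2)
        for i in range(p)
        for j in range(i + 1, p)
    }


def min_weight_spanning_tree(p, weights):
    """Kruskal's algorithm with a union-find structure over p nodes."""
    parent = list(range(p))

    def find(x):
        # Follow parent links to the root, halving the path on the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    tree = []
    # Scan edges from lightest to heaviest; keep those joining two components.
    for (i, j), _w in sorted(weights.items(), key=lambda kv: kv[1]):
        root_i, root_j = find(i), find(j)
        if root_i != root_j:
            parent[root_i] = root_j
            tree.append((i, j))
            if len(tree) == p - 1:
                break
    return tree


# Toy correlation matrix for three variables (illustrative values only).
corr = [
    [1.0, 0.8, 0.3],
    [0.8, 1.0, 0.5],
    [0.3, 0.5, 1.0],
]
edges = min_weight_spanning_tree(3, chow_liu_weights(corr))
print(edges)
```

Sorting the p(p-1)/2 candidate edges dominates the cost, which is why a search of this kind stays feasible when the number of variables is large.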
Pages: 161-187
Number of pages: 27