Covariance Matrix Estimation with Multi-Regularization Parameters based on MDL Principle

被引:5
作者
Zhou, Xiuling [1 ,2 ]
Guo, Ping [1 ]
Chen, C. L. Philip [3 ]
机构
[1] Beijing Normal Univ, Lab Image Proc & Pattern Recognit, Beijing 100875, Peoples R China
[2] Beijing City Univ, Res Dept, Beijing, Peoples R China
[3] Univ Macau, Fac Sci & Technol, Macau, Peoples R China
基金
中国国家自然科学基金;
关键词
Gaussian classifier; Covariance matrix estimation; Multi-regularization parameters selection; Minimum description length Principle; CLASSIFICATION;
D O I
10.1007/s11063-012-9272-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Regularization is a solution for the problem of unstable estimation of covariance matrix with a small sample set in Gaussian classifier. In many applications such as image restoration, sparse representation, we have to deal with multi-regularization parameters problem. In this paper, the case of covariance matrix estimation with multi-regularization parameters is investigated, and an estimate method called as KLIM_L is derived theoretically based on Minimum Description Length (MDL) principle for the small sample size problem with high dimension setting. KLIM_L estimator can be regarded as a generalization of KLIM estimator in which local difference in each dimension is considered. Under the framework of MDL principle, a selection method of multi-regularization parameters is also developed based on the minimization of the Kullback-Leibler information measure, which is simply and directly estimated by point estimation under the approximation of two-order Taylor expansion. The computational cost to estimate multi-regularization parameters with KLIM_L method is less than those with RDA (Regularized Discriminant Analysis) and LOOC (leave-one-out covariance matrix estimate) in which cross validation technique is adopted. Experiments show that higher classification accuracy can be achieved by using the proposed KLIM_L estimator.
引用
收藏
页码:227 / 238
页数:12
相关论文
共 25 条
[1]   COMPARATIVE-ANALYSIS OF STATISTICAL PATTERN-RECOGNITION METHODS IN HIGH-DIMENSIONAL SETTINGS [J].
AEBERHARD, S ;
COOMANS, D ;
DEVEL, O .
PATTERN RECOGNITION, 1994, 27 (08) :1065-1077
[2]  
[Anonymous], 2013, Finite Mixture Distributions
[3]  
[Anonymous], PRINCIPAL COMPONENT
[4]   The minimum description length principle in coding and modeling [J].
Barron, A ;
Rissanen, J ;
Yu, B .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1998, 44 (06) :2743-2760
[5]   Regularized estimation of large covariance matrices [J].
Bickel, Peter J. ;
Levina, Elizaveta .
ANNALS OF STATISTICS, 2008, 36 (01) :199-227
[6]  
Bishop C.M., 2006, J ELECTRON IMAGING, V16, P049901, DOI DOI 10.1117/1.2819119
[7]   The Sparse Matrix Transform for Covariance Estimation and Analysis of High Dimensional Signals [J].
Cao, Guangzhi ;
Bachega, Leonardo R. ;
Bouman, Charles A. .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (03) :625-640
[8]   Noniterative MAP Reconstruction Using Sparse Matrix Representations [J].
Cao, Guangzhi ;
Bouman, Charles A. ;
Webb, Kevin J. .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2009, 18 (09) :2085-2099
[9]   Shrinkage estimators for covariance matrices [J].
Daniels, MJ ;
Kass, RE .
BIOMETRICS, 2001, 57 (04) :1173-1184
[10]  
Frank A., 2010, UCI machine learning repository, V213