Simultaneous Clustering and Estimation of Heterogeneous Graphical Models

被引:0
|
作者
Hao, Botao [1 ]
Sun, Will Wei [2 ]
Liu, Yufeng [3 ]
Cheng, Guang [1 ]
机构
[1] Purdue Univ, Dept Stat, W Lafayette, IN 47906 USA
[2] Univ Miami, Dept Management Sci, Sch Business Adm, Miami, FL 33146 USA
[3] Univ N Carolina, Dept Stat & Operat Res, Lineberger Comprehens Canc Ctr, Dept Biostat,Carolina Ctr Genome Sci,Dept Genet, Chapel Hill, NC 27599 USA
关键词
Clustering; finite-sample analysis; graphical models; high-dimensional statistics; non-convex optimization; INVERSE COVARIANCE ESTIMATION; JOINT ESTIMATION; SELECTION; LASSO;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider joint estimation of multiple graphical models arising from heterogeneous and high-dimensional observations. Unlike most previous approaches which assume that the cluster structure is given in advance, an appealing feature of our method is to learn cluster structure while estimating heterogeneous graphical models. This is achieved via a high dimensional version of Expectation Conditional Maximization (ECM) algorithm (Meng and Rubin, 1993). A joint graphical lasso penalty is imposed on the conditional maximization step to extract both homogeneity and heterogeneity components across all clusters. Our algorithm is computationally efficient due to fast sparse learning routines and can be implemented without unsupervised learning knowledge. The superior performance of our method is demonstrated by extensive experiments and its application to a Glioblastoma cancer dataset reveals some new insights in understanding the Glioblastoma cancer. In theory, a non-asymptotic error bound is established for the output directly from our high dimensional ECM algorithm, and it consists of two quantities: statistical error (statistical accuracy) and optimization error (computational complexity). Such a result gives a theoretical guideline in terminating our ECM iterations.
引用
收藏
页数:58
相关论文
empty
未找到相关数据