共 50 条
Integrating approximate single factor graphical models
被引:6
|作者:
Fan, Xinyan
[1
]
Fang, Kuangnan
[2
,3
]
Ma, Shuangge
[4
]
Zhang, Qingzhao
[2
,3
,5
]
机构:
[1] Renmin Univ China, Sch Stat, Beijing, Peoples R China
[2] Xiamen Univ, Sch Econ, Dept Stat, Xiamen, Fujian, Peoples R China
[3] Xiamen Univ, Key Lab Econometr, Minist Educ, Xiamen, Fujian, Peoples R China
[4] Yale Sch Publ Hlth, Dept Biostat, New Haven, CT USA
[5] Xiamen Univ, Wang Yanan Inst Studies Econ, Xiamen, Fujian, Peoples R China
基金:
美国国家卫生研究院;
中国国家自然科学基金;
关键词:
approximate single factor graphical model;
integrative analysis;
penalized high dimensional analysis;
INVERSE COVARIANCE ESTIMATION;
GENE-EXPRESSION;
CANCER;
SELECTION;
D O I:
10.1002/sim.8408
中图分类号:
Q [生物科学];
学科分类号:
07 ;
0710 ;
09 ;
摘要:
In the analysis of complex and high-dimensional data, graphical models have been commonly adopted to describe associations among variables. When common factors exist which make the associations dense, the single factor graphical model has been proposed, which first extracts the common factor and then conducts graphical modeling. Under other simpler contexts, it has been recognized that results generated from analyzing a single dataset are often unsatisfactory, and integrating multiple datasets can effectively improve variable selection and estimation. In graphical modeling, the increased number of parameters makes the "lack of information" problem more severe. In this article, we integrate multiple datasets and conduct the approximate single factor graphical model analysis. A novel penalization approach is developed for the identification and estimation of important loadings and edges. An effective computational algorithm is developed. A wide spectrum of simulations and the analysis of breast cancer gene expression datasets demonstrate the competitive performance of the proposed approach. Overall, this study provides an effective new venue for taking advantage of multiple datasets and improving graphical model analysis.
引用
收藏
页码:146 / 155
页数:10
相关论文