Integrating approximate single factor graphical models

被引:6
|
作者
Fan, Xinyan [1 ]
Fang, Kuangnan [2 ,3 ]
Ma, Shuangge [4 ]
Zhang, Qingzhao [2 ,3 ,5 ]
机构
[1] Renmin Univ China, Sch Stat, Beijing, Peoples R China
[2] Xiamen Univ, Sch Econ, Dept Stat, Xiamen, Fujian, Peoples R China
[3] Xiamen Univ, Key Lab Econometr, Minist Educ, Xiamen, Fujian, Peoples R China
[4] Yale Sch Publ Hlth, Dept Biostat, New Haven, CT USA
[5] Xiamen Univ, Wang Yanan Inst Studies Econ, Xiamen, Fujian, Peoples R China
基金
美国国家卫生研究院; 中国国家自然科学基金;
关键词
approximate single factor graphical model; integrative analysis; penalized high dimensional analysis; INVERSE COVARIANCE ESTIMATION; GENE-EXPRESSION; CANCER; SELECTION;
D O I
10.1002/sim.8408
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In the analysis of complex and high-dimensional data, graphical models have been commonly adopted to describe associations among variables. When common factors exist which make the associations dense, the single factor graphical model has been proposed, which first extracts the common factor and then conducts graphical modeling. Under other simpler contexts, it has been recognized that results generated from analyzing a single dataset are often unsatisfactory, and integrating multiple datasets can effectively improve variable selection and estimation. In graphical modeling, the increased number of parameters makes the "lack of information" problem more severe. In this article, we integrate multiple datasets and conduct the approximate single factor graphical model analysis. A novel penalization approach is developed for the identification and estimation of important loadings and edges. An effective computational algorithm is developed. A wide spectrum of simulations and the analysis of breast cancer gene expression datasets demonstrate the competitive performance of the proposed approach. Overall, this study provides an effective new venue for taking advantage of multiple datasets and improving graphical model analysis.
引用
收藏
页码:146 / 155
页数:10
相关论文
共 50 条
  • [31] Estimating graphical models for count data with applications to single-cell gene network
    Xiao, Feiyi
    Tang, Junjie
    Fang, Huaying
    Xi, Ruibin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [32] Tests for Gaussian graphical models
    Verzelen, N.
    Villers, F.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2009, 53 (05) : 1894 - 1905
  • [33] Sparse Nonparametric Graphical Models
    Lafferty, John
    Liu, Han
    Wasserman, Larry
    STATISTICAL SCIENCE, 2012, 27 (04) : 519 - 537
  • [34] COMBINATORIAL INFERENCE FOR GRAPHICAL MODELS
    Neykov, Matey
    Lu, Junwei
    Liu, Han
    ANNALS OF STATISTICS, 2019, 47 (02) : 795 - 827
  • [35] Heterogeneous reciprocal graphical models
    Ni, Yang
    Muller, Peter
    Zhu, Yitan
    Ji, Yuan
    BIOMETRICS, 2018, 74 (02) : 606 - 615
  • [36] Elementary Estimators for Graphical Models
    Yang, Eunho
    Lozano, Aurelie C.
    Ravikumar, Pradeep
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [37] The cluster graphical lasso for improved estimation of Gaussian graphical models
    Tan, Kean Ming
    Witten, Daniela
    Shojaie, Ali
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2015, 85 : 23 - 36
  • [38] Graphical Models for Ordinal Data
    Guo, Jian
    Levina, Elizaveta
    Michailidis, George
    Zhu, Ji
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2015, 24 (01) : 183 - 204
  • [39] Learning Graphical Models With Hubs
    Tan, Kean Ming
    London, Palma
    Mohan, Karthik
    Lee, Su-In
    Fazel, Maryam
    Witten, Daniela
    JOURNAL OF MACHINE LEARNING RESEARCH, 2014, 15 : 3297 - 3331
  • [40] Conditional Functional Graphical Models
    Lee, Kuang-Yao
    Ji, Dingjue
    Li, Lexin
    Constable, Todd
    Zhao, Hongyu
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (541) : 257 - 271