Targeted Fused Ridge Estimation of Inverse Covariance Matrices from Multiple High-Dimensional Data Classes

被引:0
|
作者
Bilgrau, Anders Ellern [1 ,2 ]
Peeters, Carel F. W. [3 ]
Eriksen, Poul Svante [1 ]
Bogsted, Martin [2 ,4 ]
van Wieringen, Wessel N. [3 ,5 ]
机构
[1] Aalborg Univ, Dept Math Sci, DK-9220 Aalborg O, Denmark
[2] Aalborg Univ Hosp, Dept Haematol, DK-9000 Aalborg, Denmark
[3] Amsterdam Univ Med Ctr, Dept Epidemiol & Biostat, Locat VUmc, Postbus 7057, NL-1007 MB Amsterdam, Netherlands
[4] Aalborg Univ, Dept Clin Med, DK-9000 Aalborg, Denmark
[5] Vrije Univ Amsterdam, Dept Math, NL-1081 HV Amsterdam, Netherlands
关键词
differential network estimation; Gaussian graphical modeling; generalized fused ridge; high-dimensional data; l(2)-penalized maximum likelihood; structural metaanalysis; B-CELL LYMPHOMA; CLASSIFICATION; MICROARRAY; SELECTION; NETWORKS; MODEL; KEGG;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of jointly estimating multiple inverse covariance matrices from high-dimensional data consisting of distinct classes. An l(2)-penalized maximum likelihood approach is employed. The suggested approach is flexible and generic, incorporating several other l(2) -penalized estimators as special cases. In addition, the approach allows specification of target matrices through which prior knowledge may be incorporated and which can stabilize the estimation procedure in high-dimensional settings. The result is a targeted fused ridge estimator that is of use when the precision matrices of the constituent classes are believed to chiefly share the same structure while potentially differing in a number of locations of interest. It has many applications in (multi)factorial study designs. We focus on the graphical interpretation of precision matrices with the proposed estimator then serving as a basis for integrative or meta-analytic Gaussian graphical modeling. Situations are considered in which the classes are defined by data sets and subtypes of diseases. The performance of the proposed estimator in the graphical modeling setting is assessed through extensive simulation experiments. Its practical usability is illustrated by the differential network modeling of 12 large-scale gene expression data sets of diffuse large B-cell lymphoma subtypes. The estimator and its related procedures are incorporated into the R-package rags2ridges.
引用
收藏
页数:52
相关论文
共 50 条
  • [41] On d-Asymptotics for High-Dimensional Discriminant Analysis with Different Variance-Covariance Matrices
    Ayano, Takanori
    Suzuki, Joe
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (12) : 3106 - 3108
  • [42] Estimating large covariance matrix with network topology for high-dimensional biomedical data
    Chen, Shuo
    Kang, Jian
    Xing, Yishi
    Zhao, Yunpeng
    Milton, Donald K.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2018, 127 : 82 - 95
  • [43] TESTING HIGH-DIMENSIONAL COVARIANCE MATRICES, WITH APPLICATION TO DETECTING SCHIZOPHRENIA RISK GENES
    Zhu, Lingxue
    Lei, Jing
    Devlin, Bernie
    Roeder, Kathryn
    ANNALS OF APPLIED STATISTICS, 2017, 11 (03) : 1810 - 1831
  • [44] Estimating high-dimensional covariance and precision matrices under general missing dependence
    Park, Seongoh
    Wang, Xinlei
    Lim, Johan
    ELECTRONIC JOURNAL OF STATISTICS, 2021, 15 (02): : 4868 - 4915
  • [45] A Dynamic Structure for High-Dimensional Covariance Matrices and Its Application in Portfolio Allocation
    Guo, Shaojun
    Box, John Leigh
    Zhang, Wenyang
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (517) : 235 - 253
  • [46] The minimum weighted covariance determinant estimator for high-dimensional data
    Kalina, Jan
    Tichavsky, Jan
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2022, 16 (04) : 977 - 999
  • [47] The minimum weighted covariance determinant estimator for high-dimensional data
    Jan Kalina
    Jan Tichavský
    Advances in Data Analysis and Classification, 2022, 16 : 977 - 999
  • [48] High-dimensional covariance forecasting based on principal component analysis of high-frequency data
    Jian, Zhihong
    Deng, Pingjun
    Zhu, Zhican
    ECONOMIC MODELLING, 2018, 75 : 422 - 431
  • [49] Riemannian Statistics Meets Random Matrix Theory: Toward Learning From High-Dimensional Covariance Matrices
    Said, Salem
    Heuveline, Simon
    Mostajeran, Cyrus
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2023, 69 (01) : 472 - 481
  • [50] DOUBLY PENALIZED ESTIMATION IN ADDITIVE REGRESSION WITH HIGH-DIMENSIONAL DATA
    Tan, Zhiqiang
    Zhang, Cun-Hui
    ANNALS OF STATISTICS, 2019, 47 (05) : 2567 - 2600