Targeted Fused Ridge Estimation of Inverse Covariance Matrices from Multiple High-Dimensional Data Classes

被引:0
|
作者
Bilgrau, Anders Ellern [1 ,2 ]
Peeters, Carel F. W. [3 ]
Eriksen, Poul Svante [1 ]
Bogsted, Martin [2 ,4 ]
van Wieringen, Wessel N. [3 ,5 ]
机构
[1] Aalborg Univ, Dept Math Sci, DK-9220 Aalborg O, Denmark
[2] Aalborg Univ Hosp, Dept Haematol, DK-9000 Aalborg, Denmark
[3] Amsterdam Univ Med Ctr, Dept Epidemiol & Biostat, Locat VUmc, Postbus 7057, NL-1007 MB Amsterdam, Netherlands
[4] Aalborg Univ, Dept Clin Med, DK-9000 Aalborg, Denmark
[5] Vrije Univ Amsterdam, Dept Math, NL-1081 HV Amsterdam, Netherlands
关键词
differential network estimation; Gaussian graphical modeling; generalized fused ridge; high-dimensional data; l(2)-penalized maximum likelihood; structural metaanalysis; B-CELL LYMPHOMA; CLASSIFICATION; MICROARRAY; SELECTION; NETWORKS; MODEL; KEGG;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of jointly estimating multiple inverse covariance matrices from high-dimensional data consisting of distinct classes. An l(2)-penalized maximum likelihood approach is employed. The suggested approach is flexible and generic, incorporating several other l(2) -penalized estimators as special cases. In addition, the approach allows specification of target matrices through which prior knowledge may be incorporated and which can stabilize the estimation procedure in high-dimensional settings. The result is a targeted fused ridge estimator that is of use when the precision matrices of the constituent classes are believed to chiefly share the same structure while potentially differing in a number of locations of interest. It has many applications in (multi)factorial study designs. We focus on the graphical interpretation of precision matrices with the proposed estimator then serving as a basis for integrative or meta-analytic Gaussian graphical modeling. Situations are considered in which the classes are defined by data sets and subtypes of diseases. The performance of the proposed estimator in the graphical modeling setting is assessed through extensive simulation experiments. Its practical usability is illustrated by the differential network modeling of 12 large-scale gene expression data sets of diffuse large B-cell lymphoma subtypes. The estimator and its related procedures are incorporated into the R-package rags2ridges.
引用
收藏
页数:52
相关论文
共 50 条
  • [21] Discriminative Ridge Machine: A Classifier for High-Dimensional Data or Imbalanced Data
    Peng, Chong
    Cheng, Qiang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (06) : 2595 - 2609
  • [22] Non-Negativity Constrained Missing Data Estimation for High-Dimensional and Sparse Matrices from Industrial Applications
    Luo, Xin
    Zhou, MengChu
    Li, Shuai
    Hu, Lun
    Shang, Mingsheng
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (05) : 1844 - 1855
  • [23] High-dimensional covariance estimation under the presence of outliers
    Huang, Hsin-Cheng
    Lee, Thomas C. M.
    STATISTICS AND ITS INTERFACE, 2016, 9 (04) : 461 - 468
  • [24] Robust Ridge Regression for High-Dimensional Data
    Maronna, Ricardo A.
    TECHNOMETRICS, 2011, 53 (01) : 44 - 53
  • [25] HIGH-DIMENSIONAL SPARSE COVARIANCE ESTIMATION FOR RANDOM SIGNALS
    Nasif, Ahmed O.
    Tian, Zhi
    Ling, Qing
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 4658 - 4662
  • [26] High-dimensional covariance matrix estimation with missing observations
    Lounici, Karim
    BERNOULLI, 2014, 20 (03) : 1029 - 1058
  • [27] A scalable sparse Cholesky based approach for learning high-dimensional covariance matrices in ordered data
    Kshitij Khare
    Sang-Yun Oh
    Syed Rahman
    Bala Rajaratnam
    Machine Learning, 2019, 108 : 2061 - 2086
  • [28] High-dimensional covariance matrices tests for analyzing multi-tumor gene expression data
    Qayed, Abdullah
    Han, Dong
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2021, 30 (08) : 1904 - 1916
  • [29] Multiple imputation in the presence of high-dimensional data
    Zhao, Yize
    Long, Qi
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2016, 25 (05) : 2021 - 2035
  • [30] Likelihood ratio tests for covariance matrices of high-dimensional normal distributions
    Jiang, Dandan
    Jiang, Tiefeng
    Yang, Fan
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2012, 142 (08) : 2241 - 2256