Prior robust empirical Bayes inference for large-scale data by conditioning on rank with application to microarray data

被引:3
作者
Liao, J. G. [1 ]
McMurry, Timothy [2 ]
Berg, Arthur [1 ]
机构
[1] Penn State Univ, Div Biostat & Bioinformat, Hershey, PA 17033 USA
[2] Univ Virginia, Div Biostat, Charlottesville, VA 22908 USA
关键词
Bayesian shrinkage; Confidence intervals; Ranking bias; Robust multiple estimation; MULTIPLE CONFIDENCE-INTERVALS; GENE-EXPRESSION; SELECTION; MODEL;
D O I
10.1093/biostatistics/kxt026
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Empirical Bayes methods have been extensively used for microarray data analysis by modeling the large number of unknown parameters as random effects. Empirical Bayes allows borrowing information across genes and can automatically adjust for multiple testing and selection bias. However, the standard empirical Bayes model can perform poorly if the assumed working prior deviates from the true prior. This paper proposes a new rank-conditioned inference in which the shrinkage and confidence intervals are based on the distribution of the error conditioned on rank of the data. Our approach is in contrast to a Bayesian posterior, which conditions on the data themselves. The new method is almost as efficient as standard Bayesian methods when the working prior is close to the true prior, and it is much more robust when the working prior is not close. In addition, it allows a more accurate (but also more complex) non-parametric estimate of the prior to be easily incorporated, resulting in improved inference. The new method's prior robustness is demonstrated via simulation experiments. Application to a breast cancer gene expression microarray dataset is presented. Our R package rank. Shrinkage provides a ready-to-use implementation of the proposed methodology.
引用
收藏
页码:60 / 73
页数:14
相关论文
共 50 条
  • [41] Predicting causal effects in large-scale systems from observational data
    Maathuis, Marloes H.
    Colombo, Diego
    Kalisch, Markus
    Buehlmann, Peter
    [J]. NATURE METHODS, 2010, 7 (04) : 247 - 248
  • [42] An Improved Affinity Propagation Clustering Algorithm for Large-scale Data Sets
    Liu, Xiaonan
    Yin, Meijuan
    Luo, Junyong
    Chen, Wuping
    [J]. 2013 NINTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2013, : 894 - 899
  • [43] Learning Compact Model for Large-Scale Multi-Label Data
    Wei, Tong
    Li, Yu-Feng
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 5385 - 5392
  • [44] An Empirical Framework for Large-Scale Policy Analysis, with an Application to School Finance Reform in Michigan
    Ferreyra, Maria Marta
    [J]. AMERICAN ECONOMIC JOURNAL-ECONOMIC POLICY, 2009, 1 (01) : 147 - 180
  • [45] Local Feature Selection for Large-Scale Data Sets With Limited Labels
    Yang, Tian
    Deng, Yanfang
    Yu, Bin
    Qian, Yuhua
    Dai, Jianhua
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (07) : 7152 - 7163
  • [46] Weighted logistic regression for large-scale imbalanced and rare events data
    Maalouf, Maher
    Siddiqi, Mohammad
    [J]. KNOWLEDGE-BASED SYSTEMS, 2014, 59 : 142 - 148
  • [47] Human and organizational error data challenges in complex, large-scale systems
    Grabowski, Martha
    You, Zhuyu
    Zhou, Zhi
    Song, Huawei
    Steward, Michael
    Steward, Brittany
    [J]. SAFETY SCIENCE, 2009, 47 (08) : 1185 - 1194
  • [48] In silico drug repositioning: from large-scale transcriptome data to therapeutics
    Kwon, Ok-Seon
    Kim, Wankyu
    Cha, Hyuk-Jin
    Lee, Haeseung
    [J]. ARCHIVES OF PHARMACAL RESEARCH, 2019, 42 (10) : 879 - 889
  • [49] Computing the power profiles for an Airborne Wind Energy system based on large-scale wind data
    Malz, E. C.
    Verendel, V.
    Gros, S.
    [J]. RENEWABLE ENERGY, 2020, 162 : 766 - 778
  • [50] Efficient statistical estimation for a non-randomly distributed system with application to large-scale data neural network
    Wang, Kangning
    Zhang, Benle
    Sun, Xiaofei
    Li, Shaomin
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 197