Robust linear regression methods in association studies

Cited by: 44
Authors
Lourenco, V. M. [1 ]
Pires, A. M. [2 ,3 ]
Kirst, M. [4 ]
Affiliations
[1] Univ Nova Lisboa, Fac Ciencias & Tecnol, Dept Math, P-2829516 Caparica, Portugal
[2] Univ Tecn Lisboa, Inst Super Tecn,Dept Math, P-1049001 Lisbon, Portugal
[3] Univ Tecn Lisboa, CEMAT, Inst Super Tecn, P-1049001 Lisbon, Portugal
[4] Univ Florida, Genet Inst, Plant Mol & Cellular Biol Program, Sch Forest Resources & Conservat, Gainesville, FL 32611 USA
Keywords
SINGLE-NUCLEOTIDE POLYMORPHISMS; MAYS SSP PARVIGLUMIS; LINKAGE DISEQUILIBRIUM; STRUCTURED POPULATIONS; QUANTITATIVE TRAITS; GENETIC-ASSOCIATION; CANDIDATE GENES; STRATIFICATION; STATISTICS; INFERENCE;
DOI
10.1093/bioinformatics/btr006
Chinese Library Classification number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Motivation: It is well known that data deficiencies, such as coding/rounding errors, outliers or missing values, may lead to misleading results for many statistical methods. Robust statistical methods are designed to accommodate certain types of those deficiencies, allowing for reliable results under various conditions. We analyze the case of statistical tests to detect associations between genomic individual variations (SNPs) and quantitative traits when deviations from the normality assumption are observed. We consider the classical analysis of variance tests for the parameters of the appropriate linear model and a robust version of those tests based on M-regression. We then compare their empirical power and level using simulated data with several degrees of contamination.
Results: Data normality is nothing but a mathematical convenience. In practice, experiments usually yield data with non-conforming observations. In the presence of this type of data, classical least squares statistical methods perform poorly, giving biased estimates, raising the number of spurious associations and often failing to detect true ones. We show, through a simulation study and a real data example, that the robust methodology can be more powerful and thus more adequate for association studies than the classical approach.
Pages: 815-821
Number of pages: 7
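The contrast the abstract draws between least squares and M-regression under contamination can be illustrated with a small simulation. The sketch below is not the authors' procedure: it only compares slope estimates (not the tests, power or level studied in the paper), and the Huber loss with tuning constant 1.345, the MAD residual scale, the additive 0/1/2 genotype coding, the contamination scheme and every function name are assumptions made for illustration.

```python
import random
import statistics


def weighted_line_fit(x, y, w):
    """Weighted least squares fit of y = a + b*x; returns (a, b)."""
    sw = sum(w)
    mx = sum(wi * xi for wi, xi in zip(w, x)) / sw
    my = sum(wi * yi for wi, yi in zip(w, y)) / sw
    sxx = sum(wi * (xi - mx) ** 2 for wi, xi in zip(w, x))
    sxy = sum(wi * (xi - mx) * (yi - my) for wi, xi, yi in zip(w, x, y))
    b = sxy / sxx
    return my - b * mx, b


def huber_line_fit(x, y, c=1.345, max_iter=100, tol=1e-10):
    """Huber M-estimate of (a, b) by iteratively reweighted least squares."""
    a, b = weighted_line_fit(x, y, [1.0] * len(x))  # start from the OLS fit
    for _ in range(max_iter):
        resid = [yi - (a + b * xi) for xi, yi in zip(x, y)]
        med = statistics.median(resid)
        # Robust residual scale: normalized median absolute deviation.
        scale = statistics.median([abs(r - med) for r in resid]) / 0.6745
        if scale == 0.0:
            break
        cut = c * scale
        # Huber weights: 1 inside the cutoff, cut/|r| outside it.
        w = [1.0 if abs(r) <= cut else cut / abs(r) for r in resid]
        a_new, b_new = weighted_line_fit(x, y, w)
        done = abs(a_new - a) + abs(b_new - b) < tol
        a, b = a_new, b_new
        if done:
            break
    return a, b


def simulate(seed=1, n=300, true_slope=0.5, shift=10.0, frac=0.3):
    """Return (ols_slope, huber_slope) for one contaminated sample."""
    rng = random.Random(seed)
    # Additive genotype coding 0/1/2 with allele frequency 0.5 (assumption).
    x = [rng.choice([0, 1, 1, 2]) for _ in range(n)]
    y = [true_slope * xi + rng.gauss(0.0, 1.0) for xi in x]
    # Asymmetric contamination: shift a fraction of genotype-0 traits upward.
    y = [yi + shift if xi == 0 and rng.random() < frac else yi
         for xi, yi in zip(x, y)]
    _, b_ols = weighted_line_fit(x, y, [1.0] * n)
    _, b_huber = huber_line_fit(x, y)
    return b_ols, b_huber


if __name__ == "__main__":
    b_ols, b_huber = simulate()
    print(f"true slope 0.5, OLS {b_ols:.3f}, Huber {b_huber:.3f}")
```

With this contamination scheme the least squares slope is pulled far from the true value, while the Huber slope stays closer to it. Bounding the influence of outliers reduces the bias but does not remove it.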