Genome-wide discovery for biomarkers using quantile regression at biobank scale

被引:4
作者
Wang, Chen [1 ,2 ]
Wang, Tianying [3 ]
Kiryluk, Krzysztof [2 ]
Wei, Ying [1 ]
Aschard, Hugues [4 ]
Ionita-Laza, Iuliana [1 ,5 ]
机构
[1] Columbia Univ, Dept Biostat, New York, NY 10027 USA
[2] Columbia Univ, Vagelos Coll Phys & Surg, Dept Med, Div Nephrol, New York, NY USA
[3] Colorado State Univ, Ft Collins, CO USA
[4] Univ Paris Cite, Inst Pasteur, Dept Computat Biol, Paris, France
[5] Lund Univ, Dept Stat, Lund, Sweden
基金
美国国家卫生研究院;
关键词
PHENOTYPIC VARIABILITY; DISEASE; OPPORTUNITIES; CHALLENGES; MODELS; LOCI; GENE;
D O I
10.1038/s41467-024-50726-x
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Genome-wide association studies (GWAS) for biomarkers important for clinical phenotypes can lead to clinically relevant discoveries. Conventional GWAS for quantitative traits are based on simplified regression models modeling the conditional mean of a phenotype as a linear function of genotype. We draw attention here to an alternative, lesser known approach, namely quantile regression that naturally extends linear regression to the analysis of the entire conditional distribution of a phenotype of interest. Quantile regression can be applied efficiently at biobank scale, while having some unique advantages such as (1) identifying variants with heterogeneous effects across quantiles of the phenotype distribution; (2) accommodating a wide range of phenotype distributions including non-normal distributions, with invariance of results to trait transformations; and (3) providing more detailed information about genotype-phenotype associations even for those associations identified by conventional GWAS. We show in simulations that quantile regression is powerful across both homogeneous and various heterogeneous models. Applications to 39 quantitative traits in the UK Biobank demonstrate that quantile regression can be a helpful complement to linear regression in GWAS and can identify variants with larger effects on high-risk subgroups of individuals but with lower or no contribution overall. Here, the authors propose using quantile regression for genome-wide association studies with quantitative traits in UK Biobank, showing its advantages over linear regression in handling nonnormal distributions and identifying heterogeneous genetic effects.
引用
收藏
页数:13
相关论文
共 52 条
[1]   Rank-Based Inverse Normal Transformations are Increasingly Used, But are They Merited? [J].
Beasley, T. Mark ;
Erickson, Stephen ;
Allison, David B. .
BEHAVIOR GENETICS, 2009, 39 (05) :580-595
[2]   QUANTILE REGRESSION-OPPORTUNITIES AND CHALLENGES FROM A USER'S PERSPECTIVE [J].
Beyerlein, Andreas .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2014, 180 (03) :330-331
[3]   Optimization Methods for Large-Scale Machine Learning [J].
Bottou, Leon ;
Curtis, Frank E. ;
Nocedal, Jorge .
SIAM REVIEW, 2018, 60 (02) :223-311
[4]   Genetic interactions affecting human gene expression identified by variance association mapping [J].
Brown, Andrew Anand ;
Buil, Alfonso ;
Vinuela, Ana ;
Lappalainen, Tuuli ;
Zheng, Hou-Feng ;
Richards, John B. ;
Small, Kerrin S. ;
Spector, Timothy D. ;
Dermitzakis, Emmanouil T. ;
Durbin, Richard .
ELIFE, 2014, 3
[5]  
Brown M.B., 1974, Journal of the American Statistical Association, V69, P364, DOI [10.1137/1003016 10.1137/1003016, DOI 10.1137/1003016]
[6]   ROBUST TESTS FOR EQUALITY OF VARIANCES [J].
BROWN, MB ;
FORSYTHE, AB .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1974, 69 (346) :364-367
[7]   The UK Biobank resource with deep phenotyping and genomic data [J].
Bycroft, Clare ;
Freeman, Colin ;
Petkova, Desislava ;
Band, Gavin ;
Elliott, Lloyd T. ;
Sharp, Kevin ;
Motyer, Allan ;
Vukcevic, Damjan ;
Delaneau, Olivier ;
O'Connell, Jared ;
Cortes, Adrian ;
Welsh, Samantha ;
Young, Alan ;
Effingham, Mark ;
McVean, Gil ;
Leslie, Stephen ;
Allen, Naomi ;
Donnelly, Peter ;
Marchini, Jonathan .
NATURE, 2018, 562 (7726) :203-+
[8]   Second-generation PLINK: rising to the challenge of larger and richer datasets [J].
Chang, Christopher C. ;
Chow, Carson C. ;
Tellier, Laurent C. A. M. ;
Vattikuti, Shashaank ;
Purcell, Shaun M. ;
Lee, James J. .
GIGASCIENCE, 2015, 4
[9]   Developing and evaluating polygenic risk prediction models for stratified disease prevention [J].
Chatterjee, Nilanjan ;
Shi, Jianxin ;
Garcia-Closas, Montserrat .
NATURE REVIEWS GENETICS, 2016, 17 (07) :392-406
[10]   Limited statistical evidence for shared genetic effects of eQTLs and autoimmune-disease-associated loci in three major immune-cell types [J].
Chun, Sung ;
Casparino, Alexandra ;
Patsopoulos, Nikolaos A. ;
Croteau-Chonka, Damien C. ;
Raby, Benjamin A. ;
De Jager, Philip L. ;
Sunyaev, Shamil R. ;
Cotsapas, Chris .
NATURE GENETICS, 2017, 49 (04) :600-+