An Optimum Projection and Noise Reduction Approach for Detecting Rare and Common Variants Associated with Complex Diseases

被引:2
作者
Turkmen, Asuman [1 ,2 ]
Lin, Shili [2 ]
机构
[1] Ohio State Univ Newark, Newark, OH 43055 USA
[2] Ohio State Univ, Dept Stat, Columbus, OH 43210 USA
关键词
Missing heritability; Noise reduction; Partial least squares; Rare variants; Regularization/LASSO; Super variant; PARTIAL LEAST-SQUARES; CLASSIFICATION; CONTRIBUTE; REGRESSION;
D O I
10.1159/000343797
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background: Despite the thrilling advances in identifying gene variants that influence common diseases, most of the heritable risk for many common diseases still remains unidentified. One of the possible reasons for this missing heritability is that the genome-wide association study (GWAS) approaches have been focusing on common rather than rare single nucleotide variants (SNVs). Consequently, there is currently a great deal of interest in developing methods that can interrogate rare variants for association with diseases. Methods: We propose a two-step method (termed rPLS) to reveal possible genetic effects related to rare as well as common variants. The procedure starts with removing irrelevant variants using penalized regression (regularization) which is followed by partial least squares (PLS) on the surviving SNVs to find an optimal linear combination of rare and common SNVs within a genomic region that is tested for its association with the trait of interest. Results: Simulation settings based on the 1000 Genomes sequencing data and reflecting real situations demonstrated that rPLS performs well compared to existing methods especially when there are a large number of noncausal variants (both rare and common) present in the gene and when causal SNVs have different effect sizes and directions. Copyright (C) 2012 S. Karger AG, Basel
引用
收藏
页码:51 / 60
页数:10
相关论文
共 26 条
[1]   Genetic Analysis Workshop 17 mini-exome simulation [J].
Laura Almasy ;
Thomas D Dyer ;
Juan Manuel Peralta ;
Jack W Kent ;
Jac C Charlesworth ;
Joanne E Curran ;
John Blangero .
BMC Proceedings, 5 (Suppl 9)
[2]   Rare Variant Association Analysis Methods for Complex Traits [J].
Asimit, Jennifer ;
Zeggini, Eleftheria .
ANNUAL REVIEW OF GENETICS, VOL 44, 2010, 44 :293-308
[3]   Multilocus association testing with penalized regression [J].
Basu, Saonli ;
Pan, Wei ;
Shen, Xiaotong ;
Oetting, William S. .
GENETIC EPIDEMIOLOGY, 2011, 35 (08) :755-765
[4]   Comparison of Statistical Tests for Disease Association With Rare Variants [J].
Basu, Saonli ;
Pan, Wei .
GENETIC EPIDEMIOLOGY, 2011, 35 (07) :606-619
[5]   A Covering Method for Detecting Genetic Associations between Rare Variants and Common Phenotypes [J].
Bhatia, Gaurav ;
Bansal, Vikas ;
Harismendy, Olivier ;
Schork, Nicholas J. ;
Topol, Eric J. ;
Frazer, Kelly ;
Bafna, Vineet .
PLOS COMPUTATIONAL BIOLOGY, 2010, 6 (10)
[6]  
Boulesteix A.-L., 2004, STAT APPL GENET MOL, V3, P1, DOI [DOI 10.2202/1544-6115.1075, 10.2202/1544-6115.1075]
[7]  
Chapman JM, 2008, GENET EPIDEMIOL, V32
[8]   Multiple rare Alleles contribute to low plasma levels of HDL cholesterol [J].
Cohen, JC ;
Kiss, RS ;
Pertsemlidis, A ;
Marcel, YL ;
McPherson, R ;
Hobbs, HH .
SCIENCE, 2004, 305 (5685) :869-872
[9]   SIMPLS - AN ALTERNATIVE APPROACH TO PARTIAL LEAST-SQUARES REGRESSION [J].
DEJONG, S .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1993, 18 (03) :251-263
[10]   Testing against a high dimensional alternative [J].
Goeman, JJ ;
van de Geer, SA ;
van Houwelingen, HC .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2006, 68 :477-493