A data-driven weighting scheme for family-based genome-wide association studies

被引:6
作者
Qin, Huaizhen [1 ]
Feng, Tao [1 ,2 ]
Zhang, Shuanglin [1 ,2 ]
Sha, Qiuying [1 ]
机构
[1] Michigan Technol Univ, Dept Math Sci, Houghton, MI 49931 USA
[2] Heilongjiang Univ, Dept Math, Harbin, Peoples R China
关键词
two stage; data-driven weighting; linkage disequilibrium; population stratification; LINKAGE DISEQUILIBRIUM; TESTS;
D O I
10.1038/ejhg.2009.201
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Recently, Steen et al proposed a novel two-stage approach for family-based genome-wide association studies. In the first stage, a test based on between-family information is used to rank SNPs according to their P-values or conditional power of the test. In the second stage, the R most promising SNPs are tested using a family-based association test. We call this two-stage approach top R method. Ionita-Laza et al proposed an exponential weighting method within a two-stage framework. In the second stage of this approach, instead of testing top R SNPs, it tests all SNPs and weights the P-values of association test according to the information of the first stage. However, both of the top R and exponential weighting methods only use the information from the first stage to rank SNPs. It seems that the two methods do not use information from the first stage efficiently. Furthermore, it may be unreasonable for the exponential weighting method to use the same weight for all SNPs within a group when only one or a few SNPs are related with a disease. In this article, we propose a data-driven weighting scheme within a two-stage framework. In this method, we use the information from the first stage to determine a SNP-specific weight for each SNP. We use simulation studies to evaluate the performance of our method. The simulation results showed that our proposed method is consistently more powerful than the top R method and the exponential weighting method, regardless of the LD structure, population structure, and family structure. European Journal of Human Genetics (2010) 18, 596-603; doi:10.1038/ejhg.2009.201; published online 25 November 2009
引用
收藏
页码:596 / 603
页数:8
相关论文
共 19 条
[1]   A METHOD FOR QUANTIFYING DIFFERENTIATION BETWEEN POPULATIONS AT MULTI-ALLELIC LOCI AND ITS IMPLICATIONS FOR INVESTIGATING IDENTITY AND PATERNITY [J].
BALDING, DJ ;
NICHOLS, RA .
GENETICA, 1995, 96 (1-2) :3-12
[2]   Interpretation of simultaneous linkage and family-based association tests in genome screens [J].
Chung, Ren-Hua ;
Hauser, Elizabeth R. ;
Martin, Eden R. .
GENETIC EPIDEMIOLOGY, 2007, 31 (02) :134-142
[3]   Transmission/disequilibrium tests for extended marker haplotypes [J].
Clayton, D ;
Jones, H .
AMERICAN JOURNAL OF HUMAN GENETICS, 1999, 65 (04) :1161-1169
[4]   Two-stage association tests for genome-wide association studies based on family data with arbitrary family structure [J].
Feng, Tao ;
Zhang, Shuanglin ;
Sha, Qiuying .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2007, 15 (11) :1169-1175
[5]   A common genetic variant is associated with adult and childhood obesity [J].
Herbert, A ;
Gerry, NP ;
McQueen, MB ;
Heid, IM ;
Pfeufer, A ;
Illig, T ;
Wichmann, HE ;
Meitinger, T ;
Hunter, D ;
Hu, FB ;
Colditz, G ;
Hinney, A ;
Hebebrand, J ;
Koberwitz, K ;
Zhu, XF ;
Cooper, R ;
Ardlie, K ;
Lyon, H ;
Hirschhorn, JN ;
Laird, NM ;
Lenburg, ME ;
Lange, C ;
Christman, MF .
SCIENCE, 2006, 312 (5771) :279-283
[6]   Generating samples under a Wright-Fisher neutral model of genetic variation [J].
Hudson, RR .
BIOINFORMATICS, 2002, 18 (02) :337-338
[7]   Genomewide weighted hypothesis testing in family-based association studies, with an application to a 100K scan [J].
Ionita-Laza, Iuliana ;
McQueen, Matthew B. ;
Laird, Nan M. ;
Lange, Christoph .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 81 (03) :607-614
[8]   A fast method for computing high-significance disease association in large population-based studies [J].
Kimmel, Gad ;
Shamir, Ron .
AMERICAN JOURNAL OF HUMAN GENETICS, 2006, 79 (03) :481-492
[9]   On the replication of genetic associations:: Timing can be everything! [J].
Lasky-Su, Jessica ;
Lyon, Helen N. ;
Emilsson, Valur ;
Heid, Iris M. ;
Molony, Cliona ;
Raby, Benjamin A. ;
Lazarus, Ross ;
Klanderman, Barbara ;
Soto-Quiros, Manuel E. ;
Avila, Lydiana ;
Silverman, Edwin K. ;
Thorleifsson, Gudmar ;
Thorsteinsdottir, Unnur ;
Kronenberg, Florian ;
Vollmert, Caren ;
Illig, Thomas ;
Fox, Caroline S. ;
Levy, Daniel ;
Laird, Nan ;
Ding, Xiao ;
McQueen, Matt B. ;
Butler, Johannah ;
Ardlie, Kristin ;
Papoutsakis, Constantina ;
Dedoussis, George ;
O'Donnell, Christopher J. ;
Wichmann, H. -Erich ;
Celedon, Juan C. ;
Schadt, Eric ;
Hirschhorn, Joel ;
Weiss, Scott T. ;
Stefansson, Kari ;
Lange, Christoph .
AMERICAN JOURNAL OF HUMAN GENETICS, 2008, 82 (04) :849-858
[10]   Genetic analysis of genome-wide variation in human gene expression [J].
Morley, M ;
Molony, CM ;
Weber, TM ;
Devlin, JL ;
Ewens, KG ;
Spielman, RS ;
Cheung, VG .
NATURE, 2004, 430 (7001) :743-747