Efficient whole-genome association mapping using local phylogenies for unphased genotype data

被引:7
作者
Ding, Zhihong [2 ]
Mailund, Thomas [1 ]
Song, Yun S. [3 ,4 ]
机构
[1] Univ Aarhus, Bioinformat Res Ctr, DK-8000 Aarhus C, Denmark
[2] Univ Calif Davis, Dept Comp Sci, Davis, CA 95616 USA
[3] Univ Calif Berkeley, Div Comp Sci, Berkeley, CA 94720 USA
[4] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
基金
美国国家科学基金会;
关键词
D O I
10.1093/bioinformatics/btn406
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Recent advances in genotyping technology has made data acquisition for whole-genome association study cost effective, and a current active area of research is developing efficient methods to analyze such large-scale datasets. Most sophisticated association mapping methods that are currently available take phased haplotype data as input. However, phase information is not readily available from sequencing methods and inferring the phase via computational approaches is time-consuming, taking days to phase a single chromosome. Results: In this article, we devise an efficient method for scanning unphased whole-genome data for association. Our approach combines a recently found linear-time algorithm for phasing genotypes on trees with a recently proposed tree-based method for association mapping. From unphased genotype data, our algorithm builds local phylogenies along the genome, and scores each tree according to the clustering of cases and controls. We assess the performance of our new method on both simulated and real biological datasets.
引用
收藏
页码:2215 / 2221
页数:7
相关论文
共 26 条
[1]   A common variant associated with prostate cancer in European and African populations [J].
Amundadottir, Laufey T. ;
Sulem, Patrick ;
Gudmundsson, Julius ;
Helgason, Agnar ;
Baker, Adam ;
Agnarsson, Bjarni A. ;
Sigurdsson, Asgeir ;
Benediktsdottir, Kristrun R. ;
Cazier, Jean-Baptiste ;
Sainz, Jesus ;
Jakobsdottir, Margret ;
Kostic, Jelena ;
Magnusdottir, Droplaug N. ;
Ghosh, Shyamali ;
Agnarsson, Kari ;
Birgisdottir, Birgitta ;
Le Roux, Louise ;
Olafsdottir, Adalheidur ;
Blondal, Thorarinn ;
Andresdottir, Margret ;
Gretarsdottir, Olafia Svandis ;
Bergthorsson, Jon T. ;
Gudbjartsson, Daniel ;
Gylfason, Arnaldur ;
Thorleifsson, Gudmar ;
Manolescu, Andrei ;
Kristjansson, Kristleifur ;
Geirsson, Gudmundur ;
Isaksson, Helgi ;
Douglas, Julie ;
Johansson, Jan-Erik ;
Balter, Katarina ;
Wiklund, Fredrik ;
Montie, James E. ;
Yu, Xiaoying ;
Suarez, Brian K. ;
Ober, Carole ;
Cooney, Kathleen A. ;
Gronberg, Henrik ;
Catalona, William J. ;
Einarsson, Gudmundur V. ;
Barkardottir, Rosa B. ;
Gulcher, Jeffrey R. ;
Kong, Augustine ;
Thorsteinsdottir, Unnur ;
Stefansson, Kari .
NATURE GENETICS, 2006, 38 (06) :652-658
[2]   A common genetic variant in the NOS1 regulator NOS1AP modulates cardiac repolarization [J].
Arking, Dan E. ;
Pfeufer, Arne ;
Post, Wendy ;
Kao, W. H. Linda ;
Newton-Cheh, Christopher ;
Ikeda, Morna ;
West, Kristen ;
Kashuk, Carl ;
Akyol, Mahmut ;
Perz, Siegfried ;
Jalilzadeh, Shapour ;
Illig, Thomas ;
Gieger, Christian ;
Guo, Chao-Yu ;
Larson, Martin G. ;
Wichmann, H. Erich ;
Marban, Eduardo ;
O'Donnell, Christopher J. ;
Hirschhorn, Joel N. ;
Kaeaeb, Stefan ;
Spooner, Peter M. ;
Meitinger, Thomas ;
Chakravarti, Aravinda .
NATURE GENETICS, 2006, 38 (06) :644-651
[3]   Evaluating coverage of genome-wide association studies [J].
Barrett, Jeffrey C. ;
Cardon, Lon R. .
NATURE GENETICS, 2006, 38 (06) :659-662
[4]   Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering [J].
Browning, Sharon R. ;
Browning, Brian L. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 81 (05) :1084-1097
[5]   Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls [J].
Burton, Paul R. ;
Clayton, David G. ;
Cardon, Lon R. ;
Craddock, Nick ;
Deloukas, Panos ;
Duncanson, Audrey ;
Kwiatkowski, Dominic P. ;
McCarthy, Mark I. ;
Ouwehand, Willem H. ;
Samani, Nilesh J. ;
Todd, John A. ;
Donnelly, Peter ;
Barrett, Jeffrey C. ;
Davison, Dan ;
Easton, Doug ;
Evans, David ;
Leung, Hin-Tak ;
Marchini, Jonathan L. ;
Morris, Andrew P. ;
Spencer, Chris C. A. ;
Tobin, Martin D. ;
Attwood, Antony P. ;
Boorman, James P. ;
Cant, Barbara ;
Everson, Ursula ;
Hussey, Judith M. ;
Jolley, Jennifer D. ;
Knight, Alexandra S. ;
Koch, Kerstin ;
Meech, Elizabeth ;
Nutland, Sarah ;
Prowse, Christopher V. ;
Stevens, Helen E. ;
Taylor, Niall C. ;
Walters, Graham R. ;
Walker, Neil M. ;
Watkins, Nicholas A. ;
Winzer, Thilo ;
Jones, Richard W. ;
McArdle, Wendy L. ;
Ring, Susan M. ;
Strachan, David P. ;
Pembrey, Marcus ;
Breen, Gerome ;
St Clair, David ;
Caesar, Sian ;
Gordon-Smith, Katherine ;
Jones, Lisa ;
Fraser, Christine ;
Green, Elain K. .
NATURE, 2007, 447 (7145) :661-678
[6]   Bayesian logistic regression using a perfect phylogeny [J].
Clark, Taane G. ;
De Iorio, Maria ;
Griffiths, Robert C. .
BIOSTATISTICS, 2007, 8 (01) :32-52
[7]   A linear-time algorithm for the Perfect Phylogeny Haplotyping (PPH) problem [J].
Ding, ZH ;
Filkov, V ;
Gusfield, D .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2006, 13 (02) :522-553
[8]  
Ding ZH, 2005, LECT NOTES COMPUT SC, V3500, P585
[9]   Genome-wide genotyping in Parkinson's disease and neurologically normal controls:: first stage analysis and public release of data [J].
Fung, Hon-Chung ;
Scholz, Sonja ;
Matarin, Mar ;
Simon-Sanchez, Javier ;
Hernandez, Dena ;
Britton, Angela ;
Gibbs, J. Raphael ;
Langefeld, Carl ;
Stiegert, Matt L. ;
Schymick, Jennifer ;
Okun, Michael S. ;
Mandel, Ronald J. ;
Fernandez, Hubert H. ;
Foote, Kelly D. ;
Rodriguez, Ramon L. ;
Peckham, Elizabeth ;
De Vrieze, Fabienne Wavrant ;
Gwinn-Hardy, Katrina ;
Hardy, John A. ;
Singleton, Andrew .
LANCET NEUROLOGY, 2006, 5 (11) :911-916
[10]   Highly scalable genotype phasing by entropy minimization [J].
Gusev, Alexander ;
Mandoiu, Ion I. ;
Pasaniuc, Bogdan .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2008, 5 (02) :252-261