Efficient Replication of over 180 Genetic Associations with Self-Reported Medical Data

被引:93
作者
Tung, Joyce Y. [1 ]
Do, Chuong B. [1 ]
Hinds, David A. [1 ]
Kiefer, Amy K. [1 ]
Macpherson, J. Michael [1 ]
Chowdry, Arnab B. [1 ]
Francke, Uta [1 ,2 ]
Naughton, Brian T. [1 ]
Mountain, Joanna L. [1 ]
Wojcicki, Anne [1 ]
Eriksson, Nicholas [1 ]
机构
[1] 23andMe Inc, Mountain View, CA 94043 USA
[2] Stanford Univ, Dept Genet, Stanford, CA 94305 USA
关键词
GENOME-WIDE ASSOCIATION; RHEUMATOID-ARTHRITIS; VALIDATION; DISEASE; QUESTIONNAIRE; LOCI; PREVALENCE; VARIANTS; STROKE; ONSET;
D O I
10.1371/journal.pone.0023473
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
While the cost and speed of generating genomic data have come down dramatically in recent years, the slow pace of collecting medical data for large cohorts continues to hamper genetic research. Here we evaluate a novel online framework for obtaining large amounts of medical information from a recontactable cohort by assessing our ability to replicate genetic associations using these data. Using web-based questionnaires, we gathered self-reported data on 50 medical phenotypes from a generally unselected cohort of over 20,000 genotyped individuals. Of a list of genetic associations curated by NHGRI, we successfully replicated about 75% of the associations that we expected to (based on the number of cases in our cohort and reported odds ratios, and excluding a set of associations with contradictory published evidence). Altogether we replicated over 180 previously reported associations, including many for type 2 diabetes, prostate cancer, cholesterol levels, and multiple sclerosis. We found significant variation across categories of conditions in the percentage of expected associations that we were able to replicate, which may reflect systematic inflation of the effects in some initial reports, or differences across diseases in the likelihood of misdiagnosis or misreport. We also demonstrated that we could improve replication success by taking advantage of our recontactable cohort, offering more in-depth questions to refine self-reported diagnoses. Our data suggest that online collection of self-reported data from a recontactable cohort may be a viable method for both broad and deep phenotyping in large populations.
引用
收藏
页数:9
相关论文
共 48 条
[1]   Genome-wide association study of ulcerative colitis identifies three new susceptibility loci, including the HNF4A region [J].
Barrett, Jeffrey C. ;
Lee, James C. ;
Lees, Charles W. ;
Prescott, Natalie J. ;
Anderson, Carl A. ;
Phillips, Anne ;
Wesley, Emma ;
Parnell, Kirstie ;
Zhang, Hu ;
Drummond, Hazel ;
Nimmo, Elaine R. ;
Massey, Dunecan ;
Blaszczyk, Kasia ;
Elliott, Timothy ;
Cotterill, Lynn ;
Dallal, Helen ;
Lobo, Alan J. ;
Mowat, Craig ;
Sanderson, Jeremy D. ;
Jewell, Derek P. ;
Newman, William G. ;
Edwards, Cathryn ;
Ahmad, Tariq ;
Mansfield, John C. ;
Satsangi, Jack ;
Parkes, Miles ;
Mathew, Christopher G. ;
Donnelly, Peter ;
Peltonen, Leena ;
Blackwell, Jenefer M. ;
Bramon, Elvira ;
Brown, Matthew A. ;
Casas, Juan P. ;
Corvin, Aiden ;
Craddock, Nicholas ;
Deloukas, Panos ;
Duncanson, Audrey ;
Jankowski, Janusz ;
Markus, Hugh S. ;
McCarthy, Mark I. ;
Palmer, Colin N. A. ;
Plomin, Robert ;
Rautanen, Anna ;
Sawcer, Stephen J. ;
Samani, Nilesh ;
Trembath, Richard C. ;
Viswanathan, Ananth C. ;
Wood, Nicholas ;
Spencer, Chris C. A. ;
Bellenguez, Celine .
NATURE GENETICS, 2009, 41 (12) :1330-U99
[2]   A genome-wide association study of alcohol dependence [J].
Bierut, Laura J. ;
Agrawal, Arpana ;
Bucholz, Kathleen K. ;
Doheny, Kimberly F. ;
Laurie, Cathy ;
Pugh, Elizabeth ;
Fisher, Sherri ;
Fox, Louis ;
Howells, William ;
Bertelsen, Sarah ;
Hinrichs, Anthony L. ;
Almasy, Laura ;
Breslau, Naomi ;
Culverhouse, Robert C. ;
Dick, Danielle M. ;
Edenberg, Howard J. ;
Foroud, Tatiana ;
Grucza, Richard A. ;
Hatsukami, Dorothy ;
Hesselbrock, Victor ;
Johnson, Eric O. ;
Kramer, John ;
Krueger, Robert F. ;
Kuperman, Samuel ;
Lynskey, Michael ;
Mann, Karl ;
Neuman, Rosalind J. ;
Noethen, Markus M. ;
Nurnberger, John I., Jr. ;
Porjesz, Bernice ;
Ridinger, Monika ;
Saccone, Nancy L. ;
Saccone, Scott F. ;
Schuckit, Marc A. ;
Tischfield, Jay A. ;
Wang, Jen C. ;
Rietschel, Marcella ;
Goate, Alison M. ;
Rice, John P. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (11) :5082-5087
[3]   PHENOMICS: THE SYSTEMATIC STUDY OF PHENOTYPES ON A GENOME-WIDE SCALE [J].
Bilder, R. M. ;
Sabb, F. W. ;
Cannon, T. D. ;
London, E. D. ;
Jentsch, J. D. ;
Parker, D. Stott ;
Poldrack, R. A. ;
Evans, C. ;
Freimer, N. B. .
NEUROSCIENCE, 2009, 164 (01) :30-42
[4]   VALIDATION OF INTERVIEW-BASED DISEASE CLASSIFICATIONS - MAIL SURVEY OF PHYSICIANS [J].
BURGESS, AM ;
MARTEL, MU ;
WYMAN, DK .
JOURNAL OF CHRONIC DISEASES, 1971, 24 (01) :45-&
[5]   Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls [J].
Burton, Paul R. ;
Clayton, David G. ;
Cardon, Lon R. ;
Craddock, Nick ;
Deloukas, Panos ;
Duncanson, Audrey ;
Kwiatkowski, Dominic P. ;
McCarthy, Mark I. ;
Ouwehand, Willem H. ;
Samani, Nilesh J. ;
Todd, John A. ;
Donnelly, Peter ;
Barrett, Jeffrey C. ;
Davison, Dan ;
Easton, Doug ;
Evans, David ;
Leung, Hin-Tak ;
Marchini, Jonathan L. ;
Morris, Andrew P. ;
Spencer, Chris C. A. ;
Tobin, Martin D. ;
Attwood, Antony P. ;
Boorman, James P. ;
Cant, Barbara ;
Everson, Ursula ;
Hussey, Judith M. ;
Jolley, Jennifer D. ;
Knight, Alexandra S. ;
Koch, Kerstin ;
Meech, Elizabeth ;
Nutland, Sarah ;
Prowse, Christopher V. ;
Stevens, Helen E. ;
Taylor, Niall C. ;
Walters, Graham R. ;
Walker, Neil M. ;
Watkins, Nicholas A. ;
Winzer, Thilo ;
Jones, Richard W. ;
McArdle, Wendy L. ;
Ring, Susan M. ;
Strachan, David P. ;
Pembrey, Marcus ;
Breen, Gerome ;
St Clair, David ;
Caesar, Sian ;
Gordon-Smith, Katherine ;
Jones, Lisa ;
Fraser, Christine ;
Green, Elain K. .
NATURE, 2007, 447 (7145) :661-678
[6]   Variants in the 5q31 cytokine gene cluster are associated with psoriasis [J].
Chang, M. ;
Li, Y. ;
Yan, C. ;
Callis-Duffin, K. P. ;
Matsunami, N. ;
Garcia, V. E. ;
Cargill, M. ;
Civello, D. ;
Bui, N. ;
Catanese, J. J. ;
Leppert, M. F. ;
Krueger, G. G. ;
Begovich, A. B. ;
Schrodi, S. J. .
GENES AND IMMUNITY, 2008, 9 (02) :176-181
[7]   VALIDATION OF QUESTIONNAIRE INFORMATION ON RISK-FACTORS AND DISEASE OUTCOMES IN A PROSPECTIVE COHORT STUDY OF WOMEN [J].
COLDITZ, GA ;
MARTIN, P ;
STAMPFER, MJ ;
WILLETT, WC ;
SAMPSON, L ;
ROSNER, B ;
HENNEKENS, CH ;
SPEIZER, FE .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1986, 123 (05) :894-900
[8]  
DO CB, 2011, PLOS GENETI IN PRESS
[9]   Web-Based, Participant-Driven Studies Yield Novel Genetic Associations for Common Traits [J].
Eriksson, Nicholas ;
Macpherson, J. Michael ;
Tung, Joyce Y. ;
Hon, Lawrence S. ;
Naughton, Brian ;
Saxonov, Serge ;
Avey, Linda ;
Wojcicki, Anne ;
Pe'er, Itsik ;
Mountain, Joanna .
PLOS GENETICS, 2010, 6 (06) :1-20
[10]   Prevalence of celiac disease in at-risk and not-at-risk groups in the United States - A large multicenter study [J].
Fasano, A ;
Berti, I ;
Gerarduzzi, T ;
Not, T ;
Colletti, RB ;
Drago, S ;
Elitsur, Y ;
Green, PHR ;
Guandalini, S ;
Hill, ID ;
Pietzak, M ;
Ventura, A ;
Thorpe, M ;
Kryszak, D ;
Fornaroli, F ;
Wasserman, SS ;
Murray, JA ;
Horvath, K .
ARCHIVES OF INTERNAL MEDICINE, 2003, 163 (03) :286-292