KnockoffHybrid: A knockoff framework for hybrid analysis of trio and population designs in genome-wide association studies

被引:1
作者
Yang, Yi [1 ,2 ]
Wang, Qi [2 ]
Wang, Chen [3 ]
Buxbaum, Joseph [4 ,5 ,6 ]
Ionita-Laza, Iuliana [3 ,7 ]
机构
[1] City Univ Hong Kong, Dept Biostat, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Sch Data Sci, Hong Kong, Peoples R China
[3] Columbia Univ, Dept Biostat, New York, NY 10032 USA
[4] Icahn Sch Med Mt Sinai, Dept Psychiat, New York, NY 10029 USA
[5] Icahn Sch Med Mt Sinai, Dept Neurosci, New York, NY 10029 USA
[6] Icahn Sch Med Mt Sinai, Dept Genet Genom Sci, New York, NY 10029 USA
[7] Lund Univ, Dept Stat, Lund, Sweden
关键词
FAMILY-BASED DESIGNS; GENETIC ASSOCIATION; COMMON ALLELES; LARGE-SCALE; AUTISM; SPECTRUM; STRATIFICATION; DISEQUILIBRIUM; IDENTIFICATION; INDIVIDUALS;
D O I
10.1016/j.ajhg.2024.05.003
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Both trio and population designs are popular study designs for identifying risk genetic variants in genome-wide association studies (GWASs). The trio design, as a family-based design, is robust to confounding due to population structure, whereas the population design is often more powerful due to larger sample sizes. Here, we propose KnockoffHybrid, a knockoff-based statistical method for hybrid analysis of both the trio and population designs. KnockoffHybrid provides a unified framework that brings together the advantages of both designs and produces powerful hybrid analysis while controlling the false discovery rate (FDR) in the presence of linkage disequilibrium and population structure. Furthermore, KnockoffHybrid has the flexibility to leverage different types of summary statistics for hybrid analyses, including expression quantitative trait loci (eQTL) and GWAS summary statistics. We demonstrate in simulations that KnockoffHybrid offers power gains over non-hybrid methods for the trio and population designs with the same number of cases while controlling the FDR with complex correlation among variants and population structure among subjects. In hybrid analyses of three trio cohorts for autism spectrum disorders (ASDs) from the Autism Speaks MSSNG, Autism Sequencing Consortium, and Autism Genome Project with GWAS summary statistics from the iPSYCH project and eQTL summary statistics from the MetaBrain project, KnockoffHybrid outperforms conventional methods by replicating several known risk genes for ASDs and identifying additional associations with variants in other genes, including the PRAME family genes involved in axon guidance and which may act as common targets for human speech/language evolution and related disorders.
引用
收藏
页码:1448 / 1461
页数:15
相关论文
共 72 条
[1]   Whole exome sequencing reveals inherited and de novo variants in autism spectrum disorder: a trio study from Saudi families [J].
Al-Mubarak, Bashayer ;
Abouelhoda, Mohamed ;
Omar, Aisha ;
AlDhalaan, Hesham ;
Aldosari, Mohammed ;
Nester, Michael ;
Alshamrani, Hussain. A. ;
El-Kalioby, Mohamed ;
Goljan, Ewa ;
Albar, Renad ;
Subhani, Shazia ;
Tahir, Asma ;
Asfahani, Sultana ;
Eskandrani, Alaa ;
Almusaiab, Ahmed ;
Magrashi, Amna ;
Shinwari, Jameela ;
Monies, Dorota ;
Al Tassan, Nada .
SCIENTIFIC REPORTS, 2017, 7
[2]   Informative missingness in genetic association studies: Case-parent designs [J].
Allen, AS ;
Rathouz, PJ ;
Satten, GA .
AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 72 (03) :671-680
[3]   A global reference for human genetic variation [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Wang, Jun ;
Wilson, Richard K. ;
Boerwinkle, Eric ;
Doddapaneni, Harsha ;
Han, Yi ;
Korchina, Viktoriya ;
Kovar, Christie ;
Lee, Sandra ;
Muzny, Donna ;
Reid, Jeffrey G. ;
Zhu, Yiming ;
Chang, Yuqi ;
Feng, Qiang ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Lan, Tianming ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Liu, Shengmao ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Tang, Meifang ;
Wang, Bo .
NATURE, 2015, 526 (7571) :68-+
[4]  
American Psychiatric Association, 2013, DIAGN STAT MAN MENT, DOI 10.1176/appi.books.9780890425596
[5]   A genome-wide scan for common alleles affecting risk for autism [J].
Anney, Richard ;
Klei, Lambertus ;
Pinto, Dalila ;
Regan, Regina ;
Conroy, Judith ;
Magalhaes, Tiago R. ;
Correia, Catarina ;
Abrahams, Brett S. ;
Sykes, Nuala ;
Pagnamenta, Alistair T. ;
Almeida, Joana ;
Bacchelli, Elena ;
Bailey, Anthony J. ;
Baird, Gillian ;
Battaglia, Agatino ;
Berney, Tom ;
Bolshakova, Nadia ;
Boelte, Sven ;
Bolton, Patrick F. ;
Bourgeron, Thomas ;
Brennan, Sean ;
Brian, Jessica ;
Carson, Andrew R. ;
Casallo, Guillermo ;
Casey, Jillian ;
Chu, Su H. ;
Cochrane, Lynne ;
Corsello, Christina ;
Crawford, Emily L. ;
Crossett, Andrew ;
Dawson, Geraldine ;
de Jonge, Maretha ;
Delorme, Richard ;
Drmic, Irene ;
Duketis, Eftichia ;
Duque, Frederico ;
Estes, Annette ;
Farrar, Penny ;
Fernandez, Bridget A. ;
Folstein, Susan E. ;
Fombonne, Eric ;
Freitag, Christine M. ;
Gilbert, John ;
Gillberg, Christopher ;
Glessner, Joseph T. ;
Goldberg, Jeremy ;
Green, Jonathan ;
Guter, Stephen J. ;
Hakonarson, Hakon ;
Heron, Elizabeth A. .
HUMAN MOLECULAR GENETICS, 2010, 19 (20) :4072-4082
[6]   Tractor uses local ancestry to enable the inclusion of admixed individuals in GWAS and to boost power [J].
Atkinson, Elizabeth G. ;
Maihofer, Adam X. ;
Kanai, Masahiro ;
Martin, Alicia R. ;
Karczewski, Konrad J. ;
Santoro, Marcos L. ;
Ulirsch, Jacob C. ;
Kamatani, Yoichiro ;
Okada, Yukinori ;
Finucane, Hilary K. ;
Koenen, Karestan C. ;
Nievergelt, Caroline M. ;
Daly, Mark J. ;
Neale, Benjamin M. .
NATURE GENETICS, 2021, 53 (02) :195-+
[7]   Causal inference in genetic trio studies [J].
Bates, Stephen ;
Sesia, Matteo ;
Sabatti, Chiara ;
Candes, Emmanuel .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (39) :24117-24126
[8]   The Role of Copy Number Variations and FHIT Gene on Phenotypic Characteristics of Cases Diagnosed with Autism Spectrum Disorder [J].
Bolat, Gul Unsel ;
Bolat, Hilmi .
MOLECULAR SYNDROMOLOGY, 2021, 12 (01) :12-19
[9]   The Autism Sequencing Consortium: Large-Scale, High-Throughput Sequencing in Autism Spectrum Disorders [J].
Buxbaum, Joseph D. ;
Daly, Mark J. ;
Devlin, Bernie ;
Lehner, Thomas ;
Roeder, Kathryn ;
State, Matthew W. .
NEURON, 2012, 76 (06) :1052-1056
[10]   Panning for gold: "model-X' knockoffs for high dimensional controlled variable selection [J].
Candes, Emmanuel ;
Fan, Yingying ;
Janson, Lucas ;
Lv, Jinchi .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2018, 80 (03) :551-577