Gene selection by incorporating genetic networks into case-control association studies

被引:4
|
作者
Cao, Xuewei [1 ]
Liang, Xiaoyu [2 ]
Zhang, Shuanglin [1 ]
Sha, Qiuying [1 ]
机构
[1] Michigan Technol Univ, Dept Math Sci, Houghton, MI 49931 USA
[2] Michigan State Univ, Dept Epidemiol & Biostat, E Lansing, MI USA
关键词
GENOME-WIDE ASSOCIATION; RHEUMATOID-ARTHRITIS; DNA METHYLATION; RISK; REGULARIZATION; POLYMORPHISMS; VARIANTS; LASSO; RARE;
D O I
10.1038/s41431-022-01264-x
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Large-scale genome-wide association studies (GWAS) have been successfully applied to a wide range of genetic variants underlying complex diseases. The network-based regression approach has been developed to incorporate a biological genetic network and to overcome the challenges caused by the computational efficiency for analyzing high-dimensional genomic data. In this paper, we propose a gene selection approach by incorporating genetic networks into case-control association studies for DNA sequence data or DNA methylation data. Instead of using traditional dimension reduction techniques such as principal component analyses and supervised principal component analyses, we use a linear combination of genotypes at SNPs or methylation values at CpG sites in a gene to capture gene-level signals. We employ three linear combination approaches: optimally weighted sum (OWS), beta-based weighted sum (BWS), and LD-adjusted polygenic risk score (LD-PRS). OWS and LD-PRS are supervised approaches that depend on the effect of each SNP or CpG site on the case-control status, while BWS can be extracted without using the case-control status. After using one of the linear combinations of genotypes or methylation values in each gene to capture gene-level signals, we regularize them to perform gene selection based on the biological network. Simulation studies show that the proposed approaches have higher true positive rates than using traditional dimension reduction techniques. We also apply our approaches to DNA methylation data and UK Biobank DNA sequence data for analyzing rheumatoid arthritis. The results show that the proposed methods can select potentially rheumatoid arthritis related genes that are missed by existing methods.
引用
收藏
页码:270 / 277
页数:8
相关论文
共 50 条
  • [21] Evaluation of Public Control Data and Case-control Ratios for Genetic Association Studies
    Adrianto, Indra
    Lessard, Christopher J.
    Adler, Adam
    Kaufman, Kenneth M.
    Moser, Kathy L.
    Gray-McGuire, Courtney
    GENETIC EPIDEMIOLOGY, 2010, 34 (08) : 943 - 944
  • [22] Case-control genetic association studies in gastrointestinal disease: Review and recommendations
    Saito, Yuri A.
    Talley, Nicholas J.
    de Andrade, Mariza
    Petersen, Gloria M.
    AMERICAN JOURNAL OF GASTROENTEROLOGY, 2006, 101 (06): : 1379 - 1389
  • [23] Robust analysis of secondary phenotypes in case-control genetic association studies
    Xing, Chuanhua
    McCarthy, Janice M.
    Dupuis, Josee
    Cupples, L. Adrienne
    Meigs, James B.
    Lin, Xihong
    Allen, Andrew S.
    STATISTICS IN MEDICINE, 2016, 35 (23) : 4226 - 4237
  • [24] A Joint Association Test for Multiple SNPs in Genetic Case-Control Studies
    Wang, Tao
    Jacob, Howard
    Ghosh, Soumitra
    Wang, Xujing
    Zeng, Zhao-Bang
    GENETIC EPIDEMIOLOGY, 2009, 33 (02) : 151 - 163
  • [25] Robust Estimation for Secondary Trait Association in Case-Control Genetic Studies
    Tapsoba, Jean de Dieu
    Kooperberg, Charles
    Reiner, Alexander
    Wang, Ching-Yun
    Dai, James Y.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2014, 179 (10) : 1264 - 1272
  • [26] The impact of diagnostic error on testing genetic association in case-control studies
    Zheng, G
    Tian, X
    STATISTICS IN MEDICINE, 2005, 24 (06) : 869 - 882
  • [27] On an extended interpretation of linkage disequilibrium in genetic case-control association studies
    Dickhaus, Thorsten
    Stange, Jens
    Demirhan, Haydar
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2015, 14 (05) : 497 - 505
  • [28] Multiple hypothesis testing strategies for genetic case-control association studies
    Rosenberg, Philip S.
    Che, Anney
    Chen, Bingshu E.
    STATISTICS IN MEDICINE, 2006, 25 (18) : 3134 - 3149
  • [29] Issues in association analysis: Error control in case-control association studies for disease gene discovery
    Ott, J
    HUMAN HEREDITY, 2004, 58 (3-4) : 171 - 174
  • [30] Candidate gene case-control association studies: advantages and potential pitfalls
    Daly, AK
    Day, CP
    BRITISH JOURNAL OF CLINICAL PHARMACOLOGY, 2001, 52 (05) : 489 - 499