Genome-wide association studies of ischemic stroke based on interpretable machine learning

被引:0
作者
Nikoli, Stefan [1 ]
Ignatov, Dmitry I. [1 ]
Khvorykh, Gennady, V [2 ]
Limborska, Svetlana A. [2 ]
Khrunin, Andrey, V [2 ]
机构
[1] HSE Univ, Lab Models & Methods Computat Pragmat, Dept Data Anal & Artificial Intelligence, Moscow, Russia
[2] Natl Res Ctr Kurchatov Inst, Moscow, Russia
基金
俄罗斯科学基金会;
关键词
Genome-wide association studies; Interpretable machine learning; Ischemic stroke; Illuminating druggable genome; XGBoost; Interpretable neural network TabNet; SNP ranking; SNP importance; OXIDATIVE STRESS; DISEASE; RISK; GENE; PROTEINS; LOCI;
D O I
10.7717/peerj-cs.2454
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the identification of several dozen genetic loci associated with ischemic stroke (IS), the genetic bases of this disease remain largely unexplored. In this research we present the results of genome-wide association studies (GWAS) based on classical statistical testing and machine learning algorithms (logistic regression, gradient boosting on decision trees, and tabular deep learning model TabNet). To build a consensus on the results obtained by different techniques, the Pareto-Optimal solution was proposed and applied. These methods were applied to real genotypic data of sick and healthy individuals of European ancestry obtained from the Database of Genotypes and Phenotypes (5,581 individuals, 883,749 single nucleotide polymorphisms). Finally, 131 genes were identified as candidates for association with the onset of IS. UBQLN1, TRPS1, and MUSK were previously described as associated with the course of IS in model animals. ACOT11 taking part in metabolism of fatty acids was shown for the first time to be associated with IS. The identified genes were compared with genes from the Illuminating Druggable Genome project. The product of GPR26 representing the G-coupled protein receptor can be considered as a therapeutic target for stroke prevention. The approaches presented in this research can be used to reprocess GWAS datasets from other diseases.
引用
收藏
页数:26
相关论文
共 50 条
  • [31] Genome-Wide Association Studies of Autism
    Glessner J.T.
    Connolly J.J.
    Hakonarson H.
    Current Behavioral Neuroscience Reports, 2014, 1 (4) : 234 - 241
  • [32] Genome-Wide Association Studies and Diet
    Ferguson, Lynnette R.
    JOURNAL OF NUTRIGENETICS AND NUTRIGENOMICS, 2010, 3 (4-6) : 144 - 150
  • [33] Genome-wide association studies: a primer
    Corvin, A.
    Craddock, N.
    Sullivan, P. F.
    PSYCHOLOGICAL MEDICINE, 2010, 40 (07) : 1063 - 1077
  • [34] Genome-wide association studies in mice
    Flint, Jonathan
    Eskin, Eleazar
    NATURE REVIEWS GENETICS, 2012, 13 (11) : 807 - 817
  • [35] Genome-Wide Association Studies in Hepatology
    Weber, S.
    Gruenhage, F.
    Hall, R.
    Lammert, F.
    ZEITSCHRIFT FUR GASTROENTEROLOGIE, 2010, 48 (01): : 56 - 64
  • [36] Identification of novel biomarkers in ischemic stroke: a genome-wide integrated analysis
    Xie, Qizhi
    Zhang, Xiaoyun
    Peng, Sijia
    Sun, Jingjing
    Chen, Xiao
    Deng, Yuanfei
    Yi, Li
    BMC MEDICAL GENETICS, 2020, 21 (01)
  • [37] A genome-wide association study links small-vessel ischemic stroke to autophagy
    Lee, Tsong-Hai
    Ko, Tai-Ming
    Chen, Chien-Hsiun
    Chang, Yeu-Jhy
    Lu, Liang-Suei
    Chang, Chien-Hung
    Huang, Kuo-Lun
    Chang, Ting-Yu
    Lee, Jiann-Der
    Chang, Ku-Chou
    Yang, Jen-Tsung
    Wen, Ming-Shien
    Wang, Chao-Yung
    Chen, Ying-Ting
    Chen, Tsai-Chuan
    Chou, Shu-Yu
    Lee, Ming-Ta Michael
    Chen, Yuan-Tsong
    Wu, Jer-Yuarn
    SCIENTIFIC REPORTS, 2017, 7
  • [38] Genetics of coronary artery disease in the light of genome-wide association studies
    Schunkert, Heribert
    von Scheidt, Moritz
    Kessler, Thorsten
    Stiller, Barbara
    Zeng, Lingyao
    Vilne, Baiba
    CLINICAL RESEARCH IN CARDIOLOGY, 2018, 107 : S2 - S9
  • [39] Computer vision and machine learning for robust phenotyping in genome-wide studies
    Zhang, Jiaoping
    Naik, Hsiang Sing
    Assefa, Teshale
    Sarkar, Soumik
    Reddy, R. V. Chowda
    Singh, Arti
    Ganapathysubramanian, Baskar
    Singh, Asheesh K.
    SCIENTIFIC REPORTS, 2017, 7
  • [40] Strategies and issues in the detection of pathway enrichment in genome-wide association studies
    Hong, Mun-Gwan
    Pawitan, Yudi
    Magnusson, Patrik K. E.
    Prince, Jonathan A.
    HUMAN GENETICS, 2009, 126 (02) : 289 - 301