Genome-wide association studies of ischemic stroke based on interpretable machine learning

被引:0
作者
Nikoli, Stefan [1 ]
Ignatov, Dmitry I. [1 ]
Khvorykh, Gennady, V [2 ]
Limborska, Svetlana A. [2 ]
Khrunin, Andrey, V [2 ]
机构
[1] HSE Univ, Lab Models & Methods Computat Pragmat, Dept Data Anal & Artificial Intelligence, Moscow, Russia
[2] Natl Res Ctr Kurchatov Inst, Moscow, Russia
基金
俄罗斯科学基金会;
关键词
Genome-wide association studies; Interpretable machine learning; Ischemic stroke; Illuminating druggable genome; XGBoost; Interpretable neural network TabNet; SNP ranking; SNP importance; OXIDATIVE STRESS; DISEASE; RISK; GENE; PROTEINS; LOCI;
D O I
10.7717/peerj-cs.2454
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the identification of several dozen genetic loci associated with ischemic stroke (IS), the genetic bases of this disease remain largely unexplored. In this research we present the results of genome-wide association studies (GWAS) based on classical statistical testing and machine learning algorithms (logistic regression, gradient boosting on decision trees, and tabular deep learning model TabNet). To build a consensus on the results obtained by different techniques, the Pareto-Optimal solution was proposed and applied. These methods were applied to real genotypic data of sick and healthy individuals of European ancestry obtained from the Database of Genotypes and Phenotypes (5,581 individuals, 883,749 single nucleotide polymorphisms). Finally, 131 genes were identified as candidates for association with the onset of IS. UBQLN1, TRPS1, and MUSK were previously described as associated with the course of IS in model animals. ACOT11 taking part in metabolism of fatty acids was shown for the first time to be associated with IS. The identified genes were compared with genes from the Illuminating Druggable Genome project. The product of GPR26 representing the G-coupled protein receptor can be considered as a therapeutic target for stroke prevention. The approaches presented in this research can be used to reprocess GWAS datasets from other diseases.
引用
收藏
页数:26
相关论文
共 50 条
  • [21] Genome-Wide Association Studies in Pediatric Endocrinology
    Dauber, Andrew
    Hirschhorn, Joel N.
    HORMONE RESEARCH IN PAEDIATRICS, 2011, 75 (05): : 322 - 328
  • [22] Genome-wide association studies for discovery of genes involved in asthma
    Akhabir, Loubna
    Sandford, Andrew J.
    RESPIROLOGY, 2011, 16 (03) : 396 - 406
  • [23] Pitfalls in performing genome-wide association studies on ratio traits
    Mccaw, Zachary R.
    Dey, Rounak
    Somineni, Hari
    Amar, David
    Mukherjee, Sumit
    Sandor, Kaitlin
    Karaletsos, Theofanis
    Koller, Daphne
    Aschard, Hugues
    Smith, George Davey
    Macarthur, Daniel
    O'Dushlaine, Colm
    Soare, Thomas W.
    HUMAN GENETICS AND GENOMICS ADVANCES, 2025, 6 (02):
  • [24] Genome-wide association studies in asthma; perhaps, the end of the beginning
    Lockett, Gabrielle A.
    Holloway, John W.
    CURRENT OPINION IN ALLERGY AND CLINICAL IMMUNOLOGY, 2013, 13 (05) : 463 - 469
  • [25] Genetic basis of lacunar stroke: a pooled analysis of individual patient data and genome-wide association studies
    Traylor, Matthew
    Persyn, Elodie
    Tomppo, Liisa
    Klasson, Sofia
    Abedi, Vida
    Bakker, Mark K.
    Torres, Nuria
    Li, Linxin
    Bell, Steven
    Rutten-Jacobs, Loes
    Tozer, Daniel J.
    Griessenauer, Christoph J.
    Zhang, Yanfei
    Pedersen, Annie
    Sharma, Pankaj
    Jimenez-Conde, Jordi
    Rundek, Tatjana
    Grewal, Raji P.
    Lindgren, Arne
    Meschia, James F.
    Salomaa, Veikko
    Havulinna, Aki
    Kourkoulis, Christina
    Crawford, Katherine
    Marini, Sandro
    Mitchell, Braxton D.
    Kittner, Steven J.
    Rosand, Jonathan
    Dichgans, Martin
    Jern, Christina
    Strbian, Daniel
    Fernandez-Cadenas, Israel
    Zand, Ramin
    Ruigrok, Ynte
    Rost, Natalia
    Lemmens, Robin
    Rothwell, Peter M.
    Anderson, Christopher D.
    Wardlaw, Joanna
    Lewis, Cathryn M.
    Markus, Hugh S.
    LANCET NEUROLOGY, 2021, 20 (05) : 351 - 361
  • [26] Genome-wide association studies in melanoma: off to a good start
    Kim, Hye Kyung
    Chanock, Stephen J.
    PIGMENT CELL & MELANOMA RESEARCH, 2012, 25 (02)
  • [27] New Distance-Based approach for Genome-Wide Association Studies
    Irigoien, Itziar
    Cormand, Bru
    Soler-Artigas, Maria
    Sanchez-Mora, Cristina
    Ramos-Quiroga, Josep-Antoni
    Arenas, Concepcion
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (05) : 2938 - 2949
  • [28] The combined effect of SNP-marker and phenotype attributes in genome-wide association studies
    Chan, E. K. F.
    Hawken, R.
    Reverter, A.
    ANIMAL GENETICS, 2009, 40 (02) : 149 - 156
  • [29] SNP Set Association Analysis for Genome-Wide Association Studies
    Cai, Min
    Dai, Hui
    Qiu, Yongyong
    Zhao, Yang
    Zhang, Ruyang
    Chu, Minjie
    Dai, Juncheng
    Hu, Zhibin
    Shen, Hongbing
    Chen, Feng
    PLOS ONE, 2013, 8 (05):
  • [30] Machine-Learning-Based Genome-Wide Association Studies for Uncovering QTL Underlying Soybean Yield and Its Components
    Yoosefzadeh-Najafabadi, Mohsen
    Eskandari, Milad
    Torabi, Sepideh
    Torkamaneh, Davoud
    Tulpan, Dan
    Rajcan, Istvan
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2022, 23 (10)