Machine Learning to Advance Human Genome-Wide Association Studies

Cited by: 5
Authors
Sigala, Rafaella E. [1]
Lagou, Vasiliki [1]
Shmeliov, Aleksey [1]
Atito, Sara [2,3]
Kouchaki, Samaneh [2,3]
Awais, Muhammad [2,3]
Prokopenko, Inga [1,2]
Mahdi, Adam [4]
Demirkan, Ayse [1,2]
Affiliations
[1] Dept Clin & Expt Med, Sect Stat Multiom, Guildford GU2 7XH, Surrey, England
[2] Univ Surrey, Surrey Inst People Centred Artificial Intelligence, Guildford GU2 7XH, Surrey, England
[3] Univ Surrey, Ctr Vis Speech Signal Proc, Guildford GU2 7XH, Surrey, England
[4] Univ Oxford, Oxford Internet Inst, Oxford OX1 3JS, Oxon, England
Keywords
genome-wide association; human genetics; machine learning; RISK PREDICTION; GENE; DISEASE; GWAS; PRIORITIZATION; SCHIZOPHRENIA; DISCOVERY; VARIANTS; OBESITY; FTO
DOI
10.3390/genes15010034
Chinese Library Classification (CLC) number
Q3 [Genetics]
Subject classification number
071007; 090102
Abstract
Machine learning, including deep learning, reinforcement learning, and generative artificial intelligence, is revolutionising every area of our lives when data are made available. With the help of these methods, we can decipher information from larger datasets while addressing the complex nature of biological systems in a more efficient way. Although machine learning methods were introduced to human genetic epidemiological research as early as 2004, they have never been used to their full capacity. In this review, we outline some of the main applications of machine learning to assigning human genetic loci to health outcomes. We summarise widely used methods and discuss their advantages and challenges. We also identify several tools, such as Combi, GenNet, and GMSTool, specifically designed to integrate these methods for hypothesis-free analysis of genetic variation data. We elaborate on the additional value and limitations of these tools from a geneticist's perspective. Finally, we discuss the fast-moving field of foundation models and large multi-modal omics biobank initiatives.
Pages: 18