machine learning;
gene-gene interaction;
random forest;
support vector machines;
multifactor-dimensionality reduction;
genome-wide association study;
MULTIFACTOR-DIMENSIONALITY REDUCTION;
STATISTICAL EPISTASIS;
NEURAL-NETWORKS;
RANDOM FORESTS;
OPTIMIZATION;
SUSCEPTIBILITY;
ASSOCIATION;
CHALLENGES;
STRATEGIES;
POWER;
D O I:
10.1093/bib/bbs024
中图分类号:
Q5 [生物化学];
学科分类号:
071010 ;
081704 ;
摘要:
Because of the complexity of gene-phenotype relationships machine learning approaches have considerable appeal as a strategy for modelling interactions. A number of such methods have been developed and applied in recent years with some modest success. Progress is hampered by the challenges presented by the complexity of the disease genetic data, including phenotypic and genetic heterogeneity, polygenic forms of inheritance and variable penetrance, combined with the analytical and computational issues arising from the enormous number of potential interactions. We review here recent and current approaches focusing, wherever possible, on applications to real data (particularly in the context of genome-wide association studies) and looking ahead to the further challenges posed by next generation sequencing data.
引用
收藏
页码:251 / 260
页数:10
相关论文
共 42 条
[1]
Bi J., 2003, Journal of Machine Learning Research, V3, P1229, DOI 10.1162/153244303322753643
机构:
Univ Newcastle, Inst Human Genet, Int Ctr Life, Newcastle Upon Tyne NE1 3BZ, Tyne & Wear, EnglandUniv Newcastle, Inst Human Genet, Int Ctr Life, Newcastle Upon Tyne NE1 3BZ, Tyne & Wear, England
机构:
Univ Newcastle, Inst Human Genet, Int Ctr Life, Newcastle Upon Tyne NE1 3BZ, Tyne & Wear, EnglandUniv Newcastle, Inst Human Genet, Int Ctr Life, Newcastle Upon Tyne NE1 3BZ, Tyne & Wear, England