共 50 条
Machine learning and its applications in plant molecular studies
被引:34
作者:
Sun, Shanwen
[1
]
Wang, Chunyu
[2
]
Ding, Hui
[3
]
Zou, Quan
[1
]
机构:
[1] Univ Elect Sci & Technol China, Inst Fundamental & Frontier Sci, Chengdu 610054, Peoples R China
[2] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin, Peoples R China
[3] Univ Elect Sci & Technol China, Ctr Informat Biol, Chengdu, Peoples R China
基金:
中国国家自然科学基金;
关键词:
supervised machine learning;
unsupervised machine learning;
evaluation metrics;
plants;
genomics;
PRINCIPAL COMPONENT ANALYSIS;
SUBCELLULAR-LOCALIZATION;
CLIMATE-CHANGE;
IDENTIFICATION;
PROTEINS;
GENE;
RESISTANCE;
GENOMICS;
NETWORK;
PREDICTION;
D O I:
10.1093/bfgp/elz036
中图分类号:
Q81 [生物工程学(生物技术)];
Q93 [微生物学];
学科分类号:
071005 ;
0836 ;
090102 ;
100705 ;
摘要:
The advent of high-throughput genomic technologies has resulted in the accumulation of massive amounts of genomic information. However, biologists are challenged with how to effectively analyze these data. Machine learning can provide tools for better and more efficient data analysis. Unfortunately, because many plant biologists are unfamiliar with machine learning, its application in plant molecular studies has been restricted to a few species and a limited set of algorithms. Thus, in this study, we provide the basic steps for developing machine learning frameworks and present a comprehensive overview of machine learning algorithms and various evaluation metrics. Furthermore, we introduce sources of important curated plant genomic data and R packages to enable plant biologists to easily and quickly apply appropriate machine learning algorithms in their research. Finally, we discuss current applications of machine learning algorithms for identifying various genes related to resistance to biotic and abiotic stress. Broad application of machine learning and the accumulation of plant sequencing data will advance plant molecular studies.
引用
收藏
页码:40 / 48
页数:9
相关论文
共 50 条