Machine learning and its applications in plant molecular studies

被引:34
|
作者
Sun, Shanwen [1 ]
Wang, Chunyu [2 ]
Ding, Hui [3 ]
Zou, Quan [1 ]
机构
[1] Univ Elect Sci & Technol China, Inst Fundamental & Frontier Sci, Chengdu 610054, Peoples R China
[2] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin, Peoples R China
[3] Univ Elect Sci & Technol China, Ctr Informat Biol, Chengdu, Peoples R China
基金
中国国家自然科学基金;
关键词
supervised machine learning; unsupervised machine learning; evaluation metrics; plants; genomics; PRINCIPAL COMPONENT ANALYSIS; SUBCELLULAR-LOCALIZATION; CLIMATE-CHANGE; IDENTIFICATION; PROTEINS; GENE; RESISTANCE; GENOMICS; NETWORK; PREDICTION;
D O I
10.1093/bfgp/elz036
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The advent of high-throughput genomic technologies has resulted in the accumulation of massive amounts of genomic information. However, biologists are challenged with how to effectively analyze these data. Machine learning can provide tools for better and more efficient data analysis. Unfortunately, because many plant biologists are unfamiliar with machine learning, its application in plant molecular studies has been restricted to a few species and a limited set of algorithms. Thus, in this study, we provide the basic steps for developing machine learning frameworks and present a comprehensive overview of machine learning algorithms and various evaluation metrics. Furthermore, we introduce sources of important curated plant genomic data and R packages to enable plant biologists to easily and quickly apply appropriate machine learning algorithms in their research. Finally, we discuss current applications of machine learning algorithms for identifying various genes related to resistance to biotic and abiotic stress. Broad application of machine learning and the accumulation of plant sequencing data will advance plant molecular studies.
引用
收藏
页码:40 / 48
页数:9
相关论文
共 50 条
  • [21] Applications of Machine Learning Methods in the Studies of Polymer Glass Formation
    Yang, Zhen-yue
    Nie, Wen-jian
    Liu, Lun-yang
    Xu, Xiao-lei
    Xia, Wen-jie
    Xu, Wen-sheng
    ACTA POLYMERICA SINICA, 2023, 54 (04): : 432 - 450
  • [22] A review of machine learning applications in life cycle assessment studies
    Romeiko, Xiaobo Xue
    Zhang, Xuesong
    Pang, Yulei
    Gao, Feng
    Xu, Ming
    Lin, Shao
    Babbitt, Callie
    SCIENCE OF THE TOTAL ENVIRONMENT, 2024, 912
  • [23] Machine Learning and Its Applications in Studying the Geographical Distribution of Ants
    Chen, Shan
    Ding, Yuanzhao
    DIVERSITY-BASEL, 2022, 14 (09):
  • [24] An Optical Communication's Perspective on Machine Learning and Its Applications
    Khan, Faisal Nadeem
    Fan, Qirui
    Lu, Chao
    Lau, Alan Pak Tao
    JOURNAL OF LIGHTWAVE TECHNOLOGY, 2019, 37 (02) : 493 - 516
  • [25] Enhancement of Plant Metabolite Fingerprinting by Machine Learning
    Scott, Ian M.
    Vermeer, Cornelia P.
    Liakata, Maria
    Corol, Delia I.
    Ward, Jane L.
    Lin, Wanchang
    Johnson, Helen E.
    Whitehead, Lynne
    Kular, Baldeep
    Baker, John M.
    Walsh, Sean
    Dave, Anuja
    Larson, Tony R.
    Graham, Ian A.
    Wang, Trevor L.
    King, Ross D.
    Draper, John
    Beale, Michael H.
    PLANT PHYSIOLOGY, 2010, 153 (04) : 1506 - 1520
  • [26] Machine learning applications in cascading failure analysis in power systems: A review
    Sami, Naeem Md
    Naeini, Mia
    ELECTRIC POWER SYSTEMS RESEARCH, 2024, 232
  • [27] Galaxy tools and workflows for sequence analysis with applications in molecular plant pathology
    Cock, Peter J. A.
    Gruening, Bjoern A.
    Paszkiewicz, Konrad
    Pritchard, Leighton
    PEERJ, 2013, 1
  • [28] Identifying Molecular Biomarkers for Diseases With Machine Learning Based on Integrative Omics
    Shi, Kai
    Lin, Wei
    Zhao, Xing-Ming
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (06) : 2514 - 2525
  • [29] Machine-Learning-Based Genome-Wide Association Studies for Uncovering QTL Underlying Soybean Yield and Its Components
    Yoosefzadeh-Najafabadi, Mohsen
    Eskandari, Milad
    Torabi, Sepideh
    Torkamaneh, Davoud
    Tulpan, Dan
    Rajcan, Istvan
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2022, 23 (10)
  • [30] A primer on machine learning techniques for genomic applications
    Monaco, Alfonso
    Pantaleo, Ester
    Amoroso, Nicola
    Lacalamita, Antonio
    Lo Giudice, Claudio
    Fonzino, Adriano
    Fosso, Bruno
    Picardi, Ernesto
    Tangaro, Sabina
    Pesole, Graziano
    Bellotti, Roberto
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 4345 - 4359