ModelRevelator: Fast phylogenetic model estimation via deep learning

被引:9
|
作者
Burgstaller-Muehlbacher, Sebastian [1 ,2 ]
Crotty, Stephen M. [3 ,4 ]
Schmidt, Heiko A. [1 ,2 ]
Reden, Franziska [1 ,2 ]
Drucks, Tamara [1 ,2 ,6 ]
von Haeseler, Arndt [1 ,2 ,5 ]
机构
[1] Univ Vienna, Max Perutz Labs, Ctr Integrat Bioinformat Vienna, A-1030 Vienna, Austria
[2] Med Univ Vienna, Vienna Bioctr VBC 5, A-1030 Vienna, Austria
[3] Univ Adelaide, Sch Math Sci, Adelaide, SA 5005, Australia
[4] Univ Adelaide, ARC Ctr Excellence Math & Stat Frontiers, Adelaide, SA 5005, Australia
[5] Univ Vienna, Fac Comp Sci, Bioinformat & Computat Biol, Waehringer Str 29, A-1090 Vienna, Austria
[6] TU Wien, Res Unit Machine Learning, A-1040 Vienna, Austria
关键词
Phylogenetic model estimation; Deep learning; Artificial intelligence; Phylogenetics; Phylogenomics; DNA-SEQUENCES; SELECTION; SUBSTITUTIONS; SIMULATION; JMODELTEST; EVOLUTION; PROTEIN; SITES; RATES; TREE;
D O I
10.1016/j.ympev.2023.107905
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Selecting the best model of sequence evolution for a multiple-sequence-alignment (MSA) constitutes the first step of phylogenetic tree reconstruction. Common approaches for inferring nucleotide models typically apply maximum likelihood (ML) methods, with discrimination between models determined by one of several information criteria. This requires tree reconstruction and optimisation which can be computationally expensive. We demonstrate that neural networks can be used to perform model selection, without the need to reconstruct trees, optimise parameters, or calculate likelihoods.We introduce ModelRevelator, a model selection tool underpinned by two deep neural networks. The first neural network, NNmodelfind, recommends one of six commonly used models of sequence evolution, ranging in complexity from Jukes and Cantor to General Time Reversible. The second, NNalphafind, recommends whether or not a Gamma-distributed rate heterogeneous model should be incorporated, and if so, provides an estimate of the shape parameter, alpha. Users can simply input an MSA into ModelRevelator, and swiftly receive output recommending the evolutionary model, inclusive of the presence or absence of rate heterogeneity, and an estimate of alpha.We show that ModelRevelator performs comparably with likelihood-based methods and the recently published machine learning method ModelTeller over a wide range of parameter settings, with significant potential savings in computational effort. Further, we show that this performance is not restricted to the alignments on which the networks were trained, but is maintained even on unseen empirical data. We expect that ModelRevelator will provide a valuable alternative for phylogeneticists, especially where traditional methods of model selection are computationally prohibitive.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] A Fast Deep Learning Model for Textual Relevance in Biomedical Information Retrieval
    Mohan, Sunil
    Fiorini, Nicolas
    Kim, Sun
    Lu, Zhiyong
    WEB CONFERENCE 2018: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW2018), 2018, : 77 - 86
  • [42] Fast and Accurate Deep Learning Model for Stamps Detection for Embedded Devices
    A. Gayer
    D. Ershova
    V. Arlazarov
    Pattern Recognition and Image Analysis, 2022, 32 : 772 - 779
  • [43] DEEP LEARNING ESTIMATION OF MEDIAN NERVE VOLUME USING ULTRASOUND IMAGING IN A HUMAN CADAVER MODEL
    Kuroiwa, Tomoyuki
    Jagtap, Jaidip
    Starlinger, Julia
    Lui, Hayman
    Akkus, Zeynettin
    Erickson, Bradley
    Amadio, Peter
    ULTRASOUND IN MEDICINE AND BIOLOGY, 2022, 48 (11) : 2237 - 2248
  • [44] The model of fast face recognition against age interference in deep learning
    Zhang, Yuzhe
    Wu, Peilin
    Zhao, Jinhui
    Feng, Hao
    Liao, Rongtao
    INTERNATIONAL JOURNAL OF BIOMETRICS, 2022, 14 (3-4) : 223 - 238
  • [45] Mining versatile feruloyl esterases: phylogenetic classification, structural features, and deep learning model
    Guo, Liang
    Dong, Yuxin
    Zhang, Deyong
    Pan, Xinrong
    Jin, Xinjie
    Yan, Xinyu
    Lu, Yin
    BIORESOURCES AND BIOPROCESSING, 2025, 12 (01)
  • [46] PHYLOGENETIC REPLAY LEARNING IN DEEP NEURAL NETWORKS
    Glafkides, Jean-Patrice
    Sher, Gene, I
    Akdag, Herman
    JORDANIAN JOURNAL OF COMPUTERS AND INFORMATION TECHNOLOGY, 2022, 8 (03): : 218 - 231
  • [47] Deep learning-based weight estimation using a fast-reconstructed mesh model from the point cloud of a pig
    Kwon, Kiyoun
    Park, Ahram
    Lee, Hyunoh
    Mun, Duhwan
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 210
  • [48] Dichromatic Model Based Highlight Removal via Deep Learning
    Lee, Chan-Ho
    Yoo, Jun-Sang
    Kim, Jong-Ok
    11TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE: DATA, NETWORK, AND AI IN THE AGE OF UNTACT (ICTC 2020), 2020, : 93 - 95
  • [49] Ensemble Learning Based on Hybrid Deep Learning Model for Heart Disease Early Prediction
    Almulihi, Ahmed
    Saleh, Hager
    Hussien, Ali Mohamed
    Mostafa, Sherif
    El-Sappagh, Shaker
    Alnowaiser, Khaled
    Ali, Abdelmgeid A.
    Refaat Hassan, Moatamad
    DIAGNOSTICS, 2022, 12 (12)
  • [50] THE DISCOVERY OF DYNAMICS VIA LINEAR MULTISTEP METHODS AND DEEP LEARNING: ERROR ESTIMATION
    DU, Q. I. A. N. G.
    Gu, Y. I. Q. I.
    Yang, H. A. I. Z. H. A. O.
    Zhou, C. H. A. O.
    SIAM JOURNAL ON NUMERICAL ANALYSIS, 2022, 60 (04) : 2014 - 2045