ModelRevelator: Fast phylogenetic model estimation via deep learning

被引:9
|
作者
Burgstaller-Muehlbacher, Sebastian [1 ,2 ]
Crotty, Stephen M. [3 ,4 ]
Schmidt, Heiko A. [1 ,2 ]
Reden, Franziska [1 ,2 ]
Drucks, Tamara [1 ,2 ,6 ]
von Haeseler, Arndt [1 ,2 ,5 ]
机构
[1] Univ Vienna, Max Perutz Labs, Ctr Integrat Bioinformat Vienna, A-1030 Vienna, Austria
[2] Med Univ Vienna, Vienna Bioctr VBC 5, A-1030 Vienna, Austria
[3] Univ Adelaide, Sch Math Sci, Adelaide, SA 5005, Australia
[4] Univ Adelaide, ARC Ctr Excellence Math & Stat Frontiers, Adelaide, SA 5005, Australia
[5] Univ Vienna, Fac Comp Sci, Bioinformat & Computat Biol, Waehringer Str 29, A-1090 Vienna, Austria
[6] TU Wien, Res Unit Machine Learning, A-1040 Vienna, Austria
关键词
Phylogenetic model estimation; Deep learning; Artificial intelligence; Phylogenetics; Phylogenomics; DNA-SEQUENCES; SELECTION; SUBSTITUTIONS; SIMULATION; JMODELTEST; EVOLUTION; PROTEIN; SITES; RATES; TREE;
D O I
10.1016/j.ympev.2023.107905
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Selecting the best model of sequence evolution for a multiple-sequence-alignment (MSA) constitutes the first step of phylogenetic tree reconstruction. Common approaches for inferring nucleotide models typically apply maximum likelihood (ML) methods, with discrimination between models determined by one of several information criteria. This requires tree reconstruction and optimisation which can be computationally expensive. We demonstrate that neural networks can be used to perform model selection, without the need to reconstruct trees, optimise parameters, or calculate likelihoods.We introduce ModelRevelator, a model selection tool underpinned by two deep neural networks. The first neural network, NNmodelfind, recommends one of six commonly used models of sequence evolution, ranging in complexity from Jukes and Cantor to General Time Reversible. The second, NNalphafind, recommends whether or not a Gamma-distributed rate heterogeneous model should be incorporated, and if so, provides an estimate of the shape parameter, alpha. Users can simply input an MSA into ModelRevelator, and swiftly receive output recommending the evolutionary model, inclusive of the presence or absence of rate heterogeneity, and an estimate of alpha.We show that ModelRevelator performs comparably with likelihood-based methods and the recently published machine learning method ModelTeller over a wide range of parameter settings, with significant potential savings in computational effort. Further, we show that this performance is not restricted to the alignments on which the networks were trained, but is maintained even on unseen empirical data. We expect that ModelRevelator will provide a valuable alternative for phylogeneticists, especially where traditional methods of model selection are computationally prohibitive.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Deep Learning for Opportunistic Rain Estimation via Satellite Microwave Links
    Scognamiglio, Giovanni
    Rucci, Andrea
    Vaccaro, Attilio
    Adirosi, Elisa
    Sapienza, Fabiola
    Giannetti, Filippo
    Bacci, Giacomo
    Angeloni, Sabina
    Baldini, Luca
    Roversi, Giacomo
    Ortolani, Alberto
    Antonini, Andrea
    Melani, Samantha
    SENSORS, 2024, 24 (21)
  • [32] Research on Vehicle Distance Estimation Model based on Deep Learning
    Ding, Cheng
    Bao, Jianmin
    Mi, Guanyu
    Kuai, Xiao
    Kang, YiNing
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7169 - 7173
  • [33] Design of sparse arrays via deep learning for enhanced DOA estimation
    Steven Wandale
    Koichi Ichige
    EURASIP Journal on Advances in Signal Processing, 2021
  • [34] Deep Learning for the Estimation of the Longitudinal Slip Ratio
    Marotta, Raffaele
    Ivanov, Valentin
    Strano, Salvatore
    Terzo, Mario
    Tordela, Ciro
    2023 IEEE INTERNATIONAL WORKSHOP ON METROLOGY FOR AUTOMOTIVE, METROAUTOMOTIVE, 2023, : 193 - 198
  • [35] Deep Learning for Underwater Visual Odometry Estimation
    Teixeira, Bernardo
    Silva, Hugo
    Matos, Anibal
    Silva, Eduardo
    IEEE ACCESS, 2020, 8 : 44687 - 44701
  • [36] Microstructure Estimation by Combining Deep Learning and Phase Transformation Model
    Noguchi, Satoshi
    Aihara, Syuji
    Inoue, Junya
    TETSU TO HAGANE-JOURNAL OF THE IRON AND STEEL INSTITUTE OF JAPAN, 2023, 109 (11): : 898 - 914
  • [37] DGImNet: A deep learning model for photovoltaic soiling loss estimation
    Fang, Mingyu
    Qian, Weixing
    Qian, Tao
    Bao, Qiwei
    Zhang, Haocheng
    Qiu, Xiao
    APPLIED ENERGY, 2024, 376
  • [38] FAST APPROXIMATION OF NON-NEGATIVE SPARSE RECOVERY VIA DEEP LEARNING
    Xie, Youye
    Wang, Zifan
    Pei, Weiping
    Tang, Gongguo
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 2921 - 2925
  • [39] Screening and evaluation of diabetic retinopathy via a deep learning network model: A prospective study
    Yao, Li
    Cao, Chan-Yuan
    Yu, Guo-Xiao
    Shu, Xu-Peng
    Fan, Xiao-Nan
    Zhang, Yi-Fan
    WORLD JOURNAL OF DIABETES, 2024, 15 (12)
  • [40] Fast and Accurate Deep Learning Model for Stamps Detection for Embedded Devices
    Gayer, A.
    Ershova, D.
    Arlazarov, V.
    PATTERN RECOGNITION AND IMAGE ANALYSIS, 2022, 32 (04) : 772 - 779