fastDFE: Fast and Flexible Inference of the Distribution of Fitness Effects

被引:6
作者
Sendrowski, Janek [1 ]
Bataillon, Thomas [1 ]
机构
[1] Aarhus Univ, Bioinformat Res Ctr, Aarhus, Denmark
关键词
distribution of fitness effects; genomics; software; site-frequency spectrum; site-degeneracy annotation; ancestral-allele annotation; POLYMORPHISM;
D O I
10.1093/molbev/msae070
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Estimating the distribution of fitness effects (DFE) of new mutations is of fundamental importance in evolutionary biology, ecology, and conservation. However, existing methods for DFE estimation suffer from limitations, such as slow computation speed and limited scalability. To address these issues, we introduce fastDFE, a Python-based software package, offering fast, and flexible DFE inference from site-frequency spectrum (SFS) data. Apart from providing efficient joint inference of multiple DFEs that share parameters, it offers the feature of introducing genomic covariates that influence the DFEs and testing their significance. To further simplify usage, fastDFE is equipped with comprehensive VCF-to-SFS parsing utilities. These include options for site filtering and stratification, as well as site-degeneracy annotation and probabilistic ancestral-allele inference. fastDFE thereby covers the entire workflow of DFE inference from the moment of acquiring a raw VCF file. Despite its Python foundation, fastDFE incorporates a full R interface, including native R visualization capabilities. The package is comprehensively tested and documented at fastdfe.readthedocs.io.
引用
收藏
页数:5
相关论文
共 11 条
[1]   Hunting for Beneficial Mutations: Conditioning on SIFT Scores When Estimating the Distribution of Fitness Effect of New Mutations [J].
Chen, Jun ;
Bataillon, Thomas ;
Glemin, Sylvain ;
Lascoux, Martin .
GENOME BIOLOGY AND EVOLUTION, 2022, 14 (01)
[2]   Adaptive Protein Evolution in Animals and the Effective Population Size Hypothesis [J].
Galtier, Nicolas .
PLOS GENETICS, 2016, 12 (01)
[3]   Joint inference of the distribution of fitness effects of deleterious mutations and population demography based on nucleotide polymorphism frequencies [J].
Keightley, Peter D. ;
Eyre-Walker, Adam .
GENETICS, 2007, 177 (04) :2251-2261
[4]   Inferring the Probability of the Derived vs. the Ancestral Allelic State at a Polymorphic Site [J].
Keightley, Peter D. ;
Jackson, Benjamin C. .
GENETICS, 2018, 209 (03) :897-906
[5]  
Latrille T, 2024, bioRxiv, DOI [10.1101/2023.05.03.538864, 10.1101/2023.05.03.538864, DOI 10.1101/2023.05.03.538864]
[6]   Strong evidence for the adaptive walk model of gene evolution in Drosophila and Arabidopsis [J].
Moutinho, Ana Filipa ;
Eyre-Walker, Adam ;
Dutheil, Julien Y. .
PLOS BIOLOGY, 2022, 20 (09)
[7]  
SAWYER SA, 1992, GENETICS, V132, P1161
[8]  
Sendrowski J., 2022, Demography of birch populations across Scandinavia
[9]  
Sethupathy Praveen, 2008, Advances in Bioinformatics, V2008, P257864, DOI 10.1155/2008/257864
[10]   polyDFEv2.0: testing for invariance of the distribution of fitness effects within and across species [J].
Tataru, Paula ;
Bataillon, Thomas .
BIOINFORMATICS, 2019, 35 (16) :2868-2869