共 2 条
VCFtoTree: a user-friendly tool to construct locus-specific alignments and phylogenies from thousands of anthropologically relevant genome sequences
被引:8
|作者:
Xu, Duo
[1
]
Jaber, Yousef
[1
]
Pavlidis, Pavlos
[2
]
Gokcumen, Omer
[1
]
机构:
[1] SUNY Buffalo, Dept Biol Sci, Buffalo, NY 14260 USA
[2] Fdn Res & Technol Hellas, IMBB, Iraklion, Crete, Greece
来源:
BMC BIOINFORMATICS
|
2017年
/
18卷
基金:
美国国家科学基金会;
关键词:
VCF;
Phylogeny;
FASTA;
1000Genomes;
Anthropological genetics;
Next generation sequencing data;
EVOLUTION;
VARIANTS;
SOFTWARE;
INSIGHTS;
DELETION;
FORMAT;
TREES;
D O I:
10.1186/s12859-017-1844-0
中图分类号:
Q5 [生物化学];
学科分类号:
071010 ;
081704 ;
摘要:
Background: Constructing alignments and phylogenies for a given locus from large genome sequencing studies with relevant outgroups allow novel evolutionary and anthropological insights. However, no user- friendly tool has been developed to integrate thousands of recently available and anthropologically relevant genome sequences to construct complete sequence alignments and phylogenies. Results: Here, we provide VCFtoTree, a user friendly tool with a graphical user interface that directly accesses online databases to download, parse and analyze genome variation data for regions of interest. Our pipeline combines popular sequence datasets and tree building algorithms with custom data parsing to generate accurate alignments and phylogenies using all the individuals from the 1000 Genomes Project, Neanderthal and Denisovan genomes, as well as reference genomes of Chimpanzee and Rhesus Macaque. It can also be applied to other phased human genomes, as well as genomes from other species. The output of our pipeline includes an alignment in FASTA format and a tree file in newick format. Conclusion: VCFtoTree fulfills the increasing demand for constructing alignments and phylogenies for a given loci from thousands of available genomes. Our software provides a user friendly interface for a wider audience without prerequisite knowledge in programming.
引用
收藏
页数:8
相关论文