A New Orthology Assessment Method for Phylogenomic Data: Unrooted Phylogenetic Orthology

被引:53
作者
Ballesteros, Jesus A. [1 ]
Hormiga, Gustavo [1 ]
机构
[1] George Washington Univ, Dept Biol Sci, Washington, DC 20052 USA
基金
美国国家科学基金会;
关键词
Markov cluster; protein homology; spiders; Araneae; transcriptomics; genomics; COMPARATIVE TRANSCRIPTOMICS; QUALITY ASSESSMENT; TREE; EVOLUTION; GENOMICS; SYSTEMATICS; PREDICTION; SEQUENCES; ALGORITHM; ALIGNMENT;
D O I
10.1093/molbev/msw069
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Current sequencing technologies are making available unprecedented amounts of genetic data for a large variety of species including nonmodel organisms. Although many phylogenomic surveys spend considerable time finding orthologs from the wealth of sequence data, these results do not transcend the original study and after being processed for specific phylogenetic purposes these orthologs do not become stable orthology hypotheses. We describe a procedure to detect and document the phylogenetic distribution of orthologs allowing researchers to use this information to guide selection of loci best suited to test specific evolutionary questions. At the core of this pipeline is a new phylogenetic orthology method that is neither affected by the position of the root nor requires explicit assignment of outgroups. We discuss the properties of this new orthology assessment method and exemplify its utility for phylogenomics using a small insects dataset. In addition, we exemplify the pipeline to identify and document stable orthologs for the group of orb-weaving spiders (Araneoidea) using RNAseq data. The scripts used in this study, along with sample files and additional documentation, are available at https://github.com/ballesterus/UPhO.
引用
收藏
页码:2117 / 2134
页数:18
相关论文
共 99 条
[1]   TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations [J].
Abascal, Federico ;
Zardoya, Rafael ;
Telford, Maximilian J. .
NUCLEIC ACIDS RESEARCH, 2010, 38 :W7-W13
[2]   The PhyloFacts FAT-CAT web server: ortholog identification and function prediction using fast approximate tree classification [J].
Afrasiabi, Cyrus ;
Samad, Bushra ;
Dineen, David ;
Meacham, Christopher ;
Sjoelander, Kimmen .
NUCLEIC ACIDS RESEARCH, 2013, 41 (W1) :W242-W248
[3]  
AGNARSSON I., 2013, Spider research in the 21st century: trends perspectives, P58
[4]   The utility of ITS2 in spider phylogenetics: notes on prior work and an example from Anelosimus [J].
Agnarsson, Ingi .
JOURNAL OF ARACHNOLOGY, 2010, 38 (02) :377-382
[5]   Phylogenetic and Functional Assessment of Orthologs Inference Projects and Methods [J].
Altenhoff, Adrian M. ;
Dessimoz, Christophe .
PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (01)
[6]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[7]   A Transcriptomic Approach to Ribbon Worm Systematics (Nemertea): Resolving the Pilidiophora Problem [J].
Andrade, Sonia C. S. ;
Montenegro, Horacio ;
Strand, Malin ;
Schwartz, Megan L. ;
Kajihara, Hiroshi ;
Norenburg, Jon L. ;
Turbeville, James M. ;
Sundberg, Per ;
Giribet, Gonzalo .
MOLECULAR BIOLOGY AND EVOLUTION, 2014, 31 (12) :3206-3215
[8]  
[Anonymous], 2000, A cluster algorithm for graphs
[9]  
[Anonymous], The USENIX Magazine, V36 36, P42, DOI DOI 10.5281/ZENODO.16303
[10]   Can RNA-Seq Resolve the Rapid Radiation of Advanced Moths and Butterflies (Hexapoda: Lepidoptera: Apoditrysia)? An Exploratory Study [J].
Bazinet, Adam L. ;
Cummings, Michael P. ;
Mitter, Kim T. ;
Mitter, Charles W. .
PLOS ONE, 2013, 8 (12)