Motivation: The complete sequencing of many genomes has made it possible to identify orthologous genes descending from a common ancestor. However, reconstruction of evolutionary history over long time periods faces many challenges due to gene duplications and losses. Identification of orthologous groups shared by multiple proteomes therefore becomes a clustering problem in which an optimal compromise between conflicting evidences needs to be found. Results: Here we present a new proteome-scale analysis program called MultiParanoid that can automatically find orthology relationships between proteins in multiple proteomes. The software is an extension of the InParanoid program that identifies orthologs and inparalogs in pairwise proteome comparisons. MultiParanoid applies a clustering algorithm to merge multiple pairwise ortholog groups from InParanoid into multi-species ortholog groups. To avoid outparalogs in the same cluster, MultiParanoid only combines species that share the same last ancestor. To validate the clustering technique, we compared the results to a reference set obtained by manual phylogenetic analysis. We further compared the results to ortholog groups in KOGs and OrthoMCL, which revealed that MultiParanoid produces substantially fewer outparalogs than these resources.
机构:
Penn State Univ, Astrobiol Res Ctr, University Pk, PA 16802 USA
Penn State Univ, Dept Biol, University Pk, PA 16802 USAPenn State Univ, Astrobiol Res Ctr, University Pk, PA 16802 USA
Blair, Jaime E.
;
Ikeo, Kazuho
论文数: 0引用数: 0
h-index: 0
机构:
Natl Inst Genet, Ctr Informat Biol, Mishima, Shizuoka 4118540, JapanPenn State Univ, Astrobiol Res Ctr, University Pk, PA 16802 USA
Ikeo, Kazuho
;
Gojobori, Takashi
论文数: 0引用数: 0
h-index: 0
机构:
Natl Inst Genet, Ctr Informat Biol, Mishima, Shizuoka 4118540, JapanPenn State Univ, Astrobiol Res Ctr, University Pk, PA 16802 USA
Gojobori, Takashi
;
Hedges, S. Blair
论文数: 0引用数: 0
h-index: 0
机构:
Penn State Univ, Astrobiol Res Ctr, University Pk, PA 16802 USA
Penn State Univ, Dept Biol, University Pk, PA 16802 USAPenn State Univ, Astrobiol Res Ctr, University Pk, PA 16802 USA
机构:
Penn State Univ, Astrobiol Res Ctr, University Pk, PA 16802 USA
Penn State Univ, Dept Biol, University Pk, PA 16802 USAPenn State Univ, Astrobiol Res Ctr, University Pk, PA 16802 USA
Blair, Jaime E.
;
Ikeo, Kazuho
论文数: 0引用数: 0
h-index: 0
机构:
Natl Inst Genet, Ctr Informat Biol, Mishima, Shizuoka 4118540, JapanPenn State Univ, Astrobiol Res Ctr, University Pk, PA 16802 USA
Ikeo, Kazuho
;
Gojobori, Takashi
论文数: 0引用数: 0
h-index: 0
机构:
Natl Inst Genet, Ctr Informat Biol, Mishima, Shizuoka 4118540, JapanPenn State Univ, Astrobiol Res Ctr, University Pk, PA 16802 USA
Gojobori, Takashi
;
Hedges, S. Blair
论文数: 0引用数: 0
h-index: 0
机构:
Penn State Univ, Astrobiol Res Ctr, University Pk, PA 16802 USA
Penn State Univ, Dept Biol, University Pk, PA 16802 USAPenn State Univ, Astrobiol Res Ctr, University Pk, PA 16802 USA