A graph-theoretic approach for inparalog detection

被引:5
作者
Tremblay-Savard, Olivier [1 ]
Swenson, Krister M. [1 ,2 ]
机构
[1] Univ Montreal, Dept Informat DIRO, Montreal, PQ H3C 3J7, Canada
[2] McGill Univ, McGill Ctr Bioinformat, Montreal, PQ H3C 2B4, Canada
关键词
GENE MAP; EVOLUTION; DUPLICATION; ORTHOLOGY; GENOME;
D O I
10.1186/1471-2105-13-S19-S16
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Understanding the history of a gene family that evolves through duplication, speciation, and loss is a fundamental problem in comparative genomics. Features such as function, position, and structural similarity between genes are intimately connected to this history; relationships between genes such as orthology (genes related through a speciation event) or paralogy (genes related through a duplication event) are usually correlated with these features. For example, recent work has shown that in human and mouse there is a strong connection between function and inparalogs, the paralogs that were created since the speciation event separating the human and mouse lineages. Methods exist for detecting inparalogs that either use information from only two species, or consider a set of species but rely on clustering methods. In this paper we present a graph-theoretic approach for finding lower bounds on the number of inparalogs for a given set of species; we pose an edge covering problem on the similarity graph and give an efficient 2/3-approximation as well as a faster heuristic. Since the physical position of inparalogs corresponding to recent speciations is not likely to have changed since the duplication, we also use our predictions to estimate the types of duplications that have occurred in some vertebrates and drosophila.
引用
收藏
页数:11
相关论文
共 34 条
[1]   Automatic clustering of orthologs and inparalogs shared by multiple proteomes [J].
Alexeyenko, Andrey ;
Tamas, Ivica ;
Liu, Gang ;
Sonnhammer, Erik L. L. .
BIOINFORMATICS, 2006, 22 (14) :E9-E15
[2]  
Allendorf F.W., 1984, P1
[3]   Phylogenetic and Functional Assessment of Orthologs Inference Projects and Methods [J].
Altenhoff, Adrian M. ;
Dessimoz, Christophe .
PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (01)
[4]   An Alu transposition model for the origin and expansion of human segmental duplications [J].
Bailey, JA ;
Liu, G ;
Eichler, EE .
AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 73 (04) :823-834
[5]   Assessing Performance of Orthology Detection Strategies Applied to Eukaryotic Genomes [J].
Chen, Feng ;
Mackey, Aaron J. ;
Vermunt, Jeroen K. ;
Roos, David S. .
PLOS ONE, 2007, 2 (04)
[6]  
Chunfang Zheng, 2011, Algorithms in Bioinformatics. Proceedings of the 11th International Workshop, WABI 2011, P364, DOI 10.1007/978-3-642-23038-7_30
[7]   Recent duplication, domain accretion and the dynamic mutation of the human genome [J].
Eichler, EE .
TRENDS IN GENETICS, 2001, 17 (11) :661-669
[8]   Evolutionary Patterns of Recently Emerged Animal Duplogs [J].
Ezawa, Kiyoshi ;
Ikeo, Kazuho ;
Gojobori, Takashi ;
Saitou, Naruya .
GENOME BIOLOGY AND EVOLUTION, 2011, 3 :1119-1135
[9]   Getting Started in Gene Orthology and Functional Analysis [J].
Fang, Gang ;
Bhardwaj, Nitin ;
Robilotto, Rebecca ;
Gerstein, Mark B. .
PLOS COMPUTATIONAL BIOLOGY, 2010, 6 (03)
[10]  
FITCH WM, 1977, GENETICS, V86, P623