Inparanoid: a comprehensive database of eukaryotic orthologs

Cited by: 545
Authors
O'Brien, KP
Remm, M
Sonnhammer, ELL [1]
Affiliations
[1] Karolinska Inst, Ctr Genom & Bioinformat, S-17177 Stockholm, Sweden
[2] Univ Tartu, Inst Mol & Cell Biol, Dept Bioinformat, EE-50090 Tartu, Estonia
DOI
10.1093/nar/gki107
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Subject classification codes
071010; 081704
Abstract
The Inparanoid eukaryotic ortholog database (http://inparanoid.cgb.ki.se/) is a collection of pairwise ortholog groups between 17 whole genomes: Anopheles gambiae, Caenorhabditis briggsae, Caenorhabditis elegans, Drosophila melanogaster, Danio rerio, Takifugu rubripes, Gallus gallus, Homo sapiens, Mus musculus, Pan troglodytes, Rattus norvegicus, Oryza sativa, Plasmodium falciparum, Arabidopsis thaliana, Escherichia coli, Saccharomyces cerevisiae and Schizosaccharomyces pombe. Complete proteomes for these genomes were derived from Ensembl and UniProt and compared pairwise using Blast, followed by a clustering step using the Inparanoid program. An Inparanoid cluster is seeded by a reciprocally best-matching ortholog pair, around which inparalogs (should they exist) are gathered independently, while outparalogs are excluded. The ortholog clusters can be searched on the website using Ensembl gene/protein or UniProt identifiers, by annotation text, or by Blast alignment against our protein datasets. The entire dataset can be downloaded, as can the Inparanoid program itself.
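The clustering idea in the abstract can be illustrated with a minimal Python sketch. It is not the Inparanoid program: the function names (best_hits, seed_pairs, gather_inparalogs, cluster), the dictionary-based score tables, and the toy scores are all hypothetical. The inparalog rule used here (a same-species protein joins the cluster when it scores at least as well against the seed ortholog as the seed pair scores against each other) is a simplified assumption, and the sketch omits the score cutoffs, overlap filters, and confidence values a real implementation would need.

```python
def best_hits(scores):
    """Map each query protein to its highest-scoring target and that score.

    `scores` is a dict {(query, target): similarity_score}.
    """
    best = {}
    for (query, target), score in scores.items():
        if query not in best or score > best[query][1]:
            best[query] = (target, score)
    return best


def seed_pairs(a_vs_b, b_vs_a):
    """Return (a, b, score) for every reciprocally best-matching pair."""
    best_ab = best_hits(a_vs_b)
    best_ba = best_hits(b_vs_a)
    pairs = []
    for a, (b, score) in best_ab.items():
        # Keep the pair only if b's best hit points back to a.
        if b in best_ba and best_ba[b][0] == a:
            pairs.append((a, b, score))
    return pairs


def gather_inparalogs(seed, seed_score, within_species):
    """Collect same-species proteins at least as similar to the seed
    as the seed is to its ortholog (simplified, assumed rule)."""
    return sorted(
        other
        for (query, other), score in within_species.items()
        if query == seed and other != seed and score >= seed_score
    )


def cluster(a_vs_b, b_vs_a, a_vs_a, b_vs_b):
    """Build one cluster per seed pair, gathering inparalogs on each
    side independently; lower-scoring paralogs are left out."""
    return [
        {
            "seed": (a, b),
            "inparalogs_a": gather_inparalogs(a, score, a_vs_a),
            "inparalogs_b": gather_inparalogs(b, score, b_vs_b),
        }
        for a, b, score in seed_pairs(a_vs_b, b_vs_a)
    ]


if __name__ == "__main__":
    # Toy scores (hypothetical): A1/B1 are reciprocal best hits, A2 is a
    # recent duplicate of A1, A3 is a distant paralog.
    a_vs_b = {("A1", "B1"): 500, ("A2", "B1"): 450, ("A3", "B1"): 200}
    b_vs_a = {("B1", "A1"): 500, ("B1", "A2"): 450}
    a_vs_a = {("A1", "A2"): 600, ("A1", "A3"): 150}
    b_vs_b = {}
    print(cluster(a_vs_b, b_vs_a, a_vs_a, b_vs_b))
```

In the toy data, A2 is gathered as an inparalog of the A1/B1 seed because it is closer to A1 than A1 is to B1, while A3 falls below the seed score and is treated as an outparalog.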
Pages: D476-D480
Number of pages: 5