Pfarao: a web application for protein family analysis customized for cytoskeletal and motor proteins (CyMoBase)

Cited by: 22
Authors
Odronitz, Florian [1]
Kollmar, Martin [1]
Affiliations
[1] Max Planck Inst Biophys Chem, Dept NMR Based Struct Biol, D-37077 Gottingen, Germany
Keywords
DOI
10.1186/1471-2164-7-300
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Subject classification codes
071005; 0836; 090102; 100705
Abstract
Background: Annotation of protein sequences of eukaryotic organisms is crucial for understanding their function in the cell. Manual annotation is still by far the most accurate way to predict genes correctly. The classification of protein sequences, their phylogenetic relations, and the assignment of function involve information from various sources. This often leads to a collection of heterogeneous data that is hard to track. Cytoskeletal and motor proteins form large and diverse superfamilies comprising up to several dozen members per organism. To date, there is no integrated tool available to assist in the manual large-scale comparative genomic analysis of protein families.
Description: Pfarao (Protein Family Application for Retrieval, Analysis and Organisation) is a database-driven online working environment for the analysis of manually annotated protein sequences and their relationships. Currently, the system can store and interrelate a wide range of information about protein sequences, species, phylogenetic relations, and sequencing projects, as well as links to literature and domain predictions. Sequences can be imported from multiple sequence alignments that are generated during the annotation process. A web interface allows users to browse the database conveniently and to compile tabular and graphical summaries of its content.
Conclusion: We implemented a protein sequence-centric web application to store, organize, interrelate, and present heterogeneous data that is generated in manual genome annotation and comparative genomics. The application has been developed for the analysis of cytoskeletal and motor proteins (CyMoBase) but can easily be adapted for any protein.
Pages: 8