Protein Information and Knowledge Extractor: Discovering biological information from proteomics data

Cited by: 6
Authors
Medina-Aunon, J. Alberto [1]
Paradela, Alberto [1]
Macht, Marcus [2]
Thiele, Herbert [2]
Corthals, Garry [3, 4]
Albar, Juan Pablo [1]
Affiliations
[1] CSIC, Prote Facil, Ctr Nacl Biotecnol, Madrid, Spain
[2] Bruker Daltonik GmbH, Bremen, Germany
[3] Turku Univ, Prote Res Grp, Turku Ctr Biotechnol, Turku, Finland
[4] Abo Akad Univ, Turku, Finland
Keywords
Bioinformatics; Data mining; Functional proteomics; Proteome analysis; Web tool; GENE ONTOLOGY; FUNCTIONAL-ANALYSIS; TOOLS; SETS; IDENTIFICATIONS; IDENTIFIERS; NAVIGATION; PLATFORM; PRIDE; CELLS
DOI
10.1002/pmic.201000093
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
One of the main goals in proteomics is to answer biological and molecular questions about a set of identified proteins. To achieve this goal, one has to extract and collect the existing biological data from public repositories for every protein and afterward analyze and organize the collected data. Due to the complexity of this task and the huge amount of data available, it is not possible to gather this information by hand, making it necessary to find automatic methods of data collection. Within a proteomic context, we have developed the Protein Information and Knowledge Extractor (PIKE), which solves this problem by automatically accessing several public information systems and databases across the Internet. The PIKE bioinformatics tool starts with a set of identified proteins, listed by the accession codes of the most common protein databases, and retrieves all relevant and updated information from the most relevant databases. Once the search is complete, PIKE summarizes the information for every single protein using several file formats that allow the information to be shared and exchanged with other software tools. It is our opinion that PIKE represents a great step forward for information procurement and drastically reduces manual database validation for large proteomic studies. It is available at http://proteo.cnb.csic.es/pike.
Pages: 3262-3271
Page count: 10