Open source system for analyzing, validating, and storing protein identification data

被引:523
作者
Craig, R
Cortens, JP
Beavis, RC [1 ]
机构
[1] Beavis Informat Ltd, Winnipeg, MB, Canada
[2] Manitoba Ctr Prote, Winnipeg, MB, Canada
关键词
proteomics database; GPM; X Tandem; protein identification; XIAPE;
D O I
10.1021/pr049882h
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
This paper describes an open-source system for analyzing, storing, and validating proteomics information derived from tandem mass spectrometry. It is based on a combination of data analysis servers, a user interface, and a relational database. The database was designed to store the minimum amount of information necessary to search and retrieve data obtained from the publicly available data analysis servers. Collectively, this system was referred to as the Global Proteome Machine (GPM). The components of the system have been made available as open source development projects. A publicly available system has been established, comprised of a group of data analysis servers and one main database server.
引用
收藏
页码:1234 / 1242
页数:9
相关论文
共 27 条
[1]   Constellations in a cellular universe [J].
Aebersold, R .
NATURE, 2003, 422 (6928) :115-116
[2]   Mass spectrometry in proteomics [J].
Aebersold, R ;
Goodlett, DR .
CHEMICAL REVIEWS, 2001, 101 (02) :269-295
[3]   Mass spectrometry-based proteomics [J].
Aebersold, R ;
Mann, M .
NATURE, 2003, 422 (6928) :198-207
[4]   Enabling proteomics: the need for an extendable 'workbench' for user-configurable solutions [J].
Beynon, RJ .
COMPARATIVE AND FUNCTIONAL GENOMICS, 2004, 5 (01) :52-55
[5]   An overview of ensembl [J].
Birney, E ;
Andrews, TD ;
Bevan, P ;
Caccamo, M ;
Chen, Y ;
Clarke, L ;
Coates, G ;
Cuff, J ;
Curwen, V ;
Cutts, T ;
Down, T ;
Eyras, E ;
Fernandez-Suarez, XM ;
Gane, P ;
Gibbins, B ;
Gilbert, J ;
Hammond, M ;
Hotz, HR ;
Iyer, V ;
Jekosch, K ;
Kahari, A ;
Kasprzyk, A ;
Keefe, D ;
Keenan, S ;
Lehvaslaiho, H ;
McVicker, G ;
Melsopp, C ;
Meidl, P ;
Mongin, E ;
Pettett, R ;
Potter, S ;
Proctor, G ;
Rae, M ;
Searle, S ;
Slater, G ;
Smedley, D ;
Smith, J ;
Spooner, W ;
Stabenau, A ;
Stalker, J ;
Storey, R ;
Ureta-Vidal, A ;
Woodwark, KC ;
Cameron, G ;
Durbin, R ;
Cox, A ;
Hubbard, T ;
Clamp, M .
GENOME RESEARCH, 2004, 14 (05) :925-928
[6]   A method for reducing the time required to match protein sequences with tandem mass spectra [J].
Craig, R ;
Beavis, RC .
RAPID COMMUNICATIONS IN MASS SPECTROMETRY, 2003, 17 (20) :2310-2316
[7]   Intensity-based protein identification by machine learning from a library of tandem mass spectra [J].
Elias, JE ;
Gibbons, FD ;
King, OD ;
Roth, FP ;
Gygi, SP .
NATURE BIOTECHNOLOGY, 2004, 22 (02) :214-219
[8]   The biopolymer markup language [J].
Fenyö, D .
BIOINFORMATICS, 1999, 15 (04) :339-340
[9]   A method for assessing the statistical significance of mass spectrometry-based protein identifications using general scoring schemes [J].
Fenyö, D ;
Beavis, RC .
ANALYTICAL CHEMISTRY, 2003, 75 (04) :768-774
[10]   Protein identification using mass spectrometric information [J].
Fenyö, D ;
Qin, J ;
Chait, BT .
ELECTROPHORESIS, 1998, 19 (06) :998-1005