Open source system for analyzing, validating, and storing protein identification data

被引:520
作者
Craig, R
Cortens, JP
Beavis, RC [1 ]
机构
[1] Beavis Informat Ltd, Winnipeg, MB, Canada
[2] Manitoba Ctr Prote, Winnipeg, MB, Canada
关键词
proteomics database; GPM; X Tandem; protein identification; XIAPE;
D O I
10.1021/pr049882h
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
This paper describes an open-source system for analyzing, storing, and validating proteomics information derived from tandem mass spectrometry. It is based on a combination of data analysis servers, a user interface, and a relational database. The database was designed to store the minimum amount of information necessary to search and retrieve data obtained from the publicly available data analysis servers. Collectively, this system was referred to as the Global Proteome Machine (GPM). The components of the system have been made available as open source development projects. A publicly available system has been established, comprised of a group of data analysis servers and one main database server.
引用
收藏
页码:1234 / 1242
页数:9
相关论文
共 27 条
  • [1] Constellations in a cellular universe
    Aebersold, R
    [J]. NATURE, 2003, 422 (6928) : 115 - 116
  • [2] Mass spectrometry in proteomics
    Aebersold, R
    Goodlett, DR
    [J]. CHEMICAL REVIEWS, 2001, 101 (02) : 269 - 295
  • [3] Mass spectrometry-based proteomics
    Aebersold, R
    Mann, M
    [J]. NATURE, 2003, 422 (6928) : 198 - 207
  • [4] Enabling proteomics: the need for an extendable 'workbench' for user-configurable solutions
    Beynon, RJ
    [J]. COMPARATIVE AND FUNCTIONAL GENOMICS, 2004, 5 (01): : 52 - 55
  • [5] An overview of ensembl
    Birney, E
    Andrews, TD
    Bevan, P
    Caccamo, M
    Chen, Y
    Clarke, L
    Coates, G
    Cuff, J
    Curwen, V
    Cutts, T
    Down, T
    Eyras, E
    Fernandez-Suarez, XM
    Gane, P
    Gibbins, B
    Gilbert, J
    Hammond, M
    Hotz, HR
    Iyer, V
    Jekosch, K
    Kahari, A
    Kasprzyk, A
    Keefe, D
    Keenan, S
    Lehvaslaiho, H
    McVicker, G
    Melsopp, C
    Meidl, P
    Mongin, E
    Pettett, R
    Potter, S
    Proctor, G
    Rae, M
    Searle, S
    Slater, G
    Smedley, D
    Smith, J
    Spooner, W
    Stabenau, A
    Stalker, J
    Storey, R
    Ureta-Vidal, A
    Woodwark, KC
    Cameron, G
    Durbin, R
    Cox, A
    Hubbard, T
    Clamp, M
    [J]. GENOME RESEARCH, 2004, 14 (05) : 925 - 928
  • [6] A method for reducing the time required to match protein sequences with tandem mass spectra
    Craig, R
    Beavis, RC
    [J]. RAPID COMMUNICATIONS IN MASS SPECTROMETRY, 2003, 17 (20) : 2310 - 2316
  • [7] Intensity-based protein identification by machine learning from a library of tandem mass spectra
    Elias, JE
    Gibbons, FD
    King, OD
    Roth, FP
    Gygi, SP
    [J]. NATURE BIOTECHNOLOGY, 2004, 22 (02) : 214 - 219
  • [8] The biopolymer markup language
    Fenyö, D
    [J]. BIOINFORMATICS, 1999, 15 (04) : 339 - 340
  • [9] A method for assessing the statistical significance of mass spectrometry-based protein identifications using general scoring schemes
    Fenyö, D
    Beavis, RC
    [J]. ANALYTICAL CHEMISTRY, 2003, 75 (04) : 768 - 774
  • [10] Protein identification using mass spectrometric information
    Fenyö, D
    Qin, J
    Chait, BT
    [J]. ELECTROPHORESIS, 1998, 19 (06) : 998 - 1005