Consequences of the discontinuation of the International Protein Index (IPI) database and its substitution by the UniProtKB "complete proteome" sets

被引:25
作者
Griss, Johannes [1 ,2 ]
Martin, Maria [1 ]
O'Donovan, Claire [1 ]
Apweiler, Rolf [1 ]
Hermjakob, Henning [1 ]
Vizcaino, Juan Antonio [1 ]
机构
[1] EMBL European Bioinformat Inst, Cambridge CB10 1SD, England
[2] Med Univ Vienna, Dept Med 1, Vienna, Austria
基金
英国惠康基金;
关键词
Bioinformatics; Discontinuation; Gene annotation; International Protein Index; Protein databases; UniProt Knowledgebase; RESOURCE;
D O I
10.1002/pmic.201100363
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The International Protein Index (IPI) database has been one of the most widely used protein databases in MS proteomics approaches. Recently, the closure of IPI in September 2011 was announced. Its recommended replacement is the new UniProt Knowledgebase (UniProtKB) "complete proteome" sets, launched in May 2011. Here, we analyze the consequences of IPI's discontinuation for human and mouse data, and the effect of its substitution with UniProtKB on two levels: (i) data already produced and (ii) newly performed experiments. To estimate the effect on existing data, we investigated how well IPI identifiers map to UniProtKB accessions. We found that 21% of human and 10% of mouse identifiers do not map to UniProtKB and would thus be "lost." To investigate the impact on new experiments, we compared the theoretical search space (i. e. the tryptic peptides) of both resources and found that it is decreased by 14.0% for human and 8.9% for mouse data through IPI's closure. An analysis on the experimental evidence for these "lost" peptides showed that the vast majority has not been identified in experiments available in the major proteomics repositories. It thus seems likely that the search space provided by UniProtKB is of higher quality than the one currently provided by IPI.
引用
收藏
页码:4434 / 4438
页数:5
相关论文
共 16 条
  • [11] NCBI Reference Sequences: current status, policy and new initiatives
    Pruitt, Kim D.
    Tatusova, Tatiana
    Klimke, William
    Maglott, Donna R.
    [J]. NUCLEIC ACIDS RESEARCH, 2009, 37 : D32 - D36
  • [12] Large-scate database searching using tandem mass spectra: Looking up the answer in the back of the book
    Sadygov, RG
    Cociorva, D
    Yates, JR
    [J]. NATURE METHODS, 2004, 1 (03) : 195 - 202
  • [13] The Proteomics Identifications database: 2010 update
    Vizcaino, Juan Antonio
    Cote, Richard
    Reisinger, Florian
    Barsnes, Harald
    Foster, Joseph M.
    Rameseder, Jonathan
    Hermjakob, Henning
    Martens, Lennart
    [J]. NUCLEIC ACIDS RESEARCH, 2010, 38 : D736 - D742
  • [14] Design, implementation and maintenance of a model organism database for Arabidopsis thaliana
    Weems, D
    Miller, N
    Garcia-Hernandez, M
    Huala, E
    Rhee, SY
    [J]. COMPARATIVE AND FUNCTIONAL GENOMICS, 2004, 5 (04): : 362 - 369
  • [15] The vertebrate genome annotation (Vega) database
    Wilming, L. G.
    Gilbert, J. G. R.
    Howe, K.
    Trevanion, S.
    Hubbard, T.
    Harrow, J. L.
    [J]. NUCLEIC ACIDS RESEARCH, 2008, 36 : D753 - D760
  • [16] The H-Invitational Database (H-InvDB), a comprehensive annotation resource for human genes and transcripts
    Yamasaki, Chisato
    Murakami, Katsuhiko
    Fujii, Yasuyuki
    Sato, Yoshiharu
    Harada, Erimi
    Takeda, Jun-Ichi
    Taniya, Takayuki
    Sakate, Ryuichi
    Kikugawa, Shingo
    Shimada, Makoto
    Tanino, Motohiko
    Koyanagi, Kanako O.
    Barrero, Roberto A.
    Gough, Craig
    Chun, Hong-Woo
    Habara, Takuya
    Hanaoka, Hideki
    Hayakawa, Yosuke
    Hilton, Phillip B.
    Kaneko, Yayoi
    Kanno, Masako
    Kawahara, Yoshihiro
    Kawamura, Toshiyuki
    Matsuya, Akihiro
    Nagata, Naoki
    Nishikata, Kensaku
    Noda, Akiko Ogura
    Nurimoto, Shin
    Saichi, Naomi
    Sakai, Hiroaki
    Sanbonmatsu, Ryoko
    Shiba, Rie
    Suzuki, Mami
    Takabayashi, Kazuhiko
    Takahashi, Aiko
    Tamura, Takuro
    Tanaka, Masayuki
    Tanaka, Susumu
    Todokoro, Fusano
    Yamaguchi, Kaori
    Yamamoto, Naoyuki
    Okido, Toshihisa
    Mashima, Jun
    Hashizume, Aki
    Jin, Lihua
    Lee, Kyung-Bum
    Lin, Yi-Chueh
    Nozaki, Asami
    Sakai, Katsunaga
    Tada, Masahito
    [J]. NUCLEIC ACIDS RESEARCH, 2008, 36 : D793 - D799