STRING v9.1: protein-protein interaction networks, with increased coverage and integration

被引:3283
作者
Franceschini, Andrea [1 ,2 ]
Szklarczyk, Damian [3 ]
Frankild, Sune [3 ]
Kuhn, Michael [4 ]
Simonovic, Milan [1 ,2 ]
Roth, Alexander [1 ,2 ]
Lin, Jianyi [5 ]
Minguez, Pablo [6 ]
Bork, Peer [6 ,7 ]
von Mering, Christian [1 ,2 ]
Jensen, Lars J. [3 ]
机构
[1] Univ Zurich, Inst Mol Life Sci, CH-8006 Zurich, Switzerland
[2] Univ Zurich, Swiss Inst Bioinformat, CH-8006 Zurich, Switzerland
[3] Univ Copenhagen, Novo Nordisk Fdn Ctr Prot Res, DK-1168 Copenhagen, Denmark
[4] Tech Univ Dresden, Ctr Biotechnol, Dresden, Germany
[5] Univ Milan, Dept Comp Sci, I-20122 Milan, Italy
[6] European Mol Biol Lab, Heidelberg, Germany
[7] Max Delbruck Ctr Mol Med, Berlin, Germany
关键词
MOLECULAR INTERACTION DATABASE; METABOLIC PATHWAYS; GENES; PREDICTION; VISUALIZATION; ENRICHMENT; FAMILIES; TOOLS; LISTS; FOLDS;
D O I
10.1093/nar/gks1094
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Complete knowledge of all direct and indirect interactions between proteins in a given cell would represent an important milestone towards a comprehensive description of cellular mechanisms and functions. Although this goal is still elusive, considerable progress has been made-particularly for certain model organisms and functional systems. Currently, protein interactions and associations are annotated at various levels of detail in online resources, ranging from raw data repositories to highly formalized pathway databases. For many applications, a global view of all the available interaction data is desirable, including lower-quality data and/or computational predictions. The STRING database (http://string-db.org/) aims to provide such a global perspective for as many organisms as feasible. Known and predicted associations are scored and integrated, resulting in comprehensive protein networks covering > 1100 organisms. Here, we describe the update to version 9.1 of STRING, introducing several improvements: (i) we extend the automated mining of scientific texts for interaction information, to now also include full-text articles; (ii) we entirely re-designed the algorithm for transferring interactions from one model organism to the other; and (iii) we provide users with statistical information on any functional enrichment observed in their networks.
引用
收藏
页码:D808 / D815
页数:8
相关论文
共 60 条
[1]   Ten thousand interactions for the molecular biologist [J].
Aloy, P ;
Russell, RB .
NATURE BIOTECHNOLOGY, 2004, 22 (10) :1317-1321
[2]   Resolving the Ortholog Conjecture: Orthologs Tend to Be Weakly, but Significantly, More Similar in Function than Paralogs [J].
Altenhoff, Adrian M. ;
Studer, Romain A. ;
Robinson-Rechavi, Marc ;
Dessimoz, Christophe .
PLOS COMPUTATIONAL BIOLOGY, 2012, 8 (05)
[3]   Ongoing and future developments at the Universal Protein Resource [J].
Apweiler, Rolf ;
Martin, Maria Jesus ;
O'Donovan, Claire ;
Magrane, Michele ;
Alam-Faruque, Yasmin ;
Antunes, Ricardo ;
Barrell, Daniel ;
Bely, Benoit ;
Bingley, Mark ;
Binns, David ;
Bower, Lawrence ;
Browne, Paul ;
Chan, Wei Mun ;
Dimmer, Emily ;
Eberhardt, Ruth ;
Fazzini, Francesco ;
Fedotov, Alexander ;
Foulger, Rebecca ;
Garavelli, John ;
Castro, Leyla Garcia ;
Huntley, Rachael ;
Jacobsen, Julius ;
Kleen, Michael ;
Laiho, Kati ;
Legge, Duncan ;
Lin, Quan ;
Liu, Wudong ;
Luo, Jie ;
Orchard, Sandra ;
Patient, Samuel ;
Pichler, Klemens ;
Poggioli, Diego ;
Pontikos, Nikolas ;
Pruess, Manuela ;
Rosanoff, Steven ;
Sawford, Tony ;
Sehra, Harminder ;
Turner, Edward ;
Corbett, Matt ;
Donnelly, Mike ;
van Rensburg, Pieter ;
Xenarios, Ioannis ;
Bougueleret, Lydie ;
Auchincloss, Andrea ;
Argoud-Puy, Ghislaine ;
Axelsen, Kristian ;
Bairoch, Amos ;
Baratin, Delphine ;
Blatter, Marie-Claude ;
Boeckmann, Brigitte .
NUCLEIC ACIDS RESEARCH, 2011, 39 :D214-D219
[4]   IMID: integrated molecular interaction database [J].
Balaji, Sentil ;
Mcclendon, Charles ;
Chowdhary, Rajesh ;
Liu, Jun S. ;
Zhang, Jinfeng .
BIOINFORMATICS, 2012, 28 (05) :747-749
[5]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[6]  
Caspi R, 2008, NUCLEIC ACIDS RES, V36, pD623, DOI [10.1093/nar/gkm900, 10.1093/nar/gkt1103]
[7]   PROTEINS - 1000 FAMILIES FOR THE MOLECULAR BIOLOGIST [J].
CHOTHIA, C .
NATURE, 1992, 357 (6379) :543-544
[8]   Toward automatic reconstruction of a highly resolved tree of life [J].
Ciccarelli, FD ;
Doerks, T ;
von Mering, C ;
Creevey, CJ ;
Snel, B ;
Bork, P .
SCIENCE, 2006, 311 (5765) :1283-1287
[9]   Turning a hobby into a job: How duplicated genes find new functions [J].
Conant, Gavin C. ;
Wolfe, Kenneth H. .
NATURE REVIEWS GENETICS, 2008, 9 (12) :938-950
[10]   Reactome: a database of reactions, pathways and biological processes [J].
Croft, David ;
O'Kelly, Gavin ;
Wu, Guanming ;
Haw, Robin ;
Gillespie, Marc ;
Matthews, Lisa ;
Caudy, Michael ;
Garapati, Phani ;
Gopinath, Gopal ;
Jassal, Bijay ;
Jupe, Steven ;
Kalatskaya, Irina ;
Mahajan, Shahana ;
May, Bruce ;
Ndegwa, Nelson ;
Schmidt, Esther ;
Shamovsky, Veronica ;
Yung, Christina ;
Birney, Ewan ;
Hermjakob, Henning ;
D'Eustachio, Peter ;
Stein, Lincoln .
NUCLEIC ACIDS RESEARCH, 2011, 39 :D691-D697