Protein structure determination by exhaustive search of Protein Data Bank derived databases

被引:41
作者
Stokes-Rees, Ian [1 ]
Sliz, Piotr [1 ,2 ]
机构
[1] Harvard Univ, Sch Med, Dept Biol Chem & Mol Pharmacol, Boston, MA 02115 USA
[2] Childrens Hosp, Mol Med Lab, Boston, MA 02115 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
p97; ATPase; likelihood functions; scoring methods; grid computing; MOLECULAR-REPLACEMENT; MODELS; PIPELINE; REVEALS; SCIENCE; IMPACT; MAPS; SCOP; P97;
D O I
10.1073/pnas.1012095107
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Parallel sequence and structure alignment tools have become ubiquitous and invaluable at all levels in the study of biological systems. We demonstrate the application and utility of this same parallel search paradigm to the process of protein structure determination, benefitting from the large and growing corpus of known structures. Such searches were previously computationally intractable. Through the method of Wide Search Molecular Replacement, developed here, they can be completed in a few hours with the aide of national-scale federated cyberinfrastructure. By dramatically expanding the range of models considered for structure determination, we show that small (less than 12% structural coverage) and low sequence identity (less than 20% identity) template structures can be identified through multidimensional template scoring metrics and used for structure determination. Many new macromolecular complexes can benefit significantly from such a technique due to the lack of known homologous protein folds or sequences. We demonstrate the effectiveness of the method by determining the structure of a full-length p97 homologue from Trichoplusia ni. Example cases with the MHC/T-cell receptor complex and the EmoB protein provide systematic estimates of minimum sequence identity, structure coverage, and structural similarity required for this method to succeed. We describe how this structure-search approach and other novel computationally intensive workflows are made tractable through integration with the US national computational cyberinfrastructure, allowing, for example, rapid processing of the entire Structural Classification of Proteins protein fragment database.
引用
收藏
页码:21476 / 21481
页数:6
相关论文
共 37 条
[1]   Data growth and its impact on the SCOP database: new developments [J].
Andreeva, Antonina ;
Howorth, Dave ;
Chandonia, John-Marc ;
Brenner, Steven E. ;
Hubbard, Tim J. P. ;
Chothia, Cyrus ;
Murzin, Alexey G. .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D419-D425
[2]  
[Anonymous], 2007, Workflows for e-science. p, DOI DOI 10.1007/978-1-84628-757-222
[3]   THE CCP4 SUITE - PROGRAMS FOR PROTEIN CRYSTALLOGRAPHY [J].
BAILEY, S .
ACTA CRYSTALLOGRAPHICA SECTION D-BIOLOGICAL CRYSTALLOGRAPHY, 1994, 50 :760-763
[4]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[5]   Probing the limits of the molecular replacement method: the case of Trypanosoma brucei phosphoglycerate kinase [J].
Bernstein, BE ;
Hol, WGJ .
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 1997, 53 :756-764
[6]   Analysis of nucleotide binding to p97 reveals the properties of a tandem AAA hexameric ATPase [J].
Briggs, Louise C. ;
Baldwin, Geoff S. ;
Miyata, Non ;
Kondo, Hisao ;
Zhang, Xiaodong ;
Freemont, Paul S. .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2008, 283 (20) :13745-13752
[7]   Discrimination between distant homologs and structural analogs: Lessons from manually constructed, reliable data sets [J].
Cheng, Hua ;
Kim, Bong-Hyun ;
Grishin, Nick V. .
JOURNAL OF MOLECULAR BIOLOGY, 2008, 377 (04) :1265-1278
[8]  
David M, 2009, P 14 INT C ARCH SUPP
[9]   Improved structures of full-length p97, an AAA ATPase: Implications for mechanisms of nucleotide-dependent conformational change [J].
Davies, Jason M. ;
Brunger, Axel T. ;
Weis, William I. .
STRUCTURE, 2008, 16 (05) :715-726
[10]   Complete structure of p97/valosin-containing protein reveals communication between nucleotide domains [J].
DeLaBarre, B ;
Brunger, AT .
NATURE STRUCTURAL BIOLOGY, 2003, 10 (10) :856-863