Using Phaser and ensembles to improve the performance of SIMBAD

被引:7
作者
Simpkin, Adam J. [1 ,2 ]
Simkovic, Felix [1 ]
Thomas, Jens M. H. [1 ]
Savko, Martin [2 ]
Lebedev, Andrey [3 ,4 ]
Uski, Ville [3 ,4 ]
Ballard, Charles C. [3 ,4 ]
Wojdyr, Marcin [5 ]
Shepard, William [2 ]
Rigdena, Daniel J. [1 ]
Keegana, Ronan M. [1 ,3 ,4 ]
机构
[1] Univ Liverpool, Inst Integrat Biol, Liverpool L69 7ZB, Merseyside, England
[2] Synchrotron SOLEIL, BP 48, F-91192 Gif Sur Yvette, France
[3] Rutherford Appleton Lab, STEC, Didcot OX11 0FA, Oxon, England
[4] Rutherford Appleton Lab, CCP4, Res Complex & Harwell, Didcot OX11 0FA, Oxon, England
[5] Global Phasing Ltd, Cambridge CB3 0AX, England
来源
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY | 2020年 / 76卷
基金
英国生物技术与生命科学研究理事会;
关键词
molecular-replacement pipeline; contaminants; structure solution; SIMBAD; ensembles; sequence independent; MOLECULAR-REPLACEMENT; CRYSTAL-STRUCTURES; SEARCH MODELS; PROTEINS; AMPLE; SOLVE;
D O I
10.1107/S2059798319015031
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The conventional approach to search-model identification in molecular replacement (MR) is to screen a database of known structures using the target sequence. However, this strategy is not always effective, for example when the relationship between sequence and structural similarity fails or when the crystal contents are not those expected. An alternative approach is to identify suitable search models directly from the experimental data. SIMBAD is a sequence-independent MR pipeline that uses either a crystal lattice search or MR functions to directly locate suitable search models from databases. The previous version of SIMBAD used the fast AMoRe rotation-function search. Here, a new version of SIMBAD which makes use of Phaser and its likelihood scoring to improve the sensitivity of the pipeline is presented. It is shown that the additional compute time potentially required by the more sophisticated scoring is counterbalanced by the greater sensitivity, allowing more cases to trigger early-termination criteria, rather than running to completion. Using Phaser solved 17 out of 25 test cases in comparison to the ten solved with AMoRe, and it is shown that use of ensemble search models produces additional performance benefits.
引用
收藏
页码:1 / 8
页数:8
相关论文
共 34 条
[1]   PHENIX: a comprehensive Python']Python-based system for macromolecular structure solution [J].
Adams, Paul D. ;
Afonine, Pavel V. ;
Bunkoczi, Gabor ;
Chen, Vincent B. ;
Davis, Ian W. ;
Echols, Nathaniel ;
Headd, Jeffrey J. ;
Hung, Li-Wei ;
Kapral, Gary J. ;
Grosse-Kunstleve, Ralf W. ;
McCoy, Airlie J. ;
Moriarty, Nigel W. ;
Oeffner, Robert ;
Read, Randy J. ;
Richardson, David C. ;
Richardson, Jane S. ;
Terwilliger, Thomas C. ;
Zwart, Peter H. .
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2010, 66 :213-221
[2]   AMPLE: a cluster-and-truncate approach to solve the crystal structures of small proteins using rapidly computed ab initio models [J].
Bibby, Jaclyn ;
Keegan, Ronan M. ;
Mayans, Olga ;
Winn, Martyn D. ;
Rigden, Daniel J. .
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2012, 68 :1622-1631
[3]   Improvement of molecular-replacement models with Sculptor [J].
Bunkoczi, Gabor ;
Read, Randy J. .
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2011, 67 :303-312
[4]   Does NMR mean "not for molecular replacement"? Using NMR-Based search models to solve protein crystal structures [J].
Chen, YW ;
Dodson, EJ ;
Kleywegt, GJ .
STRUCTURE, 2000, 8 (11) :R213-R220
[5]   Accelerated Profile HMM Searches [J].
Eddy, Sean R. .
PLOS COMPUTATIONAL BIOLOGY, 2011, 7 (10)
[6]   An introduction to molecular replacement [J].
Evans, Philip ;
McCoy, Airlie .
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2008, 64 :1-10
[7]   Determination of crystal structures of proteins of unknown identity using a marathon molecular replacement procedure: structure of Stenotrophomonas maltophilia phosphate-binding protein [J].
Hatti, Kaushik ;
Gulati, Ashutosh ;
Srinivasan, Narayanaswamy ;
Murthy, M. R. N. .
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2016, 72 :1081-1089
[8]   Matthews coefficient probabilities: Improved estimates for unit cell contents of proteins, DNA, and protein-nucleic acid complex crystals [J].
Kantardjieff, KA ;
Rupp, B .
PROTEIN SCIENCE, 2003, 12 (09) :1865-1871
[9]   Recent developments in MrBUMP: better search-model preparation, graphical interaction with search models, and solution improvement and assessment [J].
Keegan, Ronan M. ;
McNicholas, Stuart J. ;
Thomas, Jens M. H. ;
Simpkin, Adam J. ;
Simkovic, Felix ;
Uski, Ville ;
Ballard, Charles C. ;
Winn, Martyn D. ;
Wilson, Keith S. ;
Rigden, Daniel J. .
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2018, 74 :167-182
[10]   Exploring the speed and performance of molecular replacement with AMPLE using QUARK ab initio protein models [J].
Keegan, Ronan M. ;
Bibby, Jaclyn ;
Thomas, Jens ;
Xu, Dong ;
Zhang, Yang ;
Mayans, Olga ;
Winn, Martyn D. ;
Rigden, Daniel J. .
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2015, 71 :338-343