findMySequence: a neural-network-based approach for identification of unknown proteins in X-ray crystallography and cryo-EM

Cited by: 39
Authors
Chojnowski, Grzegorz [1 ]
Simpkin, Adam J. [2 ]
Leonardo, Diego A. [3 ]
Seifert-Davila, Wolfram [4 ]
Vivas-Ruiz, Dan E. [5 ]
Keegan, Ronan M. [6 ]
Rigden, Daniel J. [2 ]
Affiliations
[1] European Mol Biol Lab, Hamburg Unit, Notkestr 85, D-22607 Hamburg, Germany
[2] Univ Liverpool, Inst Syst Mol & Integrat Biol, Liverpool L69 7ZB, Merseyside, England
[3] Univ Sao Paulo, Sao Carlos Inst Phys, Ave Joao Dagnone 1100, BR-13563120 Sao Carlos, SP, Brazil
[4] European Mol Biol Lab, Meyerhofstr 1, D-69117 Heidelberg, Germany
[5] Univ Nacl Mayor San Marcos, Fac Ciencias Biol, Lab Biologia Mol, Ave Venezuela Cdra 34 S-N,Ciudad Univ, Lima, Peru
[6] Rutherford Appleton Lab, UKRI STFC, Res Complex Harwell, Didcot OX11 0FA, Oxon, England
Funding
Biotechnology and Biological Sciences Research Council (BBSRC), UK;
Keywords
protein structures; protein sequences; SIMBAD; cryo-EM; bioinformatics; structure determination; findMySequence; neural networks; STENOTROPHOMONAS-MALTOPHILIA; ANGSTROM RESOLUTION; SEQUENCE ASSIGNMENT; CRYSTAL-STRUCTURE; REFINEMENT; BINDING; NEUTRALIZATION; COMPLEX; SEARCH; VENOMS;
DOI
10.1107/S2052252521011088
Chinese Library Classification (CLC) number
O6 [Chemistry];
Discipline classification code
0703 ;
Abstract
Although experimental protein-structure determination usually targets known proteins, chains of unknown sequence are often encountered. They can be purified from natural sources, appear as an unexpected fragment of a well characterized protein or appear as a contaminant. Regardless of the source of the problem, the unknown protein always requires characterization. Here, an automated pipeline is presented for the identification of protein sequences from cryo-EM reconstructions and crystallographic data. The method's application to characterize the crystal structure of an unknown protein purified from a snake venom is presented. It is also shown that the approach can be successfully applied to the identification of protein sequences and validation of sequence assignments in cryo-EM protein structures.
Pages: 86 / +
Page count: 15
Related papers
70 in total
[51]   Automatic local resolution-based sharpening of cryo-EM maps [J].
Ramirez-Aportela, Erney ;
Luis Vilas, Jose ;
Glukhova, Alisa ;
Melero, Roberto ;
Conesa, Pablo ;
Martinez, Marta ;
Maluenda, David ;
Mota, Javier ;
Jimenez, Amaya ;
Vargas, Javier ;
Marabini, Roberto ;
Sexton, Patrick M. ;
Maria Carazo, Jose ;
Sorzano, Carlos Oscar S. .
BIOINFORMATICS, 2020, 36 (03) :765-772
[52]   Evolutionary shift toward protein-based architecture in trypanosomal mitochondrial ribosomes [J].
Ramrath, David J. F. ;
Niemann, Moritz ;
Leibundgut, Marc ;
Bieri, Philipp ;
Prange, Celine ;
Horn, Elke K. ;
Leitner, Alexander ;
Boehringer, Daniel ;
Schneider, Andre ;
Ban, Nenad .
SCIENCE, 2018, 362 (6413)
[53]   The 3.5-Å CryoEM Structure of Nanodisc-Reconstituted Yeast Vacuolar ATPase Vo Proton Channel [J].
Roh, Soung-Hun ;
Stam, Nicholas J. ;
Hryc, Corey F. ;
Couoh-Cardel, Sergio ;
Pintilie, Grigore ;
Chiu, Wah ;
Wilkens, Stephan .
MOLECULAR CELL, 2018, 69 (06) :993-+
[54]  
SHAPIRO SS, 1965, BIOMETRIKA, V52, P591, DOI 10.2307/2333709
[55]   Using Phaser and ensembles to improve the performance of SIMBAD [J].
Simpkin, Adam J. ;
Simkovic, Felix ;
Thomas, Jens M. H. ;
Savko, Martin ;
Lebedev, Andrey ;
Uski, Ville ;
Ballard, Charles C. ;
Wojdyr, Marcin ;
Shepard, William ;
Rigden, Daniel J. ;
Keegan, Ronan M. .
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2020, 76 :1-8
[56]   SIMBAD: a sequence-independent molecular-replacement pipeline [J].
Simpkin, Adam J. ;
Simkovic, Felix ;
Thomas, Jens M. H. ;
Savko, Martin ;
Lebedev, Andrey ;
Uski, Ville ;
Ballard, Charles ;
Wojdyr, Marcin ;
Wu, Rui ;
Sanishvili, Ruslan ;
Xu, Yibin ;
Lisa, Maria-Natalia ;
Buschiazzo, Alejandro ;
Shepard, William ;
Rigden, Daniel J. ;
Keegan, Ronan M. .
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2018, 74 :595-605
[57]   Protein structure determination by exhaustive search of Protein Data Bank derived databases [J].
Stokes-Rees, Ian ;
Sliz, Piotr .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (50) :21476-21481
[58]   Multi-particle cryo-EM refinement with M visualizes ribosome-antibiotic complex at 3.5 Å in cells [J].
Tegunov, Dimitry ;
Xue, Liang ;
Dienemann, Christian ;
Cramer, Patrick ;
Mahamid, Julia .
NATURE METHODS, 2021, 18 (02) :186-+
[59]   De novo main-chain modeling for EM maps using MAINMAST [J].
Terashi, Genki ;
Kihara, Daisuke .
NATURE COMMUNICATIONS, 2018, 9
[60]   Automated side-chain model building and sequence assignment by template matching [J].
Terwilliger, TC .
ACTA CRYSTALLOGRAPHICA SECTION D-BIOLOGICAL CRYSTALLOGRAPHY, 2003, 59 :45-49