Progressive assembly of multi-domain protein structures from cryo-EM density maps

被引:35
作者
Zhou, Xiaogen [1 ,2 ]
Li, Yang [1 ]
Zhang, Chengxin [1 ]
Zheng, Wei [1 ]
Zhang, Guijun [2 ]
Zhang, Yang [1 ,3 ]
机构
[1] Univ Michigan, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
[2] Zhejiang Univ Technol, Coll Informat Engn, Hangzhou, Peoples R China
[3] Univ Michigan, Dept Biol Chem, Ann Arbor, MI 48109 USA
来源
NATURE COMPUTATIONAL SCIENCE | 2022年 / 2卷 / 04期
基金
美国国家科学基金会;
关键词
REFINEMENT; VALIDATION; ALGORITHM; EVOLUTION; MODEL;
D O I
10.1038/s43588-022-00232-1
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Progress in cryo-electron microscopy has provided the potential for large-size protein structure determination. However, the success rate for solving multi-domain proteins remains low because of the difficulty in modelling inter-domain orientations. Here we developed domain enhanced modeling using cryo-electron microscopy (DEMO-EM), an automatic method to assemble multi-domain structures from cryo-electron microscopy maps through a progressive structural refinement procedure combining rigid-body domain fitting and flexible assembly simulations with deep-neural-network inter-domain distance profiles. The method was tested on a large-scale benchmark set of proteins containing up to 12 continuous and discontinuous domains with medium- to low-resolution density maps, where DEMO-EM produced models with correct inter-domain orientations (template modeling score (TM-score) >0.5) for 97% of cases and outperformed state-of-the-art methods. DEMO-EM was applied to the severe acute respiratory syndrome coronavirus 2 genome and generated models with average TM-score and root-mean-square deviation of 0.97 and 1.3 angstrom, respectively, with respect to the deposited structures. These results demonstrate an efficient pipeline that enables automated and reliable large-scale multi-domain protein structure modelling from cryo-electron microscopy maps.
引用
收藏
页码:265 / +
页数:12
相关论文
共 60 条
[1]   Waterworks-specific composition of drinking water disinfection by-products [J].
Andersson, Anna ;
Harir, Mourad ;
Gonsior, Michael ;
Hertkorn, Norbert ;
Schmitt-Kopplin, Philippe ;
Kylin, Henrik ;
Karlsson, Susanne ;
Ashiq, Muhammad Jamshaid ;
Lavonen, Elin ;
Nilsson, Kerstin ;
Pettersson, AEmma ;
Stavklint, Helena ;
Bastviken, David .
ENVIRONMENTAL SCIENCE-WATER RESEARCH & TECHNOLOGY, 2019, 5 (05) :861-872
[2]  
Barad BA, 2015, NAT METHODS, V12, P943, DOI [10.1038/NMETH.3541, 10.1038/nmeth.3541]
[3]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
EUROPEAN JOURNAL OF BIOCHEMISTRY, 1977, 80 (02) :319-324
[4]   Structure of the human MHC-I peptide-loading complex [J].
Blees, Andreas ;
Januliene, Dovile ;
Hofmann, Tommy ;
Koller, Nicole ;
Schmidt, Carla ;
Trowitzsch, Simon ;
Moeller, Arne ;
Tampe, Robert .
NATURE, 2017, 551 (7681) :525-+
[5]   Multi-resolution contour-based fitting of macromolecular structures [J].
Chacón, P ;
Wriggers, W .
JOURNAL OF MOLECULAR BIOLOGY, 2002, 317 (03) :375-384
[6]   MolProbity: all-atom structure validation for macromolecular crystallography [J].
Chen, Vincent B. ;
Arendall, W. Bryan, III ;
Headd, Jeffrey J. ;
Keedy, Daniel A. ;
Immormino, Robert M. ;
Kapral, Gary J. ;
Murray, Laura W. ;
Richardson, Jane S. ;
Richardson, David C. .
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2010, 66 :12-21
[7]   Evolution of the protein repertoire [J].
Chothia, C ;
Gough, J ;
Vogel, C ;
Teichmann, SA .
SCIENCE, 2003, 300 (5626) :1701-1703
[8]   The Buccaneer software for automated model building.: 1.: Tracing protein chains [J].
Cowtan, Kevin .
ACTA CRYSTALLOGRAPHICA SECTION D-BIOLOGICAL CRYSTALLOGRAPHY, 2006, 62 :1002-1011
[9]   LMProt:: An efficient algorithm for Monte Carlo sampling of protein conformational space [J].
da Silva, RA ;
Degrève, L ;
Caliri, A .
BIOPHYSICAL JOURNAL, 2004, 87 (03) :1567-1577
[10]  
DiMaio F, 2015, NAT METHODS, V12, P361, DOI [10.1038/NMETH.3286, 10.1038/nmeth.3286]