Cross-validation in cryo-EM-based structural modeling

被引:40
|
作者
Falkner, Benjamin [1 ]
Schroeder, Gunnar F. [1 ,2 ]
机构
[1] Forschungszentrum Julich, Inst Complex Syst ICS 6, D-52425 Julich, Germany
[2] Univ Dusseldorf, Dept Phys, D-40225 Dusseldorf, Germany
关键词
flexible fitting; real-space structure refinement; ELECTRON-DENSITY MAPS; PROTEIN STRUCTURES; ATOMIC STRUCTURES; RESOLUTION; REFINEMENT; CRYSTALLOGRAPHY; BIAS;
D O I
10.1073/pnas.1119041110
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Single-particle cryo-EM is a powerful approach to determine the structure of large macromolecules and assemblies thereof in many cases at subnanometer resolution. It has become popular to refine or flexibly fit atomic models into density maps derived from cryo-EM experiments. These density maps are typically significantly lower in resolution than electron density maps obtained from X-ray diffraction experiments, such that the number of parameters that need to be determined is much larger than the number of experimental observables. Overfitting and misinterpretation of the density, thus, become a serious problem. For diffraction data, a cross-validation approach was introduced almost 20 y ago; however, no such approach has been described yet for structure refinement against cryo-EM density maps, although the overfitting problem is, because of the lower resolution, significantly larger. We present a cross-validation approach for real-space refinement against cryo-EM density maps in analogy to cross-validation typically used in crystallography. Our approach is able to detect overfitting and allows for optimizing the choice of restraints used in the refinement. The approach is shown on three protein structures with simulated data and experimental data of the rotavirus double-layer particle. Because cross-validation requires splitting the dataset into at least two independent sets, we further present an approach to quantify correlations between the structure factor sets. This analysis is also helpful for other cross-validation applications, such as refinements against diffraction data or 3D reconstructions of cryo-EM density maps.
引用
收藏
页码:8930 / 8935
页数:6
相关论文
共 50 条
  • [21] On Estimating Model in Feature Selection With Cross-Validation
    Qi, Chunxia
    Diao, Jiandong
    Qiu, Like
    IEEE ACCESS, 2019, 7 : 33454 - 33463
  • [22] Predictive modeling and cryo-EM: A synergistic approach to modeling macromolecular structure
    Corum, Michael R.
    Venkannagari, Harikanth
    Hryc, Corey F.
    Baker, Matthew L.
    BIOPHYSICAL JOURNAL, 2024, 123 (04) : 435 - 450
  • [23] Outcomes of the EMDataResource cryo-EM Ligand Modeling Challenge
    Lawson, Catherine L.
    Kryshtafovych, Andriy
    Pintilie, Grigore D.
    Burley, Stephen K.
    Cerny, Jiri
    Chen, Vincent B.
    Emsley, Paul
    Gobbi, Alberto
    Joachimiak, Andrzej
    Noreng, Sigrid
    Prisant, Michael G.
    Read, Randy J.
    Richardson, Jane S.
    Rohou, Alexis L.
    Schneider, Bohdan
    Sellers, Benjamin D.
    Shao, Chenghua
    Sourial, Elizabeth
    Williams, Chris I.
    Williams, Christopher J.
    Yang, Ying
    Abbaraju, Venkat
    Afonine, Pavel V.
    Baker, Matthew L.
    Bond, Paul S.
    Blundell, Tom L.
    Burnley, Tom
    Campbell, Arthur
    Cao, Renzhi
    Cheng, Jianlin
    Chojnowski, Grzegorz
    Cowtan, K. D.
    DiMaio, Frank
    Esmaeeli, Reza
    Giri, Nabin
    Grubmueller, Helmut
    Hoh, Soon Wen
    Hou, Jie
    Hryc, Corey F.
    Hunte, Carola
    Igaev, Maxim
    Joseph, Agnel P.
    Kao, Wei-Chun
    Kihara, Daisuke
    Kumar, Dilip
    Lang, Lijun
    Lin, Sean
    Subramaniya, Sai R. Maddhuri Venkata
    Mittal, Sumit
    Mondal, Arup
    NATURE METHODS, 2024, 21 (07) : 1340 - 1348
  • [24] THEORETICAL ANALYSES OF CROSS-VALIDATION ERROR AND VOTING IN INSTANCE-BASED LEARNING
    TURNEY, P
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 1994, 6 (04) : 331 - 360
  • [25] Cryo-EM model validation using independent map reconstructions
    DiMaio, Frank
    Zhang, Junjie
    Chiu, Wah
    Baker, David
    PROTEIN SCIENCE, 2013, 22 (06) : 865 - 868
  • [26] Cryo-EM Map-Based Model Validation Using the False Discovery Rate Approach
    Olek, Mateusz
    Joseph, Agnel Praveen
    FRONTIERS IN MOLECULAR BIOSCIENCES, 2021, 8
  • [27] Structure of γ-tubulin small complex based on a cryo-EM map, chemical cross-links, and a remotely related structure
    Greenberg, Charles H.
    Kollman, Justin
    Zelter, Alex
    Johnson, Richard
    MacCoss, Michael J.
    Davis, Trisha N.
    Agard, David A.
    Sali, Andrej
    JOURNAL OF STRUCTURAL BIOLOGY, 2016, 194 (03) : 303 - 310
  • [28] Cross-validation as a means of investigating DEM interpolation error
    Wise, Stephen
    COMPUTERS & GEOSCIENCES, 2011, 37 (08) : 978 - 991
  • [29] Structural Insights of WHAMM's Interaction with Microtubules by Cryo-EM
    Liu, Tianyang
    Dai, Anbang
    Cao, Yong
    Zhang, Rui
    Dong, Meng-Qiu
    Wang, Hong-Wei
    JOURNAL OF MOLECULAR BIOLOGY, 2017, 429 (09) : 1352 - 1363
  • [30] De novo computational RNA modeling into cryo-EM maps of large ribonucleoprotein complexes
    Kappel, Kalli
    Liu, Shiheng
    Larsen, Kevin P.
    Skiniotis, Georgios
    Puglisi, Elisabetta Viani
    Puglisi, Joseph D.
    Zhou, Z. Hong
    Zhao, Rui
    Das, Rhiju
    NATURE METHODS, 2018, 15 (11) : 947 - +