Cross-validation in cryo-EM-based structural modeling

被引:40
|
作者
Falkner, Benjamin [1 ]
Schroeder, Gunnar F. [1 ,2 ]
机构
[1] Forschungszentrum Julich, Inst Complex Syst ICS 6, D-52425 Julich, Germany
[2] Univ Dusseldorf, Dept Phys, D-40225 Dusseldorf, Germany
关键词
flexible fitting; real-space structure refinement; ELECTRON-DENSITY MAPS; PROTEIN STRUCTURES; ATOMIC STRUCTURES; RESOLUTION; REFINEMENT; CRYSTALLOGRAPHY; BIAS;
D O I
10.1073/pnas.1119041110
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Single-particle cryo-EM is a powerful approach to determine the structure of large macromolecules and assemblies thereof in many cases at subnanometer resolution. It has become popular to refine or flexibly fit atomic models into density maps derived from cryo-EM experiments. These density maps are typically significantly lower in resolution than electron density maps obtained from X-ray diffraction experiments, such that the number of parameters that need to be determined is much larger than the number of experimental observables. Overfitting and misinterpretation of the density, thus, become a serious problem. For diffraction data, a cross-validation approach was introduced almost 20 y ago; however, no such approach has been described yet for structure refinement against cryo-EM density maps, although the overfitting problem is, because of the lower resolution, significantly larger. We present a cross-validation approach for real-space refinement against cryo-EM density maps in analogy to cross-validation typically used in crystallography. Our approach is able to detect overfitting and allows for optimizing the choice of restraints used in the refinement. The approach is shown on three protein structures with simulated data and experimental data of the rotavirus double-layer particle. Because cross-validation requires splitting the dataset into at least two independent sets, we further present an approach to quantify correlations between the structure factor sets. This analysis is also helpful for other cross-validation applications, such as refinements against diffraction data or 3D reconstructions of cryo-EM density maps.
引用
收藏
页码:8930 / 8935
页数:6
相关论文
共 50 条
  • [1] Cross-Validation of Data in SAXS and Cryo-EM
    Afsari, Bijan
    Kim, Jin Seob
    Chirikjian, Gregory S.
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 1224 - 1230
  • [3] Need for Cross-Validation of Single Particle Cryo-EM
    Cossio, Pilar
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2020, 60 (05) : 2413 - 2418
  • [4] Cross-validation EM training for robust parameter estimation
    Shinozaki, T.
    Ostendorf, M.
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 437 - +
  • [5] Theoretical Analysis of Cross-Validation(CV)-EM Algorithm
    Takenouchi, Takashi
    Ikeda, Kazushi
    ARTIFICIAL NEURAL NETWORKS (ICANN 2010), PT III, 2010, 6354 : 321 - 326
  • [6] Cross-validation and aggregated EM training for robust parameter estimation
    Shinozaki, Takahiro
    Ostendorf, Mari
    COMPUTER SPEECH AND LANGUAGE, 2008, 22 (02): : 185 - 195
  • [7] Cross-validation is dead. Long live cross-validation! Model validation based on resampling
    Knut Baumann
    Journal of Cheminformatics, 2 (Suppl 1)
  • [8] Cryo-EM-based structural insights into supramolecular assemblies of y-hemolysin from S. aureus reveal the pore formation mechanism
    Mishra, Suman
    Roy, Anupam
    Dutta, Somnath
    STRUCTURE, 2023, 31 (06) : 651 - +
  • [9] Analyzing cross-validation for forecasting with structural instability
    Hirano, Keisuke
    Wright, Jonathan H.
    JOURNAL OF ECONOMETRICS, 2022, 226 (01) : 139 - 154
  • [10] Damage recognition process in DNA repair pathway: cryo-EM-based analysis of the UvrAUvrB complex in Mycobacterium tuberculosis
    Genta, M.
    Ferrara, G.
    Bolognesi, M.
    Rossi, F.
    Rizzi, M.
    Chaves, A.
    Miggiano, R.
    FEBS OPEN BIO, 2024, 14 : 64 - 64