On bias, variance, overfitting, gold standard and consensus in single-particle analysis by cryo-electron microscopy

被引:14
|
作者
Sorzano, C. O. S. [1 ]
Jimenez-Moreno, A. [1 ]
Maluenda, D. [1 ]
Martinez, M. [1 ]
Ramirez-Aportela, E. [1 ]
Krieger, J. [1 ]
Melero, R. [1 ]
Cuervo, A. [1 ]
Conesa, J. [1 ]
Filipovic, J. [2 ]
Conesa, P. [1 ]
del Cano, L. [1 ]
Fonseca, Y. C. [1 ]
Jimenez-de la Morena, J. [1 ]
Losana, P. [1 ]
Sanchez-Garcia, R. [1 ]
Strelak, D. [1 ]
Fernandez-Gimenez, E. [1 ]
de Isidro-Gomez, F. P. [1 ]
Herreros, D. [1 ]
Vilas, J. L. [3 ]
Marabini, R. [4 ]
Carazo, J. M. [1 ]
机构
[1] Ctr Nacl Biotecnol CNB CSIC, Biocomp Unit, Calle Darwin 3, Madrid 28049, Spain
[2] Masaryk Univ, Brno, Czech Republic
[3] Yale Univ, Sch Engn & Appl Sci, New Haven, CT 06520 USA
[4] Univ Autonoma Madrid, Escuela Politecn Super, Madrid 28049, Spain
来源
ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY | 2022年 / 78卷
基金
欧盟地平线“2020”;
关键词
single-particle analysis; cryo-electron microscopy; parameter estimation; image processing; bias; variance; overfitting; gold standard; TRANSMISSION ELECTRON-MICROSCOPY; LIKELIHOOD-BASED CLASSIFICATION; TO-NOISE RATIO; CRYO-EM; 3-DIMENSIONAL RECONSTRUCTION; LOCAL RESOLUTION; ALIGNMENT; IMAGES; VALIDATION; PARAMETERS;
D O I
10.1107/S2059798322001978
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Cryo-electron microscopy (cryoEM) has become a well established technique to elucidate the 3D structures of biological macromolecules. Projection images from thousands of macromolecules that are assumed to be structurally identical are combined into a single 3D map representing the Coulomb potential of the macromolecule under study. This article discusses possible caveats along the image-processing path and how to avoid them to obtain a reliable 3D structure. Some of these problems are very well known in the community. These may be referred to as sample-related (such as specimen denaturation at interfaces or non-uniform projection geometry leading to underrepresented projection directions). The rest are related to the algorithms used. While some have been discussed in depth in the literature, such as the use of an incorrect initial volume, others have received much less attention. However, they are fundamental in any data-analysis approach. Chiefly among them, instabilities in estimating many of the key parameters that are required for a correct 3D reconstruction that occur all along the processing workflow are referred to, which may significantly affect the reliability of the whole process. In the field, the term overfitting has been coined to refer to some particular kinds of artifacts. It is argued that overfitting is a statistical bias in key parameter-estimation steps in the 3D reconstruction process, including intrinsic algorithmic bias. It is also shown that common tools (Fourier shell correlation) and strategies (gold standard) that are normally used to detect or prevent overfitting do not fully protect against it. Alternatively, it is proposed that detecting the bias that leads to overfitting is much easier when addressed at the level of parameter estimation, rather than detecting it once the particle images have been combined into a 3D map. Comparing the results from multiple algorithms (or at least, independent executions of the same algorithm) can detect parameter bias. These multiple executions could then be averaged to give a lower variance estimate of the underlying parameters.
引用
收藏
页码:410 / 423
页数:14
相关论文
共 50 条
  • [21] Single-particle 3D Reconstruction from Cryo-Electron Microscopy Images on GPU
    Tan, Guangming
    Guo, Ziyu
    Chen, Mingyu
    Meng, Dan
    ICS'09: PROCEEDINGS OF THE 2009 ACM SIGARCH INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, 2009, : 380 - 389
  • [22] Clustering Enhancement of Noisy Cryo-Electron Microscopy Single-Particle Images with a Network Structural Similarity Metric
    Yin, Shuo
    Zhang, Biao
    Yang, Yang
    Huang, Yan
    Shen, Hong-Bin
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2019, 59 (04) : 1658 - 1667
  • [23] Deep Consensus, a deep learning-based approach for particle pruning in cryo-electron microscopy
    Sanchez-Garcia, Ruben
    Segura, Joan
    Maluenda, David
    Maria Carazo, Jose
    Sorzano, Carlos Oscar S.
    IUCRJ, 2018, 5 : 854 - 865
  • [24] Model-based Reconstruction for Single Particle Cryo-Electron Microscopy
    Venkatakrishnan, S. V.
    Juneja, Puneet
    O'Neill, Hugh
    2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020, : 1390 - 1394
  • [25] Protein Secondary Structure Determination by Constrained Single-Particle Cryo-Electron Tomography
    Bartesaghi, Alberto
    Lecumberry, Federico
    Sapiro, Guillermo
    Subramaniam, Sriram
    STRUCTURE, 2012, 20 (12) : 2003 - 2013
  • [26] A basic introduction to single particles cryo-electron microscopy
    Raimondi, Vittoria
    Grinzato, Alessandro
    AIMS BIOPHYSICS, 2022, 9 (01): : 5 - 20
  • [27] Computational Protocol for Assessing the Optimal Pixel Size to Improve the Accuracy of Single-particle Cryo-electron Microscopy Maps
    Tiwari, Sandhya P.
    Chhabra, Sahil
    Tama, Florence
    Miyashita, Osamu
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2020, 60 (05) : 2570 - 2580
  • [28] Preparation of Doublet Microtubule Fraction for Single Particle Cryo-electron Microscopy
    Black, Corbin
    Dai, Daniel Chen
    Peri, Katya
    Ichikawa, Muneyoshi
    Bui, Khanh Huy
    BIO-PROTOCOL, 2021, 11 (11):
  • [29] The Vaccinia Virus DNA Helicase Structure from Combined Single-Particle Cryo-Electron Microscopy and AlphaFold2 Prediction
    Hutin, Stephanie
    Li Ling, Wai
    Tarbouriech, Nicolas
    Schoehn, Guy
    Grimm, Clemens
    Fischer, Utz
    Burmeister, Wim P.
    VIRUSES-BASEL, 2022, 14 (10):
  • [30] Collection, pre-processing and on-the-fly analysis of data for high-resolution, single-particle cryo-electron microscopy
    Thompson, Rebecca F.
    Iadanza, Matthew G.
    Hesketh, Emma L.
    Rawson, Shaun
    Ranson, Neil A.
    NATURE PROTOCOLS, 2019, 14 (01) : 100 - +