Automatic post-picking using MAPPOS improves particle image detection from cryo-EM micrographs

被引:21
作者
Norousi, Ramin [1 ]
Wickles, Stephan [2 ,3 ]
Leidig, Christoph [2 ,3 ]
Becker, Thomas [2 ,3 ]
Schmid, Volker J. [1 ]
Beckmann, Roland [2 ,3 ]
Tresch, Achim [4 ,5 ]
机构
[1] Univ Munich, Dept Stat, Munich, Germany
[2] Univ Munich, Ctr Integrated Prot Sci, Munich, Germany
[3] Univ Munich, Munich Ctr Adv Photon Gene Ctr, Dept Biochem, Munich, Germany
[4] Max Planck Inst Plant, Cologne, Germany
[5] Univ Cologne, Inst Genet, D-50931 Cologne, Germany
关键词
Electron microscopy; Particle picking; Machine learning; Classification ensemble; 3D cryo-EM density map; SINGLE-PARTICLE; STRUCTURAL BASIS; SELECTION; TEMPLATE; SPIDER;
D O I
10.1016/j.jsb.2013.02.008
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Cryo-electron microscopy (cryo-EM) studies using single particle reconstruction are extensively used to reveal structural information on macromolecular complexes. Aiming at the highest achievable resolution, state of the art electron microscopes automatically acquire thousands of high-quality micrographs. Particles are detected on and boxed out from each micrograph using fully- or semi-automated approaches. However, the obtained particles still require laborious manual post-picking classification, which is one major bottleneck for single particle analysis of large datasets. We introduce MAPPOS, a supervised post-picking strategy for the classification of boxed particle images, as additional strategy adding to the already efficient automated particle picking routines. MAPPOS employs machine learning techniques to train a robust classifier from a small number of characteristic image features. In order to accurately quantify the performance of MAPPOS we used simulated particle and non-particle images. In addition, we verified our method by applying it to an experimental cryo-EM dataset and comparing the results to the manual classification of the same dataset Comparisons between MAPPOS and manual post-picking classification by several human experts demonstrated that merely a few hundred sample images are sufficient for MAPPOS to classify an entire dataset with a human-like performance. MAPPOS was shown to greatly accelerate the throughput of large datasets by reducing the manual workload by orders of magnitude while maintaining a reliable identification of non-particle images. (c) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:59 / 66
页数:8
相关论文
共 37 条
[1]   Particle picking by segmentation: A comparative study with SPIDER-based manual particle picking [J].
Adiga, U ;
Baxter, WT ;
Hall, RJ ;
Rockel, B ;
Rath, BK ;
Frank, J ;
Glaeser, R .
JOURNAL OF STRUCTURAL BIOLOGY, 2005, 152 (03) :211-220
[2]  
[Anonymous], 1973, Pattern Classification and Scene Analysis
[3]   Experimental evaluation of support vector machine-based and correlation-based approaches to automatic particle selection [J].
Arbelaez, Pablo ;
Han, Bong-Gyoon ;
Typke, Dieter ;
Lim, Joseph ;
Glaeser, Robert M. ;
Malik, Jitendra .
JOURNAL OF STRUCTURAL BIOLOGY, 2011, 175 (03) :319-328
[4]   Determination of signal-to-noise ratios and spectral SNRs in cryo-EM low-dose imaging of molecules [J].
Baxter, William T. ;
Grassucci, Robert A. ;
Gao, Haixiao ;
Frank, Joachim .
JOURNAL OF STRUCTURAL BIOLOGY, 2009, 166 (02) :126-132
[5]   Structural basis of highly conserved ribosome recycling in eukaryotes and archaea [J].
Becker, Thomas ;
Franckenberg, Sibylle ;
Wickles, Stephan ;
Shoemaker, Christopher J. ;
Anger, Andreas M. ;
Armache, Jean-Paul ;
Sieber, Heidemarie ;
Ungewickell, Charlotte ;
Berninghausen, Otto ;
Daberkow, Ingo ;
Karcher, Annette ;
Thomm, Michael ;
Hopfner, Karl-Peter ;
Green, Rachel ;
Beckmann, Roland .
NATURE, 2012, 482 (7386) :501-U221
[6]   Structural basis for aminoglycoside inhibition of bacterial ribosome recycling [J].
Borovinskaya, Maria A. ;
Pai, Raj D. ;
Zhang, Wen ;
Schuwirth, Barbara S. ;
Holton, James M. ;
Hirokawa, Go ;
Kaji, Hideko ;
Kaji, Akira ;
Cate, Jamie H. Doudna .
NATURE STRUCTURAL & MOLECULAR BIOLOGY, 2007, 14 (08) :727-732
[7]   The use of the area under the roc curve in the evaluation of machine learning algorithms [J].
Bradley, AP .
PATTERN RECOGNITION, 1997, 30 (07) :1145-1159
[8]  
Breiman L, 1996, MACH LEARN, V24, P123, DOI 10.1023/A:1018054314350
[9]  
BURMA DP, 1985, J BIOL CHEM, V260, P517