Massively parallel unsupervised single-particle cryo-EM data clustering via statistical manifold learning

被引:28
|
作者
Wu, Jiayi [1 ,2 ]
Ma, Yong-Bei [2 ]
Congdon, Charles [3 ]
Brett, Bevin [3 ]
Chen, Shuobing [1 ,2 ]
Xu, Yaofang [2 ,4 ]
Ouyang, Qi [1 ,5 ]
Mao, Youdong [1 ,2 ,6 ]
机构
[1] Peking Univ, Sch Phys, State Key Lab Artificial Microstruct & Mesoscop P, Inst Condensed Matter Phys,Ctr Quantitat Biol, Beijing, Peoples R China
[2] Dana Farber Canc Inst, Intel Parallel Comp Ctr Struct Biol, Boston, MA 02115 USA
[3] Intel Corp, Software & Serv Grp, Santa Clara, CA USA
[4] Peking Univ, Hlth Sci Ctr, Dept Biophys, Beijing, Peoples R China
[5] Peking Univ, Peking Tsinghua Joint Ctr Life Sci, Beijing, Peoples R China
[6] Harvard Med Sch, Dept Microbiol & Immunobiol, Boston, MA USA
来源
PLOS ONE | 2017年 / 12卷 / 08期
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
NONLINEAR DIMENSIONALITY REDUCTION; MICROSCOPY; CLASSIFICATION; PROJECTION; MACROMOLECULES; IMAGES; SPARX; SUITE; XMIPP;
D O I
10.1371/journal.pone.0182130
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Structural heterogeneity in single-particle cryo-electron microscopy (cryo-EM) data represents a major challenge for high-resolution structure determination. Unsupervised classification may serve as the first step in the assessment of structural heterogeneity. However, traditional algorithms for unsupervised classification, such as K-means clustering and maximum likelihood optimization, may classify images into wrong classes with decreasing signal-to-noise-ratio (SNR) in the image data, yet demand increased computational costs. Overcoming these limitations requires further development of clustering algorithms for high-performance cryo-EM data processing. Here we introduce an unsupervised single-particle clustering algorithm derived from a statistical manifold learning framework called generative topographic mapping (GTM). We show that unsupervised GTM clustering improves classification accuracy by about 40% in the absence of input references for data with lower SNRs. Applications to several experimental datasets suggest that our algorithm can detect subtle structural differences among classes via a hierarchical clustering strategy. After code optimization over a high-performance computing (HPC) environment, our software implementation was able to generate thousands of reference-free class averages within hours in a massively parallel fashion, which allows a significant improvement on ab initio 3D reconstruction and assists in the computational purification of homogeneous datasets for high-resolution visualization.
引用
收藏
页数:25
相关论文
共 50 条
  • [41] Determination of the ribosome structure to a resolution of 2.5 Å by single-particle cryo-EM
    Liu, Zheng
    Gutierrez-Vargas, Cristina
    Wei, Jia
    Grassucci, Robert A.
    Sun, Ming
    Espina, Noel
    Madison-Antenucci, Susan
    Tong, Liang
    Frank, Joachim
    PROTEIN SCIENCE, 2017, 26 (01) : 82 - 92
  • [42] Seeing Atoms: Single-Particle Cryo-EM Breaks the Atomic Barrier
    Tan, Yong Zi
    Carragher, Bridget
    MOLECULAR CELL, 2020, 80 (06) : 938 - 939
  • [43] Evaluating the accuracy of classifying heterogeneous single-particle cryo-EM datasets
    Arnold, William R.
    BIOPHYSICAL JOURNAL, 2022, 121 (03) : 131 - 131
  • [44] Photoreception and signaling in bacterial phytochrome revealed by single-particle cryo-EM
    Malla, Tek Narsingh
    Hernandez, Carolina
    Muniyappan, Srinivasan
    Menendez, David
    Bizhga, Dorina
    Mendez, Joshua H.
    Schwander, Peter
    Stojkovic, Emina A.
    Schmidt, Marius
    SCIENCE ADVANCES, 2024, 10 (32):
  • [45] Recent Technical Advances in Sample Preparation for Single-Particle Cryo-EM
    Xu, Yixin
    Dang, Shangyu
    FRONTIERS IN MOLECULAR BIOSCIENCES, 2022, 9
  • [46] FAST WAVELET-BASED SINGLE-PARTICLE RECONSTRUCTION IN CRYO-EM
    Vonesch, Cedric
    Wang, Lanhui
    Shkolnisky, Yoel
    Singer, Amit
    2011 8TH IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING: FROM NANO TO MACRO, 2011, : 1950 - 1953
  • [47] Functionalized graphene grids with various charges for single-particle cryo-EM
    Ye Lu
    Nan Liu
    Yongbo Liu
    Liming Zheng
    Junhao Yang
    Jia Wang
    Xia Jia
    Qinru Zi
    Hailin Peng
    Yu Rao
    Hong-Wei Wang
    Nature Communications, 13
  • [48] Classifying liganded states in heterogeneous single-particle cryo-EM datasets
    Arnold, William R.
    Asarnow, Daniel
    Cheng, Yifan
    MICROSCOPY, 2022, 71 : i23 - i29
  • [49] Single-particle cryo-EM of the ryanodine receptor channel in an aqueous environment
    Baker, Mariah R.
    Fan, Guizhen
    Serysheva, Irina I.
    EUROPEAN JOURNAL OF TRANSLATIONAL MYOLOGY, 2015, 25 (01) : 35 - 48
  • [50] Estimating the effect of finite depth of field in single-particle cryo-EM
    Downing, Kenneth H.
    Glaeser, Robert M.
    ULTRAMICROSCOPY, 2018, 184 : 94 - 99