UPicker: a semi-supervised particle picking transformer method for cryo-EM micrographs

被引:0
作者
Zhang, Chi [1 ]
Cheng, Yiran [2 ]
Feng, Kaiwen [1 ]
Zhang, Fa [3 ]
Han, Renmin [2 ]
Feng, Jieqing [1 ]
机构
[1] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou 310058, Zhejiang, Peoples R China
[2] Shandong Univ, Res Ctr Math & Interdisciplinary Sci, Qingdao 266000, Shandong, Peoples R China
[3] Beijing Inst Technol, Sch Med Technol, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
cryo-EM; particle picking; object detection; unsupervised pretraining; transformer;
D O I
10.1093/bib/bbae636
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Automatic single particle picking is a critical step in the data processing pipeline of cryo-electron microscopy structure reconstruction. In recent years, several deep learning-based algorithms have been developed, demonstrating their potential to solve this challenge. However, current methods highly depend on manually labeled training data, which is labor-intensive and prone to biases especially for high-noise and low-contrast micrographs, resulting in suboptimal precision and recall. To address these problems, we propose UPicker, a semi-supervised transformer-based particle-picking method with a two-stage training process: unsupervised pretraining and supervised fine-tuning. During the unsupervised pretraining, an Adaptive Laplacian of Gaussian region proposal generator is proposed to obtain pseudo-labels from unlabeled data for initial feature learning. For the supervised fine-tuning, UPicker only needs a small amount of labeled data to achieve high accuracy in particle picking. To further enhance model performance, UPicker employs a contrastive denoising training strategy to reduce redundant detections and accelerate convergence, along with a hybrid data augmentation strategy to deal with limited labeled data. Comprehensive experiments on both simulated and experimental datasets demonstrate that UPicker outperforms state-of-the-art particle-picking methods in terms of accuracy and robustness while requiring fewer labeled data than other transformer-based models. Furthermore, ablation studies demonstrate the effectiveness and necessity of each component of UPicker. The source code and data are available at https://github.com/JachyLikeCoding/UPicker.
引用
收藏
页数:13
相关论文
共 43 条
  • [11] Pre-trained models: Past, present and future
    Han, Xu
    Zhang, Zhengyan
    Ding, Ning
    Gu, Yuxian
    Liu, Xiao
    Huo, Yuqi
    Qiu, Jiezhong
    Yao, Yuan
    Zhang, Ao
    Zhang, Liang
    Han, Wentao
    Huang, Minlie
    Jin, Qin
    Lan, Yanyan
    Liu, Yang
    Liu, Zhiyuan
    Lu, Zhiwu
    Qiu, Xipeng
    Song, Ruihua
    Tang, Jie
    Wen, Ji-Rong
    Yuan, Jinhui
    Zhao, Wayne Xin
    Zhu, Jun
    [J]. AI OPEN, 2021, 2 : 225 - 250
  • [12] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [13] Application of template matching technique to particle detection in electron micrographs
    Huang, Z
    Penczek, PA
    [J]. JOURNAL OF STRUCTURAL BIOLOGY, 2004, 145 (1-2) : 29 - 40
  • [14] EMPIAR: a public archive for raw electron microscopy image data
    Iudin, Andrii
    Korir, Paul K.
    Salavert-Torres, Jose
    Kleywegt, Gerard J.
    Patwardhan, Ardan
    [J]. NATURE METHODS, 2016, 13 (05) : 387 - 388
  • [15] The Hungarian Method for the assignment problem
    Kuhn, HW
    [J]. NAVAL RESEARCH LOGISTICS, 2005, 52 (01) : 7 - 21
  • [16] Appion: An integrated, database-driven pipeline to facilitate EM image processing
    Lander, Gabriel C.
    Stagg, Scott M.
    Voss, Neil R.
    Cheng, Anchi
    Fellmann, Denis
    Pulokas, James
    Yoshioka, Craig
    Irving, Christopher
    Mulder, Anke
    Lau, Pick-Wei
    Lyumkis, Dmitry
    Potter, Clinton S.
    Carragher, Bridget
    [J]. JOURNAL OF STRUCTURAL BIOLOGY, 2009, 166 (01) : 95 - 102
  • [17] High-resolution Single Particle Analysis from Electron Cryo-microscopy Images Using SPHIRE
    Moriya, Toshio
    Saur, Michael
    Stabrin, Markus
    Merino, Felipe
    Voicu, Horatiu
    Huang, Zhong
    Penczek, Pawel A.
    Raunser, Stefan
    Gatsogiannis, Christos
    [J]. JOVE-JOURNAL OF VISUALIZED EXPERIMENTS, 2017, (123):
  • [18] Ouyang J., 2022, J Cybersecur, V2, P65
  • [19] Paszke A, 2019, ADV NEUR IN, V32
  • [20] Punjani A, 2017, NAT METHODS, V14, P290, DOI [10.1038/NMETH.4169, 10.1038/nmeth.4169]