UPicker: a semi-supervised particle picking transformer method for cryo-EM micrographs

被引:0
作者
Zhang, Chi [1 ]
Cheng, Yiran [2 ]
Feng, Kaiwen [1 ]
Zhang, Fa [3 ]
Han, Renmin [2 ]
Feng, Jieqing [1 ]
机构
[1] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou 310058, Zhejiang, Peoples R China
[2] Shandong Univ, Res Ctr Math & Interdisciplinary Sci, Qingdao 266000, Shandong, Peoples R China
[3] Beijing Inst Technol, Sch Med Technol, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
cryo-EM; particle picking; object detection; unsupervised pretraining; transformer;
D O I
10.1093/bib/bbae636
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Automatic single particle picking is a critical step in the data processing pipeline of cryo-electron microscopy structure reconstruction. In recent years, several deep learning-based algorithms have been developed, demonstrating their potential to solve this challenge. However, current methods highly depend on manually labeled training data, which is labor-intensive and prone to biases especially for high-noise and low-contrast micrographs, resulting in suboptimal precision and recall. To address these problems, we propose UPicker, a semi-supervised transformer-based particle-picking method with a two-stage training process: unsupervised pretraining and supervised fine-tuning. During the unsupervised pretraining, an Adaptive Laplacian of Gaussian region proposal generator is proposed to obtain pseudo-labels from unlabeled data for initial feature learning. For the supervised fine-tuning, UPicker only needs a small amount of labeled data to achieve high accuracy in particle picking. To further enhance model performance, UPicker employs a contrastive denoising training strategy to reduce redundant detections and accelerate convergence, along with a hybrid data augmentation strategy to deal with limited labeled data. Comprehensive experiments on both simulated and experimental datasets demonstrate that UPicker outperforms state-of-the-art particle-picking methods in terms of accuracy and robustness while requiring fewer labeled data than other transformer-based models. Furthermore, ablation studies demonstrate the effectiveness and necessity of each component of UPicker. The source code and data are available at https://github.com/JachyLikeCoding/UPicker.
引用
收藏
页数:13
相关论文
共 43 条
  • [1] AutoCryoPicker: an unsupervised learning approach for fully automated single particle picking in Cryo-EM images
    Al-Azzawi, Adil
    Ouadou, Anes
    Tanner, John J.
    Cheng, Jianlin
    [J]. BMC BIOINFORMATICS, 2019, 20 (1)
  • [2] Alexe B, 2010, PROC CVPR IEEE, P73, DOI 10.1109/CVPR.2010.5540226
  • [3] Bendory T, 2020, IEEE SIGNAL PROC MAG, V37, P58, DOI [10.1109/MSP.2019.2957822, 10.1109/msp.2019.2957822]
  • [4] Positive-unlabeled convolutional neural networks for particle picking in cryo-electron micrographs
    Bepler, Tristan
    Morin, Andrew
    Rapp, Micah
    Brasch, Julia
    Shapiro, Lawrence
    Noble, Alex J.
    Berger, Bonnie
    [J]. NATURE METHODS, 2019, 16 (11) : 1153 - +
  • [5] Burley SK, 2017, METHODS MOL BIOL, V1606, P627, DOI 10.1007/978-1-4939-7000-1_26
  • [6] Carion Nicolas, 2020, EUR C COMP VIS, P213, DOI [10.48550/arXiv. 2005.12872, DOI 10.48550/ARXIV.2005.12872, 10.1007/978-3-030-58452-813, DOI 10.1007/978-3-030-58452-813]
  • [7] Dang T., 2022, Proceedings of the European Conference on Computer Vision (ECCV), P86
  • [8] CryoTransformer: a transformer model for picking protein particles from Cryo-EM micrographs
    Dhakal, Ashwin
    Gyawali, Rajan
    Wang, Liguo
    Cheng, Jianlin
    [J]. BIOINFORMATICS, 2024, 40 (03)
  • [9] A large expert-curated cryo-EM image dataset for machine learning protein particle picking
    Dhakal, Ashwin
    Gyawali, Rajan
    Wang, Liguo
    Cheng, Jianlin
    [J]. SCIENTIFIC DATA, 2023, 10 (01)
  • [10] CryoSegNet: accurate cryo-EM protein particle picking by integrating the foundational AI image segmentation model and attention-gated U-Net
    Gyawali, Rajan
    Dhakal, Ashwin
    Wang, Liguo
    Cheng, Jianlin
    [J]. BRIEFINGS IN BIOINFORMATICS, 2024, 25 (04)