DoubleHelix: nucleic acid sequence identification, assignment and validation tool for cryo-EM and crystal structure models

Cited by: 5
Authors
Chojnowski, Grzegorz [1]
Affiliations
[1] Hamburg Unit, European Mol Biol Lab, Notkestr 85, D-22607 Hamburg, Germany
Keywords
REFINEMENT; MAPS; CRYSTALLOGRAPHY; ISOSTERICITY; PREDICTIONS; EXPANSION; ACCURACY; RIBOSOME; DATABASE; MOTIFS
DOI
10.1093/nar/gkad553
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Sequence assignment is a key step of the model-building process in both cryogenic electron microscopy (cryo-EM) and macromolecular crystallography (MX). If the assignment fails, it can result in difficult-to-identify errors that affect the interpretation of a model. There are many model-validation strategies that help experimentalists in this step of protein model building, but they are virtually non-existent for nucleic acids. Here, I present doubleHelix, a comprehensive method for assignment, identification, and validation of nucleic acid sequences in structures determined using cryo-EM and MX. The method combines a neural network classifier of nucleobase identities and a sequence-independent secondary structure assignment approach. I show that the presented method can successfully assist the sequence-assignment step in nucleic acid model building at lower resolutions, where visual map interpretation is very difficult. Moreover, I present examples of sequence-assignment errors detected using doubleHelix in cryo-EM and MX structures of ribosomes deposited in the Protein Data Bank, which escaped the scrutiny of available model-validation approaches. The doubleHelix program source code is available under the BSD-3 license at .
Pages: 8255-8269
Page count: 15