Interpretable CRISPR/Cas9 off-target activities with mismatches and indels prediction using BERT

Cited by: 4
Authors
Luo, Ye [1]
Chen, Yaowen [1]
Xie, HuanZeng [1]
Zhu, Wentao [1]
Zhang, Guishan [1]
Affiliations
[1] Shantou Univ, Coll Engn, Shantou 515063, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
CRISPR/Cas9; Off-target; BERT; Adaptive batch-wise class balancing; Deep learning; GENOME EDITING TECHNOLOGIES; CLASSIFICATION; CRISPR-CAS9; SPECIFICITY; DESIGN; CAS9; SYSTEMS; DNA
DOI
10.1016/j.compbiomed.2024.107932
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Off-target effects of CRISPR/Cas9 can lead to suboptimal genome editing outcomes. Numerous deep learning-based approaches have achieved excellent performance in off-target prediction; however, few can predict off-target activities with both mismatches and indels between a single guide RNA (sgRNA) and target DNA sequence pair. In addition, data imbalance is a common pitfall in off-target prediction. Moreover, due to the complexity of genomic contexts, building an interpretable model remains challenging. To address these issues, we first developed a BERT-based model called CRISPR-BERT to improve the prediction of off-target activities with both mismatches and indels. Second, we proposed an adaptive batch-wise class balancing strategy to combat the noise that exists in imbalanced off-target data. Finally, we applied a visualization approach to investigate generalizable nucleotide position-dependent patterns of sgRNA-DNA pairs for off-target activity. In a comprehensive comparison with existing methods on five mismatches-only datasets and two mismatches-and-indels datasets, CRISPR-BERT achieved the best performance in terms of AUROC and PRAUC. In addition, the visualization analysis demonstrated how implicit knowledge learned by CRISPR-BERT facilitates off-target prediction, which shows potential for model interpretability. Collectively, CRISPR-BERT provides an accurate and interpretable framework for off-target prediction and further contributes to sgRNA optimization in practical use for improved target specificity in CRISPR/Cas9 genome editing. The source code is available at https://github.com/BrokenStringx/CRISPR-BERT
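The abstract names an adaptive batch-wise class balancing strategy but does not describe its rule. As a rough illustration of the general idea only, the sketch below builds mini-batches with a fixed share of positive (off-target) samples by oversampling the rare class. This fixed-ratio scheme is a simpler stand-in, not the authors' adaptive method; the function name `balanced_batches` and all of its parameters are hypothetical.

```python
import random
from typing import Iterator, List, Sequence


def balanced_batches(
    labels: Sequence[int],
    batch_size: int,
    positive_fraction: float,
    num_batches: int,
    seed: int = 0,
) -> Iterator[List[int]]:
    """Yield index lists in which a fixed share of each batch is positive.

    Hypothetical helper: a fixed-ratio stand-in for batch-wise balancing,
    not the adaptive rule used by CRISPR-BERT.
    """
    if batch_size < 2:
        raise ValueError("batch_size must be at least 2")
    rng = random.Random(seed)
    positives = [i for i, y in enumerate(labels) if y == 1]
    negatives = [i for i, y in enumerate(labels) if y == 0]
    if not positives or not negatives:
        raise ValueError("both classes must be present")

    # Keep at least one sample of each class in every batch.
    n_pos = min(max(1, round(batch_size * positive_fraction)), batch_size - 1)
    n_neg = batch_size - n_pos

    for _ in range(num_batches):
        # Positives are rare, so draw them with replacement (oversampling).
        batch = rng.choices(positives, k=n_pos)
        # Draw negatives without replacement when enough are available.
        if n_neg <= len(negatives):
            batch += rng.sample(negatives, n_neg)
        else:
            batch += rng.choices(negatives, k=n_neg)
        rng.shuffle(batch)
        yield batch


# Toy data: 3 positives among 100 samples, roughly the imbalance regime
# the abstract alludes to (the numbers here are made up for illustration).
toy_labels = [1] * 3 + [0] * 97
toy_batches = list(balanced_batches(toy_labels, batch_size=8,
                                    positive_fraction=0.25, num_batches=5))
```

Each yielded list holds dataset indices, so it can be fed to whatever loader assembles the sgRNA-DNA pair encodings. An adaptive variant would presumably change `positive_fraction` during training, but the abstract does not say how.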
Pages: 15