Interpretable CRISPR/Cas9 off-target activities with mismatches and indels prediction using BERT

被引:4
|
作者
Luo, Ye [1 ]
Chen, Yaowen [1 ]
Xie, HuanZeng [1 ]
Zhu, Wentao [1 ]
Zhang, Guishan [1 ]
机构
[1] Shantou Univ, Coll Engn, Shantou 515063, Peoples R China
基金
中国国家自然科学基金;
关键词
CRISPER/Cas9; Off-target; BERT; Adaptive batch-wise olass balancing; Deep learning; GENOME EDITING TECHNOLOGIES; CLASSIFICATION; CRISPR-CAS9; SPECIFICITY; DESIGN; CAS9; SYSTEMS; DNA;
D O I
10.1016/j.compbiomed.2024.107932
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Off-target effects of CRISPR/Cas9 can lead to suboptimal genome editing outcomes. Numerous deep learning-based approaches have achieved excellent performance for off-target prediction; however, few can predict the off-target activities with both mismatches and indels between single guide RNA (sgRNA) and target DNA sequence pair. In addition, data imbalance is a common pitfall for off-target prediction. Moreover, due to the complexity of genomic contexts, generating an interpretable model also remains challenged. To address these issues, firstly we developed a BERT-based model called CRISPR-BERT for enhancing the prediction of off-target activities with both mismatches and indels. Secondly, we proposed an adaptive batch-wise class balancing strategy to combat the noise exists in imbalanced off-target data. Finally, we applied a visualization approach for investigating the generalizable nucleotide position-dependent patterns of sgRNA-DNA pair for off-target activity. In our comprehensive comparison to existing methods on five mismatches-only datasets and two mismatches-and-indels datasets, CRISPR-BERT achieved the best performance in terms of AUROC and PRAUC. Besides, the visualization analysis demonstrated how implicit knowledge learned by CRISPR-BERT facilitates off-target prediction, which shows potential in model interpretability. Collectively, CRISPR-BERT provides an accurate and interpretable framework for off-target prediction, further contributes to sgRNA optimization in practical use for improved target specificity in CRISPR/Cas9 genome editing. The source code is available at https://github.com/BrokenStringx/CRISPR-BERT
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Minimizing the off-target frequency of the CRISPR/Cas9 system via zwitterionic polymer conjugation and peptide fusion
    Han, Yanjiao
    Yuan, Zhefan
    Zhong, Sijin Luo
    Xu, Haoxian
    Jiang, Shaoyi
    CHEMICAL SCIENCE, 2023, 14 (23) : 6375 - 6382
  • [42] Synthetic switch to minimize CRISPR off-target effects by self-restricting Cas9 transcription and translation
    Shen, Chih-Che
    Hsu, Mu-Nung
    Chang, Chin-Wei
    Lin, Mei-Wei
    Hwu, Jih-Ru
    Tu, Yi
    Hu, Yu-Chen
    NUCLEIC ACIDS RESEARCH, 2019, 47 (03)
  • [43] Potential high-frequency off-target mutagenesis induced by CRISPR/Cas9 in Arabidopsis and its prevention
    Zhang, Qiang
    Xing, Hui-Li
    Wang, Zhi-Ping
    Zhang, Hai-Yan
    Yang, Fang
    Wang, Xue-Chen
    Chen, Qi-Jun
    PLANT MOLECULAR BIOLOGY, 2018, 96 (4-5) : 445 - 456
  • [44] DNA stretching induces Cas9 off-target activity
    Newton, Matthew D.
    Taylor, Benjamin J.
    Driessen, Rosalie P. C.
    Roos, Leonie
    Cvetesic, Nevena
    Allyjaun, Shenaz
    Lenhard, Boris
    Cuomo, Maria Emanuela
    Rueda, David S.
    NATURE STRUCTURAL & MOLECULAR BIOLOGY, 2019, 26 (03) : 185 - +
  • [45] Evaluation of Homology-Independent CRISPR-Cas9 Off-Target Assessment Methods
    Chaudhari, Hemangi G.
    Penterman, Jon
    Whitton, Holly J.
    Spencer, Sarah J.
    Flanagan, Nicole
    Zhang, Maria C. Lei
    Huang, Elaine
    Khedkar, Aditya S.
    Toomey, J. Mike
    Shearer, Courtney A.
    Needham, Alexander W.
    Ho, Tony W.
    Kulman, John D.
    Cradick, T. J.
    Kernytsky, Andrew
    CRISPR JOURNAL, 2020, 3 (06): : 440 - 453
  • [46] Off-Target Analysis in Gene Editing and Applications for Clinical Translation of CRISPR/Cas9 in HIV-1 Therapy
    Atkins, Andrew
    Chung, Cheng-Han
    Allen, Alexander G. G.
    Dampier, Will
    Gurrola, Theodore E. E.
    Sariyer, Ilker K. K.
    Nonnemacher, Michael R. R.
    Wigdahl, Brian
    FRONTIERS IN GENOME EDITING, 2021, 3
  • [47] Detection of on-target and off-target mutations generated by CRISPR/Cas9 and other sequence-specific nucleases
    Zischewski, Julia
    Fischer, Rainer
    Bortesi, Luisa
    BIOTECHNOLOGY ADVANCES, 2017, 35 (01) : 95 - 104
  • [48] High-fidelity CRISPR-Cas9 nucleases with no detectable genome-wide off-target effects
    Kleinstiver, Benjamin P.
    Pattanayak, Vikram
    Prew, Michelle S.
    Tsai, Shengdar Q.
    Nguyen, Nhu T.
    Zheng, Zongli
    Joung, J. Keith
    NATURE, 2016, 529 (7587) : 490 - +
  • [49] Applying CRISPR-Cas9 off-target editing on DNA based steganography
    Zhou H.
    Huan X.
    International Journal of Advanced Computer Science and Applications, 2019, 10 (08): : 1 - 5
  • [50] DNA shape features improve prediction of CRISPR/Cas9 activity
    Vora, Dhvani Sandip
    Bhandari, Sakshi Manoj
    Sundar, Durai
    METHODS, 2024, 226 : 120 - 126