Benchmarking deep learning methods for predicting CRISPR/Cas9 sgRNA on- and off-target activities

被引:11
|
作者
Zhang, Guishan [3 ]
Luo, Ye [3 ]
Dai, Xianhua [1 ,4 ]
Dai, Zhiming [2 ,5 ]
机构
[1] Sun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen 518107, Peoples R China
[2] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
[3] Shantou Univ, Coll Engn, Shantou, Peoples R China
[4] Sun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen, Peoples R China
[5] Sun Yat sen Univ, Sch Comp Sci & Engn, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
CRISPR/Cas9; sgRNA; deep learning; on-target; off-target; GUIDE RNA DESIGN; CHROMATIN ACCESSIBILITY; CLEAVAGE EFFICIENCY; SEQUENCE FEATURES; PAM COMPATIBILITY; NEURAL-NETWORKS; GENOME; CRISPR-CAS9; CAS9; SPECIFICITY;
D O I
10.1093/bib/bbad333
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In silico design of single guide RNA (sgRNA) plays a critical role in clustered regularly interspaced, short palindromic repeats/CRISPR-associated protein 9 (CRISPR/Cas9) system. Continuous efforts are aimed at improving sgRNA design with efficient on-target activity and reduced off-target mutations. In the last 5 years, an increasing number of deep learning-based methods have achieved breakthrough performance in predicting sgRNA on- and off-target activities. Nevertheless, it is worthwhile to systematically evaluate these methods for their predictive abilities. In this review, we conducted a systematic survey on the progress in prediction of on- and off-target editing. We investigated the performances of 10 mainstream deep learning-based on-target predictors using nine public datasets with different sample sizes. We found that in most scenarios, these methods showed superior predictive power on large- and medium-scale datasets than on small-scale datasets. In addition, we performed unbiased experiments to provide in-depth comparison of eight representative approaches for off-target prediction on 12 publicly available datasets with various imbalanced ratios of positive/negative samples. Most methods showed excellent performance on balanced datasets but have much room for improvement on moderate- and severe-imbalanced datasets. This study provides comprehensive perspectives on CRISPR/Cas9 sgRNA on- and off-target activity prediction and improvement for method development.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Potential high-frequency off-target mutagenesis induced by CRISPR/Cas9 in Arabidopsis and its prevention
    Zhang, Qiang
    Xing, Hui-Li
    Wang, Zhi-Ping
    Zhang, Hai-Yan
    Yang, Fang
    Wang, Xue-Chen
    Chen, Qi-Jun
    PLANT MOLECULAR BIOLOGY, 2018, 96 (4-5) : 445 - 456
  • [32] Optimized sgRNA design to maximize activity and minimize off-target effects of CRISPR-Cas9
    Doench, John G.
    Fusi, Nicolo
    Sullender, Meagan
    Hegde, Mudra
    Vaimberg, Emma W.
    Donovan, Katherine F.
    Smith, Ian
    Tothova, Zuzana
    Wilen, Craig
    Orchard, Robert
    Virgin, Herbert W.
    Listgarten, Jennifer
    Root, David E.
    NATURE BIOTECHNOLOGY, 2016, 34 (02) : 184 - +
  • [33] CRISPR/Cas Systems in Genome Editing: Methodologies and Tools for sgRNA Design, Off-Target Evaluation, and Strategies to Mitigate Off-Target Effects
    Manghwar, Hakim
    Li, Bo
    Ding, Xiao
    Hussain, Amjad
    Lindsey, Keith
    Zhang, Xianlong
    Jin, Shuangxia
    ADVANCED SCIENCE, 2020, 7 (06)
  • [34] Predicting CRISPR-Cas9 Off-target with Self-supervised Neural Networks
    Chen, Dong
    Shu, Wenjie
    Peng, Shaoliang
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 245 - 250
  • [35] Deep learning improves the ability of sgRNA off-target propensity prediction
    Qiaoyue Liu
    Xiang Cheng
    Gan Liu
    Bohao Li
    Xiuqin Liu
    BMC Bioinformatics, 21
  • [36] R-CRISPR: A Deep Learning Network to Predict Off-Target Activities with Mismatch, Insertion and Deletion in CRISPR-Cas9 System
    Niu, Rui
    Peng, Jiajie
    Zhang, Zhipeng
    Shang, Xuequn
    GENES, 2021, 12 (12)
  • [37] Measuring and Reducing Off-Target Activities of Programmable Nucleases Including CRISPR-Cas9
    Koo, Taeyoung
    Lee, Jungjoon
    Kim, Jin-Soo
    MOLECULES AND CELLS, 2015, 38 (06) : 475 - 481
  • [38] Off-target Effects in CRISPR/Cas9-mediated Genome Engineering
    Zhang, Xiao-Hui
    Tee, Louis Y.
    Wang, Xiao-Gang
    Huang, Qun-Shan
    Yang, Shi-Hua
    MOLECULAR THERAPY-NUCLEIC ACIDS, 2015, 4 : e264
  • [39] Accurate deep learning off-target prediction with novel sgRNA-DNA sequence encoding in CRISPR-Cas9 gene editing
    Charlier, Jeremy
    Nadon, Robert
    Makarenkov, Vladimir
    BIOINFORMATICS, 2021, 37 (16) : 2299 - 2307
  • [40] Effective use of sequence information to predict CRISPR-Cas9 off-target
    Zhang, Zhong-Rui
    Jiang, Zhen-Ran
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 650 - 661