Benchmarking deep learning methods for predicting CRISPR/Cas9 sgRNA on- and off-target activities
被引:10
|
作者:
Zhang, Guishan
论文数: 0引用数: 0
h-index: 0
机构:
Shantou Univ, Coll Engn, Shantou, Peoples R ChinaSun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen 518107, Peoples R China
Zhang, Guishan
[3
]
Luo, Ye
论文数: 0引用数: 0
h-index: 0
机构:
Shantou Univ, Coll Engn, Shantou, Peoples R ChinaSun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen 518107, Peoples R China
Luo, Ye
[3
]
Dai, Xianhua
论文数: 0引用数: 0
h-index: 0
机构:
Sun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen 518107, Peoples R China
Sun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen, Peoples R ChinaSun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen 518107, Peoples R China
Dai, Xianhua
[1
,4
]
Dai, Zhiming
论文数: 0引用数: 0
h-index: 0
机构:
Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
Sun Yat sen Univ, Sch Comp Sci & Engn, Guangzhou, Peoples R ChinaSun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen 518107, Peoples R China
Dai, Zhiming
[2
,5
]
机构:
[1] Sun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen 518107, Peoples R China
[2] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
[3] Shantou Univ, Coll Engn, Shantou, Peoples R China
[4] Sun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen, Peoples R China
[5] Sun Yat sen Univ, Sch Comp Sci & Engn, Guangzhou, Peoples R China
In silico design of single guide RNA (sgRNA) plays a critical role in clustered regularly interspaced, short palindromic repeats/CRISPR-associated protein 9 (CRISPR/Cas9) system. Continuous efforts are aimed at improving sgRNA design with efficient on-target activity and reduced off-target mutations. In the last 5 years, an increasing number of deep learning-based methods have achieved breakthrough performance in predicting sgRNA on- and off-target activities. Nevertheless, it is worthwhile to systematically evaluate these methods for their predictive abilities. In this review, we conducted a systematic survey on the progress in prediction of on- and off-target editing. We investigated the performances of 10 mainstream deep learning-based on-target predictors using nine public datasets with different sample sizes. We found that in most scenarios, these methods showed superior predictive power on large- and medium-scale datasets than on small-scale datasets. In addition, we performed unbiased experiments to provide in-depth comparison of eight representative approaches for off-target prediction on 12 publicly available datasets with various imbalanced ratios of positive/negative samples. Most methods showed excellent performance on balanced datasets but have much room for improvement on moderate- and severe-imbalanced datasets. This study provides comprehensive perspectives on CRISPR/Cas9 sgRNA on- and off-target activity prediction and improvement for method development.