DNA shape features improve prediction of CRISPR/Cas9 activity

被引:6
作者
Vora, Dhvani Sandip [1 ]
Bhandari, Sakshi Manoj [2 ]
Sundar, Durai [1 ,3 ]
机构
[1] Indian Inst Technol Delhi, Dept Biochem Engn & Biotechnol, New Delhi 110016, India
[2] Indian Inst Technol Delhi, Dept Math, New Delhi 110016, India
[3] Indian Inst Technol Delhi, Sch Artificial Intelligence, New Delhi 110016, India
关键词
CRISPR/Cas9; Off-target; DNA shape; Neural networks; LIME; Permutation importance; OFF-TARGET CLEAVAGE; UNBIASED DETECTION; BINDING; CRISPR-CAS9; CAS9; SEQ; SPECIFICITY; DESIGN; RNA;
D O I
10.1016/j.ymeth.2024.04.012
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The CRISPR/Cas9 genome editing technology has transformed basic and translational research in biology and medicine. However, the advances are hindered by off -target effects and a paucity in the knowledge of the mechanism of the Cas9 protein. Machine learning models have been proposed for the prediction of Cas9 activity at unintended sites, yet feature engineering plays a major role in the outcome of the predictors. This study evaluates the improvement in the performance of similar predictors upon inclusion of epigenetic and DNA shape feature groups in the conventionally used sequence -based Cas9 target and off -target datasets. The approach involved the utilization of neural networks trained on a diverse range of parameters, allowing us to systematically assess the performance increase for the meticulously designed datasets- (i) sequence only, (ii) sequence and epigenetic features, and (iii) sequence, epigenetic and DNA shape feature datasets. The addition of DNA shape information significantly improved predictive performance, evaluated by Akaike and Bayesian information criteria. The evaluation of individual feature importance by permutation and LIMEbased methods also indicates that not only sequence features like mismatches and nucleotide composition, but also base pairing parameters like opening and stretch, that are indicative of distortion in the DNA -RNA hybrid in the presence of mismatches, influence model outcomes.
引用
收藏
页码:120 / 126
页数:7
相关论文
共 68 条
[1]   A machine learning approach for predicting CRISPR-Cas9 cleavage efficiencies and patterns underlying its mechanism of action [J].
Abadi, Shiran ;
Yan, Winston X. ;
Amar, David ;
Mayrose, Itay .
PLOS COMPUTATIONAL BIOLOGY, 2017, 13 (10)
[2]   The CRISPR tool kit for genome editing and beyond [J].
Adli, Mazhar .
NATURE COMMUNICATIONS, 2018, 9
[3]  
Akaike H., 1998, International Symposium on Information Theory, Budapest, Proceedings, P199, DOI [DOI 10.1007/978-1-4612-1694-0_15, 10.1007/978-1-4612-1694-015, 10.1007/978-1-4612-1694-0_15, DOI 10.1007/978-1-4612-1694-015]
[4]   CRISPR-Cas9 off-targeting assessment with nucleic acid duplex energy parameters [J].
Alkan, Ferhat ;
Wenzel, Anne ;
Anthon, Christian ;
Havgaard, Jakob Hull ;
Gorodkin, Jan .
GENOME BIOLOGY, 2018, 19
[5]   Structural basis of PAM-dependent target DNA recognition by the Cas9 endonuclease [J].
Anders, Carolin ;
Niewoehner, Ole ;
Duerst, Alessia ;
Jinek, Martin .
NATURE, 2014, 513 (7519) :569-+
[6]   Gene Editing on Center Stage [J].
Bak, Rasmus O. ;
Gomez-Ospina, Natalia ;
Porteus, Matthew H. .
TRENDS IN GENETICS, 2018, 34 (08) :600-611
[7]   High-throughput biochemical profiling reveals sequence determinants of dCas9 off-target binding and unbinding [J].
Boyle, Evan A. ;
Andreasson, Johan O. L. ;
Chircus, Lauren M. ;
Sternberg, Samuel H. ;
Wu, Michelle J. ;
Guegler, Chantal K. ;
Doudna, Jennifer A. ;
Greenleaf, William J. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2017, 114 (21) :5461-5466
[8]   Protospacer Adjacent Motif (PAM)-Distal Sequences Engage CRISPR Cas9 DNA Target Cleavage [J].
Cencic, Regina ;
Miura, Hisashi ;
Malina, Abba ;
Robert, Francis ;
Ethier, Sylvain ;
Schmeing, T. Martin ;
Dostie, Josee ;
Pelletier, Jerry .
PLOS ONE, 2014, 9 (10)
[9]   DNAshapeR: an R/Bioconductor package for DNA shape prediction and feature encoding [J].
Chiu, Tsu-Pei ;
Comoglio, Federico ;
Zhou, Tianyin ;
Yang, Lin ;
Paro, Renato ;
Rohs, Remo .
BIOINFORMATICS, 2016, 32 (08) :1211-1213
[10]   Incorporation of bridged nucleic acids into CRISPR RNAs improves Cas9 endonuclease specificity [J].
Cromwell, Christopher R. ;
Sung, Keewon ;
Park, Jinho ;
Krysler, Amanda R. ;
Jovel, Juan ;
Kim, Seong Keun ;
Hubbard, Basil P. .
NATURE COMMUNICATIONS, 2018, 9