CRISPR-OTE: Prediction of CRISPR On-Target Efficiency Based on Multi-Dimensional Feature Fusion

被引:3
作者
Xie, J. [1 ]
Liu, M. [1 ]
Zhou, L. [2 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, China Hosp Dev Inst, Ctr Med Intelligent & Dev, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Genome editing; CRISPR; On-target efficiency; Deep learning; Prior knowledge; GUIDE-RNA; DESIGN; SINGLE; ENDONUCLEASE; SGRNAS; MODEL; CPF1;
D O I
10.1016/j.irbm.2022.07.003
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Objective: Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) is a powerful genome editing technology. Guide RNA (gRNA) plays an essential guiding role in the CRISPR system by complementary base pairing with target DNA. Since the CRISPR targeting mechanism problem has not yet been fully resolved, it remains a challenge to predict gRNA on-target efficiency. Current gRNA design tools often lack efficient information extraction and cannot learn the target efficiency patterns thoroughly.Material and methods: In this study, CRISPR-OTE is proposed to consider both multi-dimensional sequence information and important complementary prior knowledge based on a simple but effective framework. CRISPR-OTE consists of the local-contextual information branch and the prior knowledge branch. The local-contextual information branch extracts multi-dimensional sequence features from the DNA primary sequence by a parallel framework of Convolutional Neural Networks (CNN) and bidirectional Long Short-Term Memory networks (biLSTM). The prior knowledge branch selects the optimal subset of physicochemical features to provide the neural network with complementary knowledge, such as complex secondary structures. A simple feature fusion strategy is also adopted to fully utilize multi-modal data from the two branches.Results: The experimental results show that the optimal subset of physicochemical features (RNA secondary structure and melting temperature of 34nt target) can effectively improve the prediction performance. Additionally, combining multi-dimensional sequence features and multi-modal features can extract information more comprehensively. Through transfer learning, CRISPR-OTE trained on the CRISPR-Cpf1 system can also be successfully applied to the CRISPR-Cas9 system.Conclusion: The performance of CRISPR-OTE is superior to other methods in different CRISPR systems and species. Therefore, CRISPR-OTE is a simple on-target efficiency prediction framework with better accuracy and generalization performance.(c) 2022 AGBM. Published by Elsevier Masson SAS. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] CRISPRLearner: A Deep Learning-Based System to Predict CRISPR/Cas9 sgRNA On-Target Cleavage Efficiency
    Dimauro, Giovanni
    Colagrande, Pierpasquale
    Carlucci, Roberto
    Ventura, Mario
    Bevilacqua, Vitoantonio
    Caivano, Danilo
    ELECTRONICS, 2019, 8 (12)
  • [2] CrnnCrispr: An Interpretable Deep Learning Method for CRISPR/Cas9 sgRNA On-Target Activity Prediction
    Zhu, Wentao
    Xie, Huanzeng
    Chen, Yaowen
    Zhang, Guishan
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (08)
  • [3] ] CRISPRpred: A flexible and efficient tool for sgRNAs on-target activity prediction in CRISPR/Cas9 systems
    Rahman, Md. Khaledur
    Rahman, M. Sohel
    PLOS ONE, 2017, 12 (08):
  • [4] Increasing on-target cleavage efficiency for CRISPR/Cas9-induced large fragment deletion in Myxococcus xanthus
    Yang, Ying-Jie
    Wang, Ye
    Li, Zhi-Feng
    Gong, Ya
    Zhang, Peng
    Hu, Wen-Chao
    Sheng, Duo-Hong
    Li, Yue-Zhong
    MICROBIAL CELL FACTORIES, 2017, 16
  • [5] CRISPR-MFH: A Lightweight Hybrid Deep Learning Framework with Multi-Feature Encoding for Improved CRISPR-Cas9 Off-Target Prediction
    Zheng, Yanyi
    Zou, Quan
    Li, Jian
    Yang, Yanpeng
    GENES, 2025, 16 (04)
  • [6] CLSTM-AR-Based Multi-Dimensional Feature Fusion for Multi-Energy Load Forecasting
    Ren, Bowen
    Huang, Cunqiang
    Chen, Laijun
    Mei, Shengwei
    An, Juan
    Liu, Xingwen
    Ma, Hengrui
    ELECTRONICS, 2022, 11 (21)
  • [7] Traffic flow prediction with a multi-dimensional feature input: A new method based on attention mechanisms
    Zhang, Shaohu
    Ma, Jianxiao
    Geng, Boshuo
    Wang, Hanbin
    ELECTRONIC RESEARCH ARCHIVE, 2024, 32 (02): : 979 - 1002
  • [8] A Multi-Dimensional Attention Feature Fusion Method for Pedestrian Re-identification
    Chen, X. P.
    Xu, Y.
    ENGINEERING LETTERS, 2023, 31 (04) : 1365 - 1373
  • [9] A Complete Automatic Target Recognition System of Low Altitude, Small RCS and Slow Speed (LSS) Targets Based on Multi-Dimensional Feature Fusion
    Wu, Qi
    Chen, Jie
    Lu, Yue
    Zhang, Yue
    SENSORS, 2019, 19 (22)
  • [10] Prediction of protein interactions between pine and pine wood nematode using deep learning and multi-dimensional feature fusion
    Wang, Liuyan
    Li, Rongguang
    Guan, Xuemei
    Yan, Shanchun
    FRONTIERS IN PLANT SCIENCE, 2024, 15