CRISPR-OTE: Prediction of CRISPR On-Target Efficiency Based on Multi-Dimensional Feature Fusion

被引:3
作者
Xie, J. [1 ]
Liu, M. [1 ]
Zhou, L. [2 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, China Hosp Dev Inst, Ctr Med Intelligent & Dev, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Genome editing; CRISPR; On-target efficiency; Deep learning; Prior knowledge; GUIDE-RNA; DESIGN; SINGLE; ENDONUCLEASE; SGRNAS; MODEL; CPF1;
D O I
10.1016/j.irbm.2022.07.003
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Objective: Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) is a powerful genome editing technology. Guide RNA (gRNA) plays an essential guiding role in the CRISPR system by complementary base pairing with target DNA. Since the CRISPR targeting mechanism problem has not yet been fully resolved, it remains a challenge to predict gRNA on-target efficiency. Current gRNA design tools often lack efficient information extraction and cannot learn the target efficiency patterns thoroughly.Material and methods: In this study, CRISPR-OTE is proposed to consider both multi-dimensional sequence information and important complementary prior knowledge based on a simple but effective framework. CRISPR-OTE consists of the local-contextual information branch and the prior knowledge branch. The local-contextual information branch extracts multi-dimensional sequence features from the DNA primary sequence by a parallel framework of Convolutional Neural Networks (CNN) and bidirectional Long Short-Term Memory networks (biLSTM). The prior knowledge branch selects the optimal subset of physicochemical features to provide the neural network with complementary knowledge, such as complex secondary structures. A simple feature fusion strategy is also adopted to fully utilize multi-modal data from the two branches.Results: The experimental results show that the optimal subset of physicochemical features (RNA secondary structure and melting temperature of 34nt target) can effectively improve the prediction performance. Additionally, combining multi-dimensional sequence features and multi-modal features can extract information more comprehensively. Through transfer learning, CRISPR-OTE trained on the CRISPR-Cpf1 system can also be successfully applied to the CRISPR-Cas9 system.Conclusion: The performance of CRISPR-OTE is superior to other methods in different CRISPR systems and species. Therefore, CRISPR-OTE is a simple on-target efficiency prediction framework with better accuracy and generalization performance.(c) 2022 AGBM. Published by Elsevier Masson SAS. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Transformer-based anti-noise models for CRISPR-Cas9 off-target activities prediction
    Guan, Zengrui
    Jiang, Zhenran
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (03)
  • [32] An active early warning method for abnormal electricity load consumption based on data multi-dimensional feature
    Cui, Jia
    Fu, Tianhe
    Yang, Junyou
    Wang, Shunjiang
    Li, Chaoran
    Han, Ni
    Zhang, Ximing
    ENERGY, 2025, 314
  • [33] Three-dimensional target inversion algorithm based on multi-feature reconstruction
    Xue, Yali
    Zhou, Lizun
    Wang, Linfei
    Ouyang, Quan
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (11): : 2199 - 2207
  • [34] Gas Sensor Array Fault Diagnosis Based on Multi-Dimensional Fusion, an Attention Mechanism, and Multi-Task Learning
    Huang, Pengyu
    Wang, Qingfeng
    Chen, Haotian
    Lu, Geyu
    SENSORS, 2023, 23 (18)
  • [35] Short-term metro passenger flow prediction based on hybrid spatiotemporal extraction and multi-feature fusion
    Wang, Tao
    Song, Jianjun
    Zhang, Jing
    Tian, Junfang
    Wu, Jianjun
    Zheng, Jianfeng
    TUNNELLING AND UNDERGROUND SPACE TECHNOLOGY, 2025, 159
  • [36] Prediction of off-target specificity and cell-specific fitness of CRISPR-Cas System using attention boosted deep learning and network-based gene feature
    Liu, Qiao
    He, Di
    Xie, Lei
    PLOS COMPUTATIONAL BIOLOGY, 2019, 15 (10)
  • [37] Remaining useful life prediction based on parallel multi-scale feature fusion network
    Yin, Yuyan
    Tian, Jie
    Liu, Xinfeng
    JOURNAL OF INTELLIGENT MANUFACTURING, 2024, 36 (5) : 3111 - 3127
  • [38] Recurrence prediction of gastric cancer based on multi-resolution feature fusion and context information
    Zhou, Hongyu
    Tao, Haibo
    Xue, Feiyue
    Wang, Bin
    Jin, Huaiping
    Li, Zhenhui
    Shengwu Yixue Gongchengxue Zazhi/Journal of Biomedical Engineering, 2024, 41 (05): : 886 - 894
  • [39] Multi-dimensional feature extraction-based deep encoder–decoder network for automatic surface defect detection
    Huseyin Uzen
    Muammer Turkoglu
    Davut Hanbay
    Neural Computing and Applications, 2023, 35 : 3263 - 3282
  • [40] An interpretable deep learning multi-dimensional integration framework for exchange rate forecasting based on deep and shallow feature selection and snapshot ensemble technology
    Wang, Jujie
    Dong, Ying
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133