Prediction of coronavirus 3C-like protease cleavage sites using machine-learning algorithms

被引:0
|
作者
Huiting Chen [1 ]
Zhaozhong Zhu [1 ]
Ye Qiu [1 ]
Xingyi Ge [1 ]
Heping Zheng [1 ]
Yousong Peng [1 ]
机构
[1] Bioinformatics Center, College of Biology, Hunan Provincial Key Laboratory of Medical Virology, Hunan University
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
R373 [人体病毒学(致病病毒)];
学科分类号
100103 ; 100705 ;
摘要
The coronavirus 3C-like(3CL) protease, a cysteine protease, plays an important role in viral infection and immune escape. However, there is still a lack of effective tools for determining the cleavage sites of the 3CL protease. This study systematically investigated the diversity of the cleavage sites of the coronavirus 3CL protease on the viral polyprotein, and found that the cleavage motif were highly conserved for viruses in the genera of Alphacoronavirus, Betacoronavirus and Gammacoronavirus. Strong residue preferences were observed at the neighboring positions of the cleavage sites. A random forest(RF) model was built to predict the cleavage sites of the coronavirus 3CL protease based on the representation of residues in cleavage motifs by amino acid indexes, and the model achieved an AUC of 0.96 in cross-validations. The RF model was further tested on an independent test dataset which were composed of cleavage sites on 99 proteins from multiple coronavirus hosts. It achieved an AUC of 0.95 and predicted correctly 80% of the cleavage sites. Then, 1,352 human proteins were predicted to be cleaved by the 3CL protease by the RF model. These proteins were enriched in several GO terms related to the cytoskeleton, such as the microtubule, actin and tubulin. Finally, a webserver named 3CLP was built to predict the cleavage sites of the coronavirus 3CL protease based on the RF model. Overall, the study provides an effective tool for identifying cleavage sites of the 3CL protease and provides insights into the molecular mechanism underlying the pathogenicity of coronaviruses.
引用
收藏
页码:437 / 444
页数:8
相关论文
共 50 条
  • [1] Prediction of coronavirus 3C-like protease cleavage sites using machine-learning algorithms
    Chen, Huiting
    Zhu, Zhaozhong
    Qiu, Ye
    Ge, Xingyi
    Zheng, Heping
    Peng, Yousong
    VIROLOGICA SINICA, 2022, 37 (03) : 437 - 444
  • [2] Prediction of coronavirus 3C-like protease cleavage sites using machine-learning algorithms
    Huiting Chen
    Zhaozhong Zhu
    Ye Qiu
    Xingyi Ge
    Heping Zheng
    Yousong Peng
    Virologica Sinica, 2022, 37 (03) : 437 - 444
  • [3] Prediction and biochemical analysis of putative cleavage sites of the 3C-like protease of Middle East respiratory syndrome coronavirus
    Wu, Andong
    Wang, Yi
    Zeng, Cong
    Huang, Xingyu
    Xu, Shan
    Su, Ceyang
    Wang, Min
    Chen, Yu
    Guo, Deyin
    VIRUS RESEARCH, 2015, 208 : 56 - 65
  • [4] Deciphering the cleavage sites of 3C-like protease in Gammacoronaviruses and Deltacoronaviruses
    Wang, Mengxue
    Sun, Xinyi
    Peng, Shijiang
    Wang, Feifan
    Zhao, Kangli
    Wang, Dang
    BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS, 2025, 1873 (01):
  • [5] A comparison of machine learning algorithms for the prediction of Hepatitis C NS3 protease cleavage sites
    Chown, Harry
    EUROBIOTECH JOURNAL, 2019, 3 (04): : 167 - 174
  • [6] TDFFM: Transformer and Deep Forest Fusion Model for Predicting Coronavirus 3C-Like Protease Cleavage Sites
    Wang, Qingsong
    Ge, Ruiquan
    Wang, Changmiao
    Elazab, Ahmed
    Fang, Qiming
    Zhang, Renfeng
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2024, 21 (05) : 1231 - 1241
  • [7] Cleavage activity of the sapovirus 3C-like protease in Escherichia coli
    T. Oka
    K. Katayama
    S. Ogawa
    G. S. Hansman
    T. Kageyama
    T. Miyamura
    N. Takeda
    Archives of Virology, 2005, 150 : 2539 - 2548
  • [8] Polyprotein processing in Southampton virus: Identification of 3C-like protease cleavage sites by in vitro mutagenesis
    Liu, BL
    Clarke, IN
    Lambden, PR
    JOURNAL OF VIROLOGY, 1996, 70 (04) : 2605 - 2610
  • [9] Cleavage activity of the sapovirus 3C-like protease in Escherichia coli
    Oka, T
    Katayama, K
    Ogawa, S
    Hansman, GS
    Kageyama, T
    Miyamura, T
    Takeda, N
    ARCHIVES OF VIROLOGY, 2005, 150 (12) : 2539 - 2548
  • [10] Isatin compounds as noncovalent SARS coronavirus 3C-like protease inhibitors
    Zhou, Lu
    Liu, Ying
    Zhang, Weilin
    Wei, Ping
    Huang, Changkang
    Pei, Jianfeng
    Yuan, Yaxia
    Lai, Luhua
    JOURNAL OF MEDICINAL CHEMISTRY, 2006, 49 (12) : 3440 - 3443