Empirical Comparison and Analysis of Artificial Intelligence-Based Methods for Identifying Phosphorylation Sites of SARS-CoV-2 Infection

被引:0
作者
Lai, Hongyan [1 ]
Zhu, Tao [1 ]
Xie, Sijia [2 ]
Luo, Xinwei [2 ]
Hong, Feitong [2 ]
Luo, Diyu [1 ]
Dao, Fuying [3 ]
Lin, Hao [2 ]
Shu, Kunxian [1 ]
Lv, Hao [2 ]
机构
[1] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Big Data Bio Intelligence, Chongqing 400065, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Life Sci & Technol, Ctr Informat Biol, Chengdu 611731, Peoples R China
[3] Nanyang Technol Univ, Sch Biol Sci, Singapore 639798, Singapore
基金
中国博士后科学基金;
关键词
SARS-CoV-2; phosphorylation site; machine learning; deep learning; computation tool; BI-PROFILE BAYES; PROTEIN; PREDICTION; DESIGN;
D O I
10.3390/ijms252413674
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a member of the large coronavirus family with high infectivity and pathogenicity and is the primary pathogen causing the global pandemic of coronavirus disease 2019 (COVID-19). Phosphorylation is a major type of protein post-translational modification that plays an essential role in the process of SARS-CoV-2-host interactions. The precise identification of phosphorylation sites in host cells infected with SARS-CoV-2 will be of great importance to investigate potential antiviral responses and mechanisms and exploit novel targets for therapeutic development. Numerous computational tools have been developed on the basis of phosphoproteomic data generated by mass spectrometry-based experimental techniques, with which phosphorylation sites can be accurately ascertained across the whole SARS-CoV-2-infected proteomes. In this work, we have comprehensively reviewed several major aspects of the construction strategies and availability of these predictors, including benchmark dataset preparation, feature extraction and refinement methods, machine learning algorithms and deep learning architectures, model evaluation approaches and metrics, and publicly available web servers and packages. We have highlighted and compared the prediction performance of each tool on the independent serine/threonine (S/T) and tyrosine (Y) phosphorylation datasets and discussed the overall limitations of current existing predictors. In summary, this review would provide pertinent insights into the exploitation of new powerful phosphorylation site identification tools, facilitate the localization of more suitable target molecules for experimental verification, and contribute to the development of antiviral therapies.
引用
收藏
页数:25
相关论文
共 75 条
  • [11] Liu D., Li G., Zuo Y., Function determinants of TET proteins: The arrangements of sequence motifs with specific codes, Brief. Bioinform, 20, pp. 1826-1835, (2019)
  • [12] Guo S., Lei X., Chang Y., Zhao J., Wang J., Dong X., Liu Q., Zhang Z., Wang L., Yi D., Et al., SARS-CoV-2 hijacks cellular kinase CDK2 to promote viral RNA synthesis, Signal Transduct. Target. Ther, 7, (2022)
  • [13] Huang C., Feng F., Shi Y., Li W., Wang Z., Zhu Y., Yuan S., Hu D., Dai J., Jiang Q., Et al., Protein Kinase C Inhibitors Reduce SARS-CoV-2 Replication in Cultured Cells, Microbiol. Spectr, 10, (2022)
  • [14] Naik R.R., Shakya A.K., Aladwan S.M., El-Tanani M., Kinase Inhibitors as Potential Therapeutic Agents in the Treatment of COVID-19, Front. Pharmacol, 13, (2022)
  • [15] Wei L., He W., Malik A., Su R., Cui L., Manavalan B., Computational prediction and interpretation of cell-specific replication origin sites from multiple eukaryotes by exploiting stacking framework, Brief. Bioinform, 22, (2020)
  • [16] Ai C., Yang H., Liu X., Dong R., Ding Y., Guo F., MTMol-GPT: De novo multi-target molecular generation with transformer-based generative adversarial imitation learning, PLoS Comput. Biol, 20, (2024)
  • [17] Liu T., Qiao H., Wang Z., Yang X., Pan X., Yang Y., Ye X., Sakurai T., Lin H., Zhang Y., CodLncScape Provides a Self-Enriching Framework for the Systematic Collection and Exploration of Coding LncRNAs, Adv. Sci, 11, (2024)
  • [18] Schaduangrat N., Anuwongcharoen N., Charoenkwan P., Shoombuatong W., DeepAR: A novel deep learning-based hybrid framework for the interpretable prediction of androgen receptor antagonists, J. Cheminformatics, 15, (2023)
  • [19] Wang R., Jiang Y., Jin J., Yin C., Yu H., Wang F., Feng J., Su R., Nakai K., Zou Q., DeepBIO: An automated and interpretable deep-learning platform for high-throughput biological sequence prediction, functional annotation and visualization analysis, Nucleic Acids Res, 51, pp. 3017-3029, (2023)
  • [20] Manayalan B., Basith S., Shin T.H., Wei L., Lee G., mAHTPred: A sequence-based meta-predictor for improving the prediction of anti-hypertensive peptides using effective feature representation, Bioinformatics, 35, pp. 2757-2765, (2019)