Current computational tools for protein lysine acylation site prediction

被引:1
作者
Qin, Zhaohui [1 ]
Ren, Haoran [1 ]
Zhao, Pei [2 ]
Wang, Kaiyuan [1 ]
Liu, Huixia [1 ]
Miao, Chunbo [1 ]
Du, Yanxiu [1 ]
Li, Junzhou [1 ]
Wu, Liuji [3 ]
Chen, Zhen [1 ]
机构
[1] Henan Agr Univ, Coll Agron, Collaborat Innovat Ctr Henan Grain Crops, Henan Key Lab Rice Mol Breeding & High Efficiency, Zhengzhou 450046, Peoples R China
[2] Chinese Acad Agr Sci CAAS, Inst Cotton Res, State Key Lab Cotton Biol, Anyang 455000, Peoples R China
[3] Henan Agr Univ, Coll Agron, Natl Key Lab Wheat & Maize Crop Sci, Zhengzhou 450046, Peoples R China
基金
中国国家自然科学基金;
关键词
post-translation modification; lysine acylation; deep learning; protein language model; transfer learning; POSTTRANSLATIONAL MODIFICATION SITES; FUNCTIONAL ASSOCIATIONS; WEB SERVER; METABOLIC-REGULATION; HISTONE ACETYLATION; CROTONYLATION SITES; UPDATED DATABASE; GENE-EXPRESSION; RESOURCE; SUCCINYLATION;
D O I
10.1093/bib/bbae469
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
As a main subtype of post-translational modification (PTM), protein lysine acylations (PLAs) play crucial roles in regulating diverse functions of proteins. With recent advancements in proteomics technology, the identification of PTM is becoming a data-rich field. A large amount of experimentally verified data is urgently required to be translated into valuable biological insights. With computational approaches, PLA can be accurately detected across the whole proteome, even for organisms with small-scale datasets. Herein, a comprehensive summary of 166 in silico PLA prediction methods is presented, including a single type of PLA site and multiple types of PLA sites. This recapitulation covers important aspects that are critical for the development of a robust predictor, including data collection and preparation, sample selection, feature representation, classification algorithm design, model evaluation, and method availability. Notably, we discuss the application of protein language models and transfer learning to solve the small-sample learning issue. We also highlight the prediction methods developed for functionally relevant PLA sites and species/substrate/cell-type-specific PLA sites. In conclusion, this systematic review could potentially facilitate the development of novel PLA predictors and offer useful insights to researchers from various disciplines.
引用
收藏
页数:17
相关论文
共 165 条
[1]  
Ahmad MW, 2020, IEEE ACCESS, V8, P77888, DOI [10.1109/ACCESS.2020.2989713, 10.1109/access.2020.2989713]
[2]   Computational identification of multiple lysine PTM sites by analyzing the instance hardness and feature importance [J].
Ahmed, Sabit ;
Rahman, Afrida ;
Hasan, Md Al Mehedi ;
Ahmad, Shamim ;
Shovan, S. M. .
SCIENTIFIC REPORTS, 2021, 11 (01)
[3]   predML-Site: Predicting Multiple Lysine PTM Sites With Optimal Feature Representation and Data Imbalance Minimization [J].
Ahmed, Sabit ;
Rahman, Afrida ;
Hasan, Md. Al Mehedi ;
Rahman, Julia ;
Ben Islam, Md Khaled ;
Ahmad, Shamim .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (06) :3624-3634
[4]   RF-MaloSite and DL-Malosite: Methods based on random forest and deep learning to identify malonylation sites [J].
AL-barakati, Hussam ;
Thapa, Niraj ;
Hiroto, Saigo ;
Roy, Kaushik ;
Newman, Robert H. ;
Kc, Dukka .
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2020, 18 :852-860
[5]   ACETYLATION + METHYLATION OF HISTONES + THEIR POSSIBLE ROLE IN REGULATION OF RNA SYNTHESIS [J].
ALLFREY, VG ;
FAULKNER, R ;
MIRSKY, AE .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1964, 51 (05) :786-+
[6]   Comparing Oversampling Techniques to Handle the Class Imbalance Problem: A Customer Churn Prediction Case Study [J].
Amin, Adnan ;
Anwar, Sajid ;
Adnan, Awais ;
Nawaz, Muhammad ;
Howard, Newton ;
Qadir, Junaid ;
Hawalah, Ahmad ;
Hussain, Amir .
IEEE ACCESS, 2016, 4 :7940-7957
[7]  
Apweiler R, 2004, NUCLEIC ACIDS RES, V32, pD115, DOI [10.1093/nar/gkh131, 10.1093/nar/gkw1099]
[8]   LAIPT: Lysine Acetylation Site Identification with Polynomial Tree [J].
Bao, Wenzheng ;
Yang, Bin ;
Li, Zhengwei ;
Zhou, Yong .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2019, 20 (01)
[9]   A chemical reporter facilitates the detection and identification of lysine HMGylation on histones [J].
Bao, Xiucong ;
Xiong, Ying ;
Li, Xin ;
Li, Xiang David .
CHEMICAL SCIENCE, 2018, 9 (40) :7797-7801
[10]   STALLION: a stacking-based ensemble learning framework for prokaryotic lysine acetylation site prediction [J].
Basith, Shaherin ;
Lee, Gwang ;
Manavalan, Balachandran .
BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)