A systematic review on the state-of-the-art strategies for protein representation

被引:5
作者
Yue, Zi-Xuan [1 ]
Yan, Tian-Ci [1 ]
Xu, Hong-Quan [1 ]
Liu, Yu-Hong [1 ]
Hong, Yan-Feng [1 ]
Chen, Gong-Xing [1 ]
Xie, Tian [1 ]
Tao, Lin [1 ]
机构
[1] Hangzhou Normal Univ, Sch Pharm, Key Lab Elemene Class Anticanc Chinese Med, Hangzhou 311121, Peoples R China
基金
中国国家自然科学基金;
关键词
Protein representation methods; Sequence-based descriptors; Structure-based descriptors; Machine learning; Drug research; PREDICTION; IDENTIFICATION; NORMALIZATION; SCALE; RNA;
D O I
10.1016/j.compbiomed.2022.106440
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The study of drug-target protein interaction is a key step in drug research. In recent years, machine learning techniques have become attractive for research, including drug research, due to their automated nature, pre-dictive power, and expected efficiency. Protein representation is a key step in the study of drug-target protein interaction by machine learning, which plays a fundamental role in the ultimate accomplishment of accurate research. With the progress of machine learning, protein representation methods have gradually attracted attention and have consequently developed rapidly. Therefore, in this review, we systematically classify current protein representation methods, comprehensively review them, and discuss the latest advances of interest. Ac-cording to the information extraction methods and information sources, these representation methods are generally divided into structure and sequence-based representation methods. Each primary class can be further divided into specific subcategories. As for the particular representation methods involve both traditional and the latest approaches. This review contains a comprehensive assessment of the various methods which researchers can use as a reference for their specific protein-related research requirements, including drug research.
引用
收藏
页数:10
相关论文
共 107 条
[1]   DP-BINDER: machine learning model for prediction of DNA-binding proteins by fusing evolutionary and physicochemical information [J].
Ali, Farman ;
Ahmed, Saeed ;
Swati, Zar Nawab Khan ;
Akbar, Shahid .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2019, 33 (07) :645-658
[2]   Machine learning in protein structure prediction [J].
AlQuraishi, Mohammed .
CURRENT OPINION IN CHEMICAL BIOLOGY, 2021, 65 :1-8
[3]   CovInter: interaction data between coronavirus RNAs and host proteins [J].
Amahong, Kuerbannisha ;
Zhang, Wei ;
Zhou, Ying ;
Zhang, Song ;
Yin, Jiayi ;
Li, Fengcheng ;
Xu, Hongquan ;
Yan, Tianci ;
Yue, Zixuan ;
Liu, Yuhong ;
Hou, Tingjun ;
Qiu, Yunqing ;
Tao, Lin ;
Han, Lianyi ;
Zhu, Feng .
NUCLEIC ACIDS RESEARCH, 2023, 51 (D1) :D546-D556
[4]   Fractal Analysis of DNA Sequences Using Frequency Chaos Game Representation and Small-Angle Scattering [J].
Anitas, Eugen Mircea .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2022, 23 (03)
[5]   Disease gene classification with metagraph representations [J].
Ata, Sezin Kircali ;
Fang, Yuan ;
Wu, Min ;
Li, Xiao-Li ;
Xiao, Xiaokui .
METHODS, 2017, 131 :83-92
[6]   A novel measure to analyze protein structures: Aspect ratio in protein alpha shapes [J].
Bagci, Elife Z. ;
Senguler-Ciftci, Fatma ;
Ciftci, Unver ;
Demir, Ayhan .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2021, 89 (10) :1270-1276
[7]   Machine learning techniques for protein function prediction [J].
Bonetta, Rosalin ;
Valentino, Gianluca .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2020, 88 (03) :397-413
[8]   KrakenUniq: confident and fast metagenomics classification using unique k-mer counts [J].
Breitwieser, F. P. ;
Baker, D. N. ;
Salzberg, S. L. .
GENOME BIOLOGY, 2018, 19
[9]   SCMTHP: A New Approach for Identifying and Characterizing of Tumor-Homing Peptides Using Estimated Propensity Scores of Amino Acids [J].
Charoenkwan, Phasit ;
Chiangjong, Wararat ;
Nantasenamat, Chanin ;
Moni, Mohammad Ali ;
Lio', Pietro ;
Manavalan, Balachandran ;
Shoombuatong, Watshara .
PHARMACEUTICS, 2022, 14 (01)
[10]   PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method [J].
Charoenkwan, Phasit ;
Kanthawong, Sakawrat ;
Schaduangrat, Nalini ;
Yana, Janchai ;
Shoombuatong, Watshara .
CELLS, 2020, 9 (02)