DBPboost:A method of classification of DNA-binding proteins based on improved differential evolution algorithm and feature extraction

被引:5
作者
Sun, Ailun [1 ]
Li, Hongfei [2 ]
Dong, Guanghui [1 ]
Zhao, Yuming [1 ]
Zhang, Dandan [3 ]
机构
[1] Northeast Forestry Univ, Coll Comp & Control Engn, Harbin 150040, Peoples R China
[2] Northeast Forestry Univ, Coll Life Sci, Harbin 150040, Peoples R China
[3] Harbin Med Univ, Affiliated Hosp 1, Dept Obstet & Gynecol, Harbin, Heilongjiang, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Protein structure prediction; Differential evolution; Dipeptide position-specific scoring matrix; Extreme gradient boosting decision tree; Random forest; Support vector machine; WEB SERVER; PREDICTION; IDENTIFICATION; PERFORMANCE; PSEAAC; DPP;
D O I
10.1016/j.ymeth.2024.01.005
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
DNA-binding proteins are a class of proteins that can interact with DNA molecules through physical and chemical interactions. Their main functions include regulating gene expression, maintaining chromosome structure and stability, and more. DNA-binding proteins play a crucial role in cellular and molecular biology, as they are essential for maintaining normal cellular physiological functions and adapting to environmental changes. The prediction of DNA-binding proteins has been a hot topic in the field of bioinformatics. The key to accurately classifying DNA-binding proteins is to find suitable feature sources and explore the information they contain. Although there are already many models for predicting DNA-binding proteins, there is still room for improvement in mining feature source information and calculation methods. In this study, we created a model called DBPboost to better identify DNA-binding proteins. The innovation of this study lies in the use of eight feature extraction methods, the improvement of the feature selection step, which involves selecting some features first and then performing feature selection again after feature fusion, and the optimization of the differential evolution algorithm in feature fusion, which improves the performance of feature fusion. The experimental results show that the prediction accuracy of the model on the UniSwiss dataset is 89.32%, and the sensitivity is 89.01%, which is better than most existing models.
引用
收藏
页码:56 / 64
页数:9
相关论文
共 53 条
[11]   Predicting DNA-binding proteins: approached from Chou's pseudo amino acid composition and other specific sequence features [J].
Fang, Y. ;
Guo, Y. ;
Feng, Y. ;
Li, M. .
AMINO ACIDS, 2008, 34 (01) :103-109
[12]   DBD-Hunter: a knowledge-based method for the prediction of DNA-protein interactions [J].
Gao, Mu ;
Skolnick, Jeffrey .
NUCLEIC ACIDS RESEARCH, 2008, 36 (12) :3978-3992
[13]   Prediction of Enzyme Function Based on Three Parallel Deep CNN and Amino Acid Mutation [J].
Gao, Ruibo ;
Wang, Mengmeng ;
Zhou, Jiaoyan ;
Fu, Yuhang ;
Liang, Meng ;
Guo, Dongliang ;
Nie, Junlan .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2019, 20 (11)
[14]   Accurate classification of membrane protein types based on sequence and evolutionary information using deep learning [J].
Guo, Lei ;
Wang, Shunfang ;
Li, Mingyuan ;
Cao, Zicheng .
BMC BIOINFORMATICS, 2019, 20 (01)
[15]   iPVP-MCV: A Multi-Classifier Voting Model for the Accurate Identification of Phage Virion Proteins [J].
Han, Haitao ;
Zhu, Wenhong ;
Ding, Chenchen ;
Liu, Taigang .
SYMMETRY-BASEL, 2021, 13 (08)
[16]   Improving DNA-Binding Protein Prediction Using Three-Part Sequence-Order Feature Extraction and a Deep Neural Network Algorithm [J].
Hu, Jun ;
Zeng, Wen-Wu ;
Jia, Ning-Xin ;
Arif, Muhammad ;
Yu, Dong-Jun ;
Zhang, Gui-Jun .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2023, 63 (03) :1044-1057
[17]   TargetDBP plus : Enhancing the Performance of Identifying DNA-Binding Proteins via Weighted Convolutional Features [J].
Hu, Jun ;
Rao, Liang ;
Zhu, Yi-Heng ;
Zhang, Gui-Jun ;
Yu, Dong-Jun .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2021, 61 (01) :505-515
[18]   HyperVR: a hybrid deep ensemble learning approach for simultaneously predicting virulence factors and antibiotic resistance genes [J].
Ji, Boya ;
Pi, Wending ;
Liu, Wenjuan ;
Liu, Yannan ;
Cui, Yujun ;
Zhang, Xianglilan ;
Peng, Shaoliang .
NAR GENOMICS AND BIOINFORMATICS, 2023, 5 (01)
[19]   A CELLULAR DNA-BINDING PROTEIN THAT ACTIVATES EUKARYOTIC TRANSCRIPTION AND DNA-REPLICATION [J].
JONES, KA ;
KADONAGA, JT ;
ROSENFELD, PJ ;
KELLY, TJ ;
TJIAN, R .
CELL, 1987, 48 (01) :79-89
[20]   The data complexity index to construct an efficient cross-validation method [J].
Li, Der-Chiang ;
Fang, Yao-Hwei ;
Fang, Y. M. Frank .
DECISION SUPPORT SYSTEMS, 2010, 50 (01) :93-102