XGBFEMF: An XGBoost-Based Framework for Essential Protein Prediction

被引:101
|
作者
Zhong, Jiancheng [1 ]
Sun, Yusui [1 ]
Peng, Wei [2 ]
Xie, Minzhu [1 ]
Yang, Jiahong [1 ]
Tang, Xiwei [3 ]
机构
[1] Hunan Normal Univ, Sch Informat Sci & Engn, Changsha 410081, Hunan, Peoples R China
[2] Kunming Univ Sci & Technol, Comp Ctr, Kunming 650050, Yunnan, Peoples R China
[3] Hunan First Normal Univ, Dept Informat Sci & Engn, Changsha 410205, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Essential protein; feature engineering; multi-model fusion; XGBoost; SUB-EXPAND-SHRINK; XGBFEMF; ESSENTIAL GENES; SUBCELLULAR-LOCALIZATION; CENTRALITY; NETWORKS; DATABASE; GENOME; IDENTIFICATION; BETWEENNESS;
D O I
10.1109/TNB.2018.2842219
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Essential proteins as a vital part of maintaining the cells' life play an important role in the study of biology and drug design. With the generation of large amounts of biological data related to essential proteins, an increasing number of computational methods have been proposed. Different from the methods which adopt a single machine learning method or an ensemble machine learning method, this paper proposes a predicting framework named by XGBFEMF for identifying essential proteins, which includes a SUB-EXPAND-SHRINK method for constructing the composite features with original features and obtaining the better subset of features for essential protein prediction, and also includes a model fusion method for getting a more effective prediction model. We carry out experiments on Yeast data to assess the performance of the XGBFEMF with ROC analysis, accuracy analysis, and top analysis. Meanwhile, we set up experiments on E. coli data for the validation of performance. The test results show that the XGBFEMF framework can effectively improve many essential indicators. In addition, we analyze each step in the XGBFEMF framework; our results show that both each step of the SUB-EXPAND-SHRINK method as well as the step of multi-model fusion can improve prediction performance.
引用
收藏
页码:243 / 250
页数:8
相关论文
共 50 条
  • [1] Essential Protein Prediction Based on node2vec and XGBoost
    Wang, Nian
    Zeng, Min
    Li, Yiming
    Wu, Fang-Xiang
    Li, Min
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2021, 28 (07) : 687 - 700
  • [2] XGBoost-based QoE Prediction for Mobile Networks
    Dayi, A. Burak
    Tuna, Evren
    32ND IEEE SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU 2024, 2024,
  • [3] XGBoost-based prediction of electrical properties for anode aluminium foil
    Zhang, Yue
    Pan, Sining
    MATERIALS TODAY COMMUNICATIONS, 2024, 41
  • [4] A Deep Learning and XGBoost-Based Method for Predicting Protein-Protein Interaction Sites
    Wang, Pan
    Zhang, Guiyang
    Yu, Zu-Guo
    Huang, Guohua
    FRONTIERS IN GENETICS, 2021, 12
  • [5] XGBoost-based prediction modelling and analysis for health literacy assessment
    Hong, Yan
    Zhang, Xiaoda
    Chen, Jinxiang
    INTERNATIONAL JOURNAL OF MODELLING IDENTIFICATION AND CONTROL, 2021, 39 (03) : 229 - 235
  • [6] A Feature Selection Method for Prediction Essential Protein
    Zhong, Jiancheng
    Wang, Jianxin
    Peng, Wei
    Zhang, Zhen
    Li, Min
    TSINGHUA SCIENCE AND TECHNOLOGY, 2015, 20 (05) : 491 - 499
  • [7] XGBoost-based thermal error prediction and compensation of ball screws
    Gao, Xiangsheng
    Zhang, Kuan
    Zhang, Zitao
    Wang, Min
    Zan, Tao
    Gao, Peng
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART B-JOURNAL OF ENGINEERING MANUFACTURE, 2024, 238 (1-2) : 151 - 163
  • [8] An XGBoost-Based Knowledge Tracing Model
    Su, Wei
    Jiang, Fan
    Shi, Chunyan
    Wu, Dongqing
    Liu, Lei
    Li, Shihua
    Yuan, Yongna
    Shi, Juntai
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2023, 16 (01)
  • [9] XGBoost-Based Android Malware Detection
    Wang, Jiong
    Li, Boquan
    Zeng, Yuwei
    2017 13TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2017, : 268 - 272
  • [10] Essential Protein Prediction Based on Shuffled Frog-Leaping Algorithm
    YANG, Xiaoqin
    Lei, Xiujuan
    ZHAO, Jie
    CHINESE JOURNAL OF ELECTRONICS, 2021, 30 (04) : 704 - 711