Alg-MFDL: A multi-feature deep learning framework for allergenic proteins prediction

被引:1
|
作者
Hu, Xiang [1 ]
Li, Jingyi [2 ]
Liu, Taigang [1 ]
机构
[1] Shanghai Ocean Univ, Coll Informat Technol, Shanghai 201306, Peoples R China
[2] Shanghai Ocean Univ, AIEN Inst, Shanghai 201306, Peoples R China
基金
中国国家自然科学基金;
关键词
Allergenic proteins; Protein language models; Handcrafted features; Deep learning; Feature fusion; LANGUAGE;
D O I
10.1016/j.ab.2024.115701
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The escalating global incidence of allergy patients illustrates the growing impact of allergic issues on global health. Allergens are small molecule antigens that trigger allergic reactions. A widely recognized strategy for allergy prevention involves identifying allergens and avoiding re-exposure. However, the laboratory methods to identify allergenic proteins are often time-consuming and resource-intensive. There is a crucial need to establish efficient and reliable computational approaches for the identification of allergenic proteins. In this study, we developed a novel allergenic proteins predictor named Alg-MFDL, which integrates pre-trained protein language models (PLMs) and traditional handcrafted features to achieve a more complete protein representation. First, we compared the performance of eight pre-trained PLMs from ProtTrans and ESM-2 and selected the bestperforming one from each of the two groups. In addition, we evaluated the performance of three handcrafted features and different combinations of them to select the optimal feature or feature combination. Then, these three protein representations were fused and used as inputs to train the convolutional neural network (CNN). Finally, the independent validation was performed on benchmark datasets to evaluate the performance of AlgMFDL. As a result, Alg-MFDL achieved an accuracy of 0.973, a precision of 0.996, a sensitivity of 0.951, and an F1 value of 0.973, outperforming the most of current state-of-the-art (SOTA) methods across all key metrics. We anticipated that the proposed model could be considered a useful tool for predicting allergen proteins.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] DeepPD: A Deep Learning Method for Predicting Peptide Detectability Based on Multi-feature Representation and Information Bottleneck
    Li, Fenglin
    Bin, Yannan
    Zhao, Jianping
    Zheng, Chunhou
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2025, 17 (01) : 200 - 214
  • [42] A multi-feature deep learning system to enhance glaucoma severity diagnosis with high accuracy and fast speed
    Xuea, Ying
    Zhu, Jiazhu
    Huang, Xiaoling
    Xuc, Xiaobin
    Li, Xiaojing
    Zhengc, Yameng
    Zhu, Zhijing
    Jin, Kai
    Ye, Juan
    Gong, Wei
    Si, Ke
    JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 136
  • [43] An effective deep learning method with multi-feature and attention mechanism for recognition of Chinese rice variety information
    Yu, Helong
    Li, Ziqing
    Bi, Chunguang
    Chen, Huiling
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (11) : 15725 - 15745
  • [44] An effective deep learning method with multi-feature and attention mechanism for recognition of Chinese rice variety information
    Helong Yu
    Ziqing Li
    Chunguang Bi
    Huiling Chen
    Multimedia Tools and Applications, 2022, 81 : 15725 - 15745
  • [45] A novel combined multi-variate prediction framework for air pollution based on feature selection and deep learning models
    Bai, Lu
    Du, Pei
    Wang, Shubin
    Li, Hongmin
    Wang, Jianzhou
    PROCESS SAFETY AND ENVIRONMENTAL PROTECTION, 2025, 193 : 1157 - 1172
  • [46] A multi-feature selection fused with investor sentiment for stock price prediction
    Zhen, Kehan
    Xie, Dan
    Hu, Xiaochun
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 278
  • [47] Joint Multi-Feature Fusion and Attribute Relationships for Facial Attribute Prediction
    Wang, Pingyu
    Su, Fei
    Zhao, Zhicheng
    2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,
  • [48] Multi-Feature Traffic Prediction Based on Signaling Information for Cellular Network
    Liu, Jiaxin
    Tang, Xiongyan
    Zhu, Guangxu
    Cheng, Xinzhou
    Zhao, Liqiang
    Li, Hang
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (02) : 2280 - 2291
  • [49] An evolutional deep learning method based on multi-feature fusion for fault diagnosis in sucker rod pumping system
    Li, Juanni
    Shao, Jun
    Wang, Wei
    Xie, Wenhao
    ALEXANDRIA ENGINEERING JOURNAL, 2023, 66 : 343 - 355
  • [50] Automatic cucumber recognition algorithm for harvesting robots in the natural environment using deep learning and multi-feature fusion
    Mao, Shihan
    Li, Yuhua
    Ma, You
    Zhang, Baohua
    Zhou, Jun
    Wang, Kai
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2020, 170