VesiMCNN: Using pre-trained protein language models and multiple window scanning convolutional neural networks to identify vesicular transport proteins

被引:0
|
作者
Le, Van The [1 ]
Tseng, Yi-Hsuan [1 ]
Liu, Yu-Chen [1 ]
Malik, Muhammad Shahid [1 ,3 ]
Ou, Yu-Yen [1 ,2 ]
机构
[1] Yuan Ze Univ, Dept Comp Sci & Engn, Chungli 32003, Taiwan
[2] Yuan Ze Univ, Grad Program Biomed Informat, Chungli 32003, Taiwan
[3] Karakoram Int Univ, Dept Comp Sci, Gilgit Baltistan 15100, Pakistan
关键词
Vesicular transport protein; Pre-trained protein language model; Multiple window scanning; IDENTIFICATION; PREDICTION; PROTTRANS;
D O I
10.1016/j.ijbiomac.2024.136048
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Vesicular transport is a critical cellular process responsible for the proper organization and functioning of eukaryotic cells. This mechanism relies on specialized vesicles that shuttle macromolecules, such as proteins, across the cellular landscape, a process pivotal to maintaining cellular homeostasis. Disruptions in vesicular transport have been linked to various disease mechanisms, including cancer and neurodegenerative disorders. In this study, we present vesiMCNN, a novel computational approach that integrates pre-trained protein language models with a multi-window scanning convolutional neural network architecture to accurately identify vesicular transport proteins. To the best of our knowledge, this is the first study to leverage the power of pretrained language models in combination with the multi-window scanning technique for this task. Our method achieved a Matthews Correlation Coefficient (MCC) of 0.558 and an Area Under the Receiver Operating Characteristic (AUC-ROC) of 0.933, outperforming existing state-of-the-art approaches. Additionally, we have curated a comprehensive benchmark dataset for the study of vesicular transport proteins, which can facilitate further research in this field. The remarkable performance of our model, combined with the comprehensive dataset and novel deep learning model, marks a significant advancement in the field of vesicular transport protein research.
引用
收藏
页数:10
相关论文
共 8 条
  • [1] Integrating Pre-Trained protein language model and multiple window scanning deep learning networks for accurate identification of secondary active transporters in membrane proteins
    Malik, Muhammad Shahid
    Ou, Yu-Yen
    METHODS, 2023, 220 : 11 - 20
  • [2] Integration of pre-trained protein language models into geometric deep learning networks
    Wu, Fang
    Wu, Lirong
    Radev, Dragomir
    Xu, Jinbo
    Li, Stan Z.
    COMMUNICATIONS BIOLOGY, 2023, 6 (01)
  • [3] UniproLcad: Accurate Identification of Antimicrobial Peptide by Fusing Multiple Pre-Trained Protein Language Models
    Wang, Xiao
    Wu, Zhou
    Wang, Rong
    Gao, Xu
    SYMMETRY-BASEL, 2024, 16 (04):
  • [4] Object detection and classification of butterflies using efficient CNN and pre-trained deep convolutional neural networks
    Mattins, R. Faerie
    Sarobin, M. Vergin Raja
    Aziz, Azrina Abd
    Srivarshan, S.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (16) : 48457 - 48482
  • [5] Concatenation of Pre-Trained Convolutional Neural Networks for Enhanced COVID-19 Screening Using Transfer Learning Technique
    El Gannour, Oussama
    Hamida, Soufiane
    Cherradi, Bouchaib
    Al-Sarem, Mohammed
    Raihani, Abdelhadi
    Saeed, Faisal
    Hadwan, Mohammed
    ELECTRONICS, 2022, 11 (01)
  • [6] Recent advances in features generation for membrane protein sequences: From multiple sequence alignment to pre-trained language models
    Ou, Yu-Yen
    Ho, Quang-Thai
    Chang, Heng-Ta
    PROTEOMICS, 2023, 23 (23-24)
  • [7] Detection of new coronavirus disease from chest x-ray images using pre-trained convolutional neural networks
    Narin, Ali
    Isler, Yalcin
    JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY, 2021, 36 (04): : 2095 - 2107
  • [8] Multi-stage glaucoma classification using pre-trained convolutional neural networks and voting-based classifier fusion
    Velpula, Vijaya Kumar
    Sharma, Lakhan Dev
    FRONTIERS IN PHYSIOLOGY, 2023, 14