mACPpred 2.0: Stacked Deep Learning for Anticancer Peptide Prediction with Integrated Spatial and Probabilistic Feature Representations

被引:8
作者
Sangaraju, Vinoth Kumar [1 ]
Pham, Nhat Truong [1 ]
Wei, Leyi [2 ]
Yu, Xue [3 ]
Manavalan, Balachandran [1 ]
机构
[1] Sungkyunkwan Univ, Coll Biotechnol & Bioengn, Dept Integrat Biotechnol, Suwon 16419, Gyeonggi Do, South Korea
[2] Macao Polytech Univ, Fac Appl Sci, Macau, Peoples R China
[3] Beidahuang Ind Grp, Gen Hosp, Harbin 150001, Peoples R China
基金
新加坡国家研究基金会;
关键词
anticancer peptides; stacking deep learning; pre-trained natural language processing-based embeddings;
D O I
10.1016/j.jmb.2024.168687
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Anticancer peptides (ACPs), naturally occurring molecules with remarkable potential to target and kill cancer cells. However, identifying ACPs based solely from their primary amino acid sequences remains a major hurdle in immunoinformatics. In the past, several web-based machine learning (ML) tools have been proposed to assist researchers in identifying potential ACPs for further testing. Notably, our meta- approach method, mACPpred, introduced in 2019, has significantly advanced the field of ACP research. Given the exponential growth in the number of characterized ACPs, there is now a pressing need to create an updated version of mACPpred. To develop mACPpred 2.0, we constructed an up-to-date benchmarking dataset by integrating all publicly available ACP datasets. We employed a large-scale of feature descriptors, encompassing both conventional feature descriptors and advanced pre-trained natural language processing (NLP)-based embeddings. We evaluated their ability to discriminate between ACPs and non-ACPs using eleven different classifiers. Subsequently, we employed a stacked deep learning (SDL) approach, incorporating 1D convolutional neural network (1D CNN) blocks and hybrid features. These features included the top seven performing NLP-based features and 90 probabilistic features, allowing us to identify hidden patterns within these diverse features and improve the accuracy of our ACP prediction model. This is the first study to integrate spatial and probabilistic feature representations for predicting ACPs. Rigorous cross-validation and independent tests conclusively demonstrated that mACPpred 2.0 not only surpassed its predecessor (mACPpred) but also outperformed the existing state-of-the-art predictors, highlighting the importance of advanced feature representation capabilities attained through SDL. To facilitate widespread use and accessibility, we have developed a userfriendly for mACPpred 2.0, available at https://balalab-skku.org/mACPpred2/. (c) 2024 Elsevier Ltd. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
引用
收藏
页数:9
相关论文
共 34 条
  • [1] Abbas Z., 2018, Neoplasma, V1, P139, DOI 10.5772/intechopen.76558
  • [2] AntiCP 2.0: an updated model for predicting anticancer peptides
    Agrawal, Piyush
    Bhagat, Dhruv
    Mahalwal, Manish
    Sharma, Neelam
    Raghava, Gajendra P. S.
    [J]. BRIEFINGS IN BIOINFORMATICS, 2021, 22 (03)
  • [3] ACP-MHCNN: an accurate multi-headed deep-convolutional neural network to predict anticancer peptides
    Ahmed, Sajid
    Muhammod, Rafsanjani
    Khan, Zahid Hossain
    Adilina, Sheikh
    Sharma, Alok
    Shatabda, Swakkhar
    Dehzangi, Abdollah
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)
  • [4] Cancer chemotherapy and beyond: Current status, drug candidates, associated risks and progress in targeted therapeutics
    Anand, Uttpal
    Dey, Abhijit
    Chandel, Arvind K. Singh
    Sanyal, Rupa
    Mishra, Amarnath
    Pandey, Devendra Kumar
    De Falco, Valentina
    Upadhyay, Arun
    Kandimalla, Ramesh
    Chaudhary, Anupama
    Dhanjal, Jaspreet Kaur
    Dewanjee, Saikat
    Vallamkondu, Jayalakshmi
    de la Lastra, Jose M. Perez
    [J]. GENES & DISEASES, 2023, 10 (04) : 1367 - 1401
  • [5] Expediting the Design, Discovery and Development of Anticancer Drugs using Computational Approaches
    Basith, Shaherin
    Cui, Minghua
    Macalino, Stephani J. Y.
    Choi, Sun
    [J]. CURRENT MEDICINAL CHEMISTRY, 2017, 24 (42) : 4753 - 4778
  • [6] mACPpred: A Support Vector Machine-Based Meta-Predictor for Identification of Anticancer Peptides
    Boopathi, Vinothini
    Subramaniyam, Sathiyamoorthy
    Malik, Adeel
    Lee, Gwang
    Manavalan, Balachandran
    Yang, Deok-Chun
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2019, 20 (08)
  • [7] [Anonymous], 2020, CA Cancer J Clin, V70, P313, DOI [10.3322/caac.21492, 10.3322/caac.21609]
  • [8] CancerGram: An Effective Classifier for Differentiating Anticancer from Antimicrobial Peptides
    Burdukiewicz, Michal
    Sidorczuk, Katarzyna
    Rafacz, Dominik
    Pietluch, Filip
    Bakala, Mateusz
    Slowik, Jadwiga
    Gagat, Przemyslaw
    [J]. PHARMACEUTICS, 2020, 12 (11) : 1 - 14
  • [9] Characterization and identification of antimicrobial peptides with different functional activities
    Chung, Chia-Ru
    Kuo, Ting-Rung
    Wu, Li-Ching
    Lee, Tzong-Yi
    Horng, Jorng-Tzong
    [J]. BRIEFINGS IN BIOINFORMATICS, 2020, 21 (03) : 1098 - 1114
  • [10] ACP-MLC: A two-level prediction engine for identification of anticancer peptides and multi-label classification of their functional types
    Deng, Hua
    Ding, Meng
    Wang, Yimeng
    Li, Weihua
    Liu, Guixia
    Tang, Yun
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 158