MLACP 2.0: An updated machine learning tool for anticancer peptide prediction

被引:0
|
作者
Phan, Le Thi [1 ]
Park, Hyun Woo [2 ]
Pitti, Thejkiran [1 ]
Madhavan, Thirumurthy [3 ]
Jeon, Young -Jun [2 ]
Manavalan, Balachandran [1 ]
机构
[1] Sungkyunkwan Univ, Coll Biotechnol & Bioengn, Dept Integrat Biotechnol, Computat Biol & Bioinformat Lab, Suwon 16419, Gyeonggi Do, South Korea
[2] Sungkyunkwan Univ, Coll Biotechnol & Bioengn, Dept Integrat Biotechnol, Suwon 16419, Gyeonggi Do, South Korea
[3] SRM Inst Sci & Technol, Dept Genet Engn, Computat Biol Lab, Kattankulathur 603203, Tamil Nadu, India
来源
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL | 2022年 / 20卷
基金
新加坡国家研究基金会;
关键词
Anticancer peptides; Convolutional neural network; Feature encodings; Conventional classifiers; Baseline models; Dataset construction;
D O I
暂无
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Anticancer peptides are emerging anticancer drug that offers fewer side effects and is more effective than chemotherapy and targeted therapy. Predicting anticancer peptides from sequence information is one of the most challenging tasks in immunoinformatics. In the past ten years, machine learning-based approaches have been proposed for identifying ACP activity from peptide sequences. These methods include our previous method MLACP (developed in 2017) which made a significant impact on anticancer research. MLACP tool has been widely used by the research community, however, its robustness must be improved significantly for its continued practical application. In this study, the first large non-redundant training and independent datasets were constructed for ACP research. Using the training dataset, the study explored a wide range of feature encodings and developed their respective models using seven dif-ferent conventional classifiers. Subsequently, a subset of encoding-based models was selected for each classifier based on their performance, whose predicted scores were concatenated and trained through a convolutional neural network (CNN), whose corresponding predictor is named MLACP 2.0. The evalu-ation of MLACP 2.0 with a very diverse independent dataset showed excellent performance and signifi-cantly outperformed the recent ACP prediction tools. Additionally, MLACP 2.0 exhibits superior performance during cross-validation and independent assessment when compared to CNN-based embed-ding models and conventional single models. Consequently, we anticipate that our proposed MLACP 2.0 will facilitate the design of hypothesis-driven experiments by making it easier to discover novel ACPs. The MLACP 2.0 is freely available at https://balalab-skku.org/mlacp2.(c) 2022 The Author(s). Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. This is an open access article under the CC BY-NC-ND license (http://creative-commons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:4473 / 4480
页数:8
相关论文
共 50 条
  • [1] MLACP 2.0: An updated machine learning tool for anticancer peptide prediction
    Phan, Le Thi
    Park, Hyun Woo
    Pitti, Thejkiran
    Madhavan, Thirumurthy
    Jeon, Young-Jun
    Manavalan, Balachandran
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 4473 - 4480
  • [2] MLACP: machine-learning-based prediction of anticancer peptides
    Manavalan, Balachandran
    Basith, Shaherin
    Shin, Tae Hwan
    Choi, Sun
    Kim, Myeong Ok
    Lee, Gwang
    ONCOTARGET, 2017, 8 (44) : 77121 - 77136
  • [3] mACPpred 2.0: Stacked Deep Learning for Anticancer Peptide Prediction with Integrated Spatial and Probabilistic Feature Representations
    Sangaraju, Vinoth Kumar
    Pham, Nhat Truong
    Wei, Leyi
    Yu, Xue
    Manavalan, Balachandran
    JOURNAL OF MOLECULAR BIOLOGY, 2024, 436 (17)
  • [4] Machine learning study for the prediction of transdermal peptide
    Eunkyoung Jung
    Seung-Hoon Choi
    Nam Kyung Lee
    Sang-Kee Kang
    Yun-Jaie Choi
    Jae-Min Shin
    Kihang Choi
    Dong Hyun Jung
    Journal of Computer-Aided Molecular Design, 2011, 25 : 339 - 347
  • [5] Machine learning study for the prediction of transdermal peptide
    Jung, Eunkyoung
    Choi, Seung-Hoon
    Lee, Nam Kyung
    Kang, Sang-Kee
    Choi, Yun-Jaie
    Shin, Jae-Min
    Choi, Kihang
    Jung, Dong Hyun
    JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2011, 25 (04) : 339 - 347
  • [6] AntiCP 2.0: an updated model for predicting anticancer peptides
    Agrawal, Piyush
    Bhagat, Dhruv
    Mahalwal, Manish
    Sharma, Neelam
    Raghava, Gajendra P. S.
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (03)
  • [7] Machine Learning as a Tool for Crop Yield Prediction
    P. K. Kutsenogiy
    V. K. Kalichkin
    A. L. Pakul
    S. P. Kutsenogiy
    Russian Agricultural Sciences, 2021, 47 (2) : 188 - 192
  • [8] MLASM: Machine learning based prediction of anticancer small molecules
    Balaji, Priya Dharshini
    Selvam, Subathra
    Sohn, Honglae
    Madhavan, Thirumurthy
    MOLECULAR DIVERSITY, 2024, 28 (04) : 2153 - 2161
  • [9] Evolution of Machine Learning Algorithms in the Prediction and Design of Anticancer Peptides
    Basith, Shaherin
    Manavalan, Balachandran
    Shin, Tae Hwan
    Lee, Da Yeon
    Lee, Gwang
    CURRENT PROTEIN & PEPTIDE SCIENCE, 2020, 21 (12) : 1242 - 1250
  • [10] ACPScanner: Prediction of Anticancer Peptides by Integrated Machine Learning Methodologies
    Zhong, Guolun
    Deng, Lei
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2024, 64 (03) : 1092 - 1104