Anticancer Peptides Classification Using Kernel Sparse Representation Classifier

被引:7
作者
Fazal, Ehtisham [1 ]
Ibrahim, Muhammad Sohail [2 ]
Park, Seongyong [3 ,4 ]
Naseem, Imran [5 ,6 ,7 ]
Wahab, Abdul [8 ]
机构
[1] Lambda Theta, Karachi 74270, Pakistan
[2] Zhejiang Univ, Coll Elect Engn, Hangzhou 310027, Peoples R China
[3] NCI, Canc Data Sci Lab, Ctr Canc Res, Natl Inst Hlth NIH, Bethesda, MD 20894 USA
[4] Univ Ulsan, Coll Med, Asan Med Ctr, Dept Anesthesiol & Pain Med, Songpa gu, Seoul 05505, South Korea
[5] Univ Western Australia, Sch Elect Elect & Comp Engn, Crawley, WA 6009, Australia
[6] Karachi Inst Econ & Technol, Coll Engn, Korangi Creek, Karachi 75190, Pakistan
[7] Love Data, Res & Dev, Karachi 75600, Pakistan
[8] Nazarbayev Univ, Dept Math, Astana 010000, Kazakhstan
关键词
Peptides; Amino acids; Cancer; Dictionaries; Encoding; Principal component analysis; Amino acid composition (AAC); anticancer peptide (ACP); composition of the K-spaced amino acid pairs (CKSAAP); kernel sparse reconstruction classification (KSRC) matching pursuit (MP); over-complete dictionary (OCD); sample-specific classification; PREDICTION; SELECTION; IACP; TOOL;
D O I
10.1109/ACCESS.2023.3246927
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cancer is one of the most challenging diseases because of its complexity, variability, and diversity of causes. It has been one of the major research topics over the past decades, yet it is still poorly understood. To this end, multifaceted therapeutic frameworks are indispensable. Anticancer peptides (ACPs) are the most promising treatment option, but their large-scale identification and synthesis require reliable prediction methods, which is still a problem. In this paper, we present an intuitive classification strategy that differs from the traditional black-box method and is based on the well-known statistical theory of sparse-representation classification (SRC). Specifically, we create over-complete dictionary matrices by embedding the composition of the K-spaced amino acid pairs (CKSAAP). Unlike the traditional SRC frameworks, we use an efficient matching pursuit solver instead of the computationally expensive basis pursuit solver in this strategy. Furthermore, the kernel principal component analysis (KPCA) is employed to cope with non-linearity and dimension reduction of the feature space whereas the synthetic minority oversampling technique (SMOTE) is used to balance the dictionary. The proposed method is evaluated on two benchmark datasets for well-known statistical parameters and is found to outperform the existing methods. The results show the highest sensitivity with the most balanced accuracy, which might be beneficial in understanding structural and chemical aspects and developing new ACPs. The Google-Colab implementation of the proposed method is available on the GitHub page (https://github.com/ehtisham-Fazal/ACP-Kernel-SRC).
引用
收藏
页码:17626 / 17637
页数:12
相关论文
共 58 条
[1]   AntiCP 2.0: an updated model for predicting anticancer peptides [J].
Agrawal, Piyush ;
Bhagat, Dhruv ;
Mahalwal, Manish ;
Sharma, Neelam ;
Raghava, Gajendra P. S. .
BRIEFINGS IN BIOINFORMATICS, 2021, 22 (03)
[2]   Investigation of the antimicrobial activity and hematological pattern of nano-chitosan and its nano-copper composite [J].
Ahmed, Somia B. ;
Mohamed, Hadeer, I ;
Al-Subaie, Abeer M. ;
Al-Ohali, Ahoud, I ;
Mahmoud, Nesrine M. R. .
SCIENTIFIC REPORTS, 2021, 11 (01)
[3]   cACP-DeepGram: Classification of anticancer peptides via deep neural network and skip-gram-based word embedding model [J].
Akbar, Shahid ;
Hayat, Maqsood ;
Tahir, Muhammad ;
Khan, Salman ;
Alarfaj, Fawaz Khaled .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2022, 131
[4]   cACP-2LFS: Classification of Anticancer Peptides Using Sequential Discriminative Model of KSAAP and Two-Level Feature Selection Approach [J].
Akbar, Shahid ;
Hayat, Maqsood ;
Tahir, Muhammad ;
Chong, Kil To .
IEEE ACCESS, 2020, 8 :131939-131948
[5]   cACP: Classifying anticancer peptides using discriminative intelligent model via Chou's 5-step rules and general pseudo components [J].
Akbar, Shahid ;
Rahman, Ateeq Ur ;
Hayat, Maqsood ;
Sohail, Mohammad .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2020, 196
[6]   iACP-GAEnsC: Evolutionary genetic algorithm based ensemble classification of anticancer peptides by utilizing hybrid feature space [J].
Akbar, Shahid ;
Hayat, Maqsood ;
Iqbal, Muhammad ;
Jan, Mian Ahmad .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2017, 79 :62-70
[7]   ECM-LSE: Prediction of Extracellular Matrix Proteins Using Deep Latent Space Encoding of k-Spaced Amino Acid Pairs [J].
Al-Saggaf, Ubaid M. ;
Usman, Muhammad ;
Naseem, Imran ;
Moinuddin, Muhammad ;
Jiman, Ahmad A. ;
Alsaggaf, Mohammed U. ;
Alshoubaki, Hitham K. ;
Khan, Shujaat .
FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2021, 9
[8]   Multi-Kernel Fusion for RBF Neural Networks [J].
Atif, Syed Muhammad ;
Khan, Shujaat ;
Naseem, Imran ;
Togneri, Roberto ;
Bennamoun, Mohammed .
NEURAL PROCESSING LETTERS, 2023, 55 (02) :1045-1069
[9]   Comparison of Anticancer Drug Toxicities: Paradigm Shift in Adverse Effect Profile [J].
Basak, Debasish ;
Arrighi, Scott ;
Darwiche, Yasenya ;
Deb, Subrata .
LIFE-BASEL, 2022, 12 (01)
[10]   Expediting the Design, Discovery and Development of Anticancer Drugs using Computational Approaches [J].
Basith, Shaherin ;
Cui, Minghua ;
Macalino, Stephani J. Y. ;
Choi, Sun .
CURRENT MEDICINAL CHEMISTRY, 2017, 24 (42) :4753-4778