cACP: Classifying anticancer peptides using discriminative intelligent model via Chou's 5-step rules and general pseudo components

被引:64
作者
Akbar, Shahid [1 ]
Rahman, Ateeq Ur [1 ]
Hayat, Maqsood [1 ]
Sohail, Mohammad [2 ]
机构
[1] Abdul Wali Khan Univ Mandan, Dept Comp Sci, Kp 23200, Pakistan
[2] Univ Lahore, Dept Phys, Sargodha Campus, Sargodha 40100, Pakistan
关键词
Anticancer peptides; QSO; SVM; RF; PCA; AMINO-ACID-COMPOSITION; SEQUENCE-BASED PREDICTOR; PROTEIN SUBCELLULAR-LOCALIZATION; WEB SERVER; DIPEPTIDE COMPOSITION; IDENTIFICATION; PSEAAC; SITES; RNA; BIOINFORMATICS;
D O I
10.1016/j.chemolab.2019.103912
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
World widely, cancer is considered a fatal disease and remains the major cause of death. Conventional medication approaches using therapies and anticancer drugs are deemed ineffective due to its high cost and harmful impacts on the normal cells. However, the innovation of anticancer peptides (ACPs) provides an effective way how to deals with cancer affected cells. Due to the rapid increases in peptide sequences, truly characterization of ACPs has become a challenging task for investigators. In this paper, an effort has been carried out to develop a reliable and intelligent computational method for the accurate discrimination of anticancer peptides. Three statistical feature representation schemes namely: Quasisequence order (QSO), conjoint triad feature, and Geary autocorrelation descriptor are applied to express motif of the target class. In order to eradicate irrelevant and noisy features, while select salient, profound and high variated features, principal component analysis is employed. Furthermore, the diverse nature of learning algorithms is utilized in order to select the best operational engine for the proposed model. After examining the empirical outcomes, support vector machine obtained quite encouraging results in combination with QSO feature space. It has achieved an accuracy of 96.91% and 89.54% using the main dataset and alternative dataset, respectively. It is observed that our proposed model shows an outstanding improvement compared to literature methods. It is expected that the developed model may be played a useful role in research academia as well as proteomics and drug development.
引用
收藏
页数:8
相关论文
共 122 条
[1]  
Adeniyi D. A., 2016, Applied Computing and Informatics, V12, P90, DOI 10.1016/j.aci.2014.10.001
[2]   Prediction of Protein Submitochondrial Locations by Incorporating Dipeptide Composition into Chou's General Pseudo Amino Acid Composition [J].
Ahmad, Khurshid ;
Waris, Muhammad ;
Hayat, Maqsood .
JOURNAL OF MEMBRANE BIOLOGY, 2016, 249 (03) :293-304
[3]  
Akbar A., 2015, J APPL ENVIRON BIOL, V5, P28
[4]   iAFP-gap-SMOTE: An Efficient Feature Extraction Scheme Gapped Dipeptide Composition is Coupled with an Oversampling Technique for Identification of Antifreeze Proteins [J].
Akbar, Shahid ;
Hayat, Maqsood ;
Kabir, Muhammad ;
Iqbal, Muhammad .
LETTERS IN ORGANIC CHEMISTRY, 2019, 16 (04) :294-302
[5]   iMethyl-STTNC: Identification of N6-methyladenosine sites by extending the idea of SAAC into Chou's PseAAC to formulate RNA sequences [J].
Akbar, Shahid ;
Hayat, Maqsood .
JOURNAL OF THEORETICAL BIOLOGY, 2018, 455 :205-211
[6]   iACP-GAEnsC: Evolutionary genetic algorithm based ensemble classification of anticancer peptides by utilizing hybrid feature space [J].
Akbar, Shahid ;
Hayat, Maqsood ;
Iqbal, Muhammad ;
Jan, Mian Ahmad .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2017, 79 :62-70
[7]  
[Anonymous], 2017, arXiv
[8]  
[Anonymous], FEATURE SELECTION DI
[9]  
[Anonymous], GENOMICS
[10]  
[Anonymous], 2017, BIOMEDR R BIOCONDUCT