cACP: Classifying anticancer peptides using discriminative intelligent model via Chou's 5-step rules and general pseudo components

被引:64
作者
Akbar, Shahid [1 ]
Rahman, Ateeq Ur [1 ]
Hayat, Maqsood [1 ]
Sohail, Mohammad [2 ]
机构
[1] Abdul Wali Khan Univ Mandan, Dept Comp Sci, Kp 23200, Pakistan
[2] Univ Lahore, Dept Phys, Sargodha Campus, Sargodha 40100, Pakistan
关键词
Anticancer peptides; QSO; SVM; RF; PCA; AMINO-ACID-COMPOSITION; SEQUENCE-BASED PREDICTOR; PROTEIN SUBCELLULAR-LOCALIZATION; WEB SERVER; DIPEPTIDE COMPOSITION; IDENTIFICATION; PSEAAC; SITES; RNA; BIOINFORMATICS;
D O I
10.1016/j.chemolab.2019.103912
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
World widely, cancer is considered a fatal disease and remains the major cause of death. Conventional medication approaches using therapies and anticancer drugs are deemed ineffective due to its high cost and harmful impacts on the normal cells. However, the innovation of anticancer peptides (ACPs) provides an effective way how to deals with cancer affected cells. Due to the rapid increases in peptide sequences, truly characterization of ACPs has become a challenging task for investigators. In this paper, an effort has been carried out to develop a reliable and intelligent computational method for the accurate discrimination of anticancer peptides. Three statistical feature representation schemes namely: Quasisequence order (QSO), conjoint triad feature, and Geary autocorrelation descriptor are applied to express motif of the target class. In order to eradicate irrelevant and noisy features, while select salient, profound and high variated features, principal component analysis is employed. Furthermore, the diverse nature of learning algorithms is utilized in order to select the best operational engine for the proposed model. After examining the empirical outcomes, support vector machine obtained quite encouraging results in combination with QSO feature space. It has achieved an accuracy of 96.91% and 89.54% using the main dataset and alternative dataset, respectively. It is observed that our proposed model shows an outstanding improvement compared to literature methods. It is expected that the developed model may be played a useful role in research academia as well as proteomics and drug development.
引用
收藏
页数:8
相关论文
共 122 条
[51]   Antimicrobial peptides with selective antitumor mechanisms: prospect for anticancer applications [J].
Deslouches, Berthony ;
Di, Y. Peter .
ONCOTARGET, 2017, 8 (28) :46635-46651
[52]   PseAAC-General: Fast Building Various Modes of General Form of Chou's Pseudo-Amino Acid Composition for Large-Scale Protein Datasets [J].
Du, Pufeng ;
Gu, Shuwang ;
Jiao, Yasen .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2014, 15 (03) :3495-3506
[53]   PseAAC-Builder: A cross-platform stand-alone program for generating various special Chou's pseudo-amino acid compositions [J].
Du, Pufeng ;
Wang, Xin ;
Xu, Chao ;
Gao, Yang .
ANALYTICAL BIOCHEMISTRY, 2012, 425 (02) :117-119
[55]   TGF-beta Signaling in Cancer Treatment [J].
Fabregat, Isabel ;
Fernando, Joan ;
Mainez, Jessica ;
Sancho, Patricia .
CURRENT PHARMACEUTICAL DESIGN, 2014, 20 (17) :2934-2947
[56]   Random forests: from early developments to recent advancements [J].
Fawagreh, Khaled ;
Gaber, Mohamed Medhat ;
Elyan, Eyad .
SYSTEMS SCIENCE & CONTROL ENGINEERING, 2014, 2 (01) :602-609
[57]   iDNA6mA-PseKNC: Identifying DNA N6-methyladenosine sites by incorporating nucleotide physicochemical properties into PseKNC [J].
Feng, Pengmian ;
Yang, Hui ;
Ding, Hui ;
Lin, Hao ;
Chen, Wei ;
Chou, Kuo-Chen .
GENOMICS, 2019, 111 (01) :96-102
[58]   iRNA-PseColl: Identifying the Occurrence Sites of Different RNA Modifications by Incorporating Collective Effects of Nucleotides into PseKNC [J].
Feng, Pengmian ;
Ding, Hui ;
Yang, Hui ;
Chen, Wei ;
Lin, Hao ;
Chou, Kuo-Chen .
MOLECULAR THERAPY-NUCLEIC ACIDS, 2017, 7 :155-163
[59]   Estimates of worldwide burden of cancer in 2008: GLOBOCAN 2008 [J].
Ferlay, Jacques ;
Shin, Hai-Rim ;
Bray, Freddie ;
Forman, David ;
Mathers, Colin ;
Parkin, Donald Maxwell .
INTERNATIONAL JOURNAL OF CANCER, 2010, 127 (12) :2893-2917
[60]  
Geary R.C., 1954, Incorp Stat, V5, P115, DOI 10.2307/2986645