Prediction of presynaptic and postsynaptic neurotoxins by combining various Chou's pseudo components

被引:25
作者
Huo, Haiyan [1 ]
Li, Tao [2 ]
Wang, Shiyuan [3 ]
Lv, Yingli [3 ]
Zuo, Yongchun [4 ]
Yang, Lei [3 ]
机构
[1] Hohhot Univ Nationalities, Dept Environm Engn, Hohhot 010051, Peoples R China
[2] Inner Mongolia Agr Univ, Coll Life Sc, Hohhot 010018, Peoples R China
[3] Harbin Med Univ, Coll Bioinformat Sci & Technol, Harbin 150081, Heilongjiang, Peoples R China
[4] Inner Mongolia Univ, Key Lab Mammalian Reprod Biol & Biotechnol, Minist Educ, Hohhot 010021, Peoples R China
来源
SCIENTIFIC REPORTS | 2017年 / 7卷
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
AMINO-ACID-COMPOSITION; SUPPORT VECTOR MACHINES; NAJA-NAJA-SPUTATRIX; CONOTOXIN SUPERFAMILY; PROTEIN-SEQUENCE; NEUROMUSCULAR ACTIVITY; FEATURE-SELECTION; DIFFERENT MODES; CDNA CLONING; WEB SERVER;
D O I
10.1038/s41598-017-06195-y
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Presynaptic and postsynaptic neurotoxins are two groups of neurotoxins. Identification of presynaptic and postsynaptic neurotoxins is an important work for numerous newly found toxins. It is both costly and time consuming to determine these two neurotoxins by experimental methods. As a complement, using computational methods for predicting presynaptic and postsynaptic neurotoxins could provide some useful information in a timely manner. In this study, we described four algorithms for predicting presynaptic and postsynaptic neurotoxins from sequence driven features by using Increment of Diversity (ID), Multinomial Naive Bayes Classifier (MNBC), Random Forest (RF), and K-nearest Neighbours Classifier (IBK). Each protein sequence was encoded by pseudo amino acid (PseAA) compositions and three biological motif features, including MEME, Prosite and InterPro motif features. The Maximum Relevance Minimum Redundancy (MRMR) feature selection method was used to rank the PseAA compositions and the 50 top ranked features were selected to improve the prediction accuracy. The PseAA compositions and three kinds of biological motif features were combined and 12 different parameters that defined as P1-P12 were selected as the input parameters of ID, MNBC, RF, and IBK. The prediction results obtained in this study were significantly better than those of previously developed methods.
引用
收藏
页数:10
相关论文
共 81 条
  • [41] Liu B., 2017, Nat. Sci, V09, P67, DOI [10.4236/ns.2017.94007, DOI 10.4236/NS.2017.94007]
  • [42] Pse-Analysis: a python']python package for DNA/RNA and protein/peptide sequence analysis based on pseudo components and kernel methods
    Liu, Bin
    Wu, Hao
    Zhang, Deyuan
    Wang, Xiaolong
    Chou, Kuo-Chen
    [J]. ONCOTARGET, 2017, 8 (08) : 13338 - 13343
  • [43] iDHS-EL: identifying DNase I hypersensitive sites by fusing three different modes of pseudo nucleotide composition into an ensemble learning framework
    Liu, Bin
    Long, Ren
    Chou, Kuo-Chen
    [J]. BIOINFORMATICS, 2016, 32 (16) : 2411 - 2418
  • [44] Liu B, 2017, BIOINFORMATICS, V33, P35, DOI [10.1093/bioinformatics/btw539, 10.1093/bioinformatics/btv604]
  • [45] Pse-in-One: a web server for generating various modes of pseudo components of DNA, RNA, and protein sequences
    Liu, Bin
    Liu, Fule
    Wang, Xiaolong
    Chen, Junjie
    Fang, Longyun
    Chou, Kuo-Chen
    [J]. NUCLEIC ACIDS RESEARCH, 2015, 43 (W1) : W65 - W71
  • [46] pRNAm-PC: Predicting N6-methyladenosine sites in RNA sequences via physical-chemical properties
    Liu, Zi
    Xiao, Xuan
    Yu, Dong-Jun
    Jia, Jianhua
    Qiu, Wang-Ren
    Chou, Kuo-Chen
    [J]. ANALYTICAL BIOCHEMISTRY, 2016, 497 : 60 - 67
  • [47] Identification of presynaptic neurotoxin complexes in the venoms of three Australian copperheads (Austrelaps spp.) and the efficacy of tiger snake antivenom to prevent or reverse neurotoxicity
    Marcon, Francesca
    Nicholson, Graham M.
    [J]. TOXICON, 2011, 58 (05) : 439 - 452
  • [48] Predicting antimicrobial peptides with improved accuracy by incorporating the compositional, physico-chemical and structural features into Chou's general PseAAC
    Meher, Prabina Kumar
    Sahu, Tanmaya Kumar
    Saini, Varsha
    Rao, Atmakuri Ramakrishna
    [J]. SCIENTIFIC REPORTS, 2017, 7
  • [49] Pseudo amino acid composition and multi-class support vector machines approach for conotoxin superfamily classification
    Mondal, Sukanta
    Bhavna, Rajasekaran
    Babu, Rajasekaran Mohan
    Ramakumar, Suryanarayanarao
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2006, 243 (02) : 252 - 260
  • [50] How do presynaptic PLA2 neurotoxins block nerve terminals?
    Montecucco, C
    Rossetto, O
    [J]. TRENDS IN BIOCHEMICAL SCIENCES, 2000, 25 (06) : 266 - 270