Prediction of N-linked glycosylation sites using position relative features and statistical moments

被引:77
作者
Akmal, Muhammad Aizaz [1 ]
Rasool, Nouman [2 ]
Khan, Yaser Daanial [1 ]
机构
[1] Univ Management & Technol, Sch Syst & Technol, Dept Comp Sci, Lahore, Pakistan
[2] Univ Management & Technol, Sch Sci, Dept Life Sci, Lahore, Pakistan
来源
PLOS ONE | 2017年 / 12卷 / 08期
关键词
ACTIVE FRACTION; PROTEIN; IDENTIFICATION; GLYCOPROTEINS;
D O I
10.1371/journal.pone.0181966
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Glycosylation is one of the most complex post translation modification in eukaryotic cells. Almost 50% of the human proteome is glycosylated as glycosylation plays a vital role in various biological functions such as antigen's recognition, cell-cell communication, expression of genes and protein folding. It is a significant challenge to identify glycosylation sites in protein sequences as experimental methods are time taking and expensive. A reliable computational method is desirable for the identification of glycosylation sites. In this study, a comprehensive technique for the identification of N-linked glycosylation sites has been proposed using machine learning. The proposed predictor was trained using an up-to-date dataset through back propagation algorithm for multilayer neural network. The results of ten-fold cross-validation and other performance measures such as accuracy, sensitivity, specificity and Mathew's correlation coefficient inferred that the accuracy of proposed tool is far better than the existing systems such as Glyomine, GlycoEP, Ensemble SVM and GPP.
引用
收藏
页数:21
相关论文
共 43 条
[11]   A Prediction Model for Membrane Proteins Using Moments Based Features [J].
Butt, Ahmad Hassan ;
Khan, Sher Afzal ;
Jamil, Hamza ;
Rasool, Nouman ;
Khan, Yaser Daanial .
BIOMED RESEARCH INTERNATIONAL, 2016, 2016
[12]   Glycosylation site prediction using ensembles of Support Vector Machine classifiers [J].
Caragea, Cornelia ;
Sinapov, Jivko ;
Silvescu, Adrian ;
Dobbs, Drena ;
Honavar, Vasant .
BMC BIOINFORMATICS, 2007, 8 (1)
[13]  
Chauhan JS, 2013, PLOS ONE, V8, DOI [10.1371/journal.pone.0067008, 10.1371/journal.pone.0080000]
[14]   Chemometrics-Based Approach to Feature Selection of Chromatographic Profiles and its Application to Search Active Fraction of Herbal Medicine [J].
Chen, Chao ;
Yuan, Jie ;
Li, Xiao-Jie ;
Shen, Zhi-Bin ;
Yu, Dao-Hai ;
Zhu, Jun-Fang ;
Zeng, Fan-Lin .
CHEMICAL BIOLOGY & DRUG DESIGN, 2013, 81 (06) :688-694
[15]   A support vector machine based pharmacodynamic prediction model for searching active fraction and ingredients of herbal medicine: Naodesheng prescription as an example [J].
Chen, Chao ;
Li, Shu-xian ;
Wang, Shu-mei ;
Liang, Sheng-wang .
JOURNAL OF PHARMACEUTICAL AND BIOMEDICAL ANALYSIS, 2011, 56 (02) :443-447
[16]   Detecting N6-methyladenosine sites from RNA transcriptomes using ensemble Support Vector Machines [J].
Chen, Wei ;
Xing, Pengwei ;
Zou, Quan .
SCIENTIFIC REPORTS, 2017, 7
[17]   PREDICTION OF PROTEIN STRUCTURAL CLASSES [J].
CHOU, KC ;
ZHANG, CT .
CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (04) :275-349
[18]   Prediction of glycosylation sites using random forests [J].
Hamby, Stephen E. ;
Hirst, Jonathan D. .
BMC BIOINFORMATICS, 2008, 9 (1)
[19]   NetOglyc: Prediction of mucin type O-glycosylation sites based on sequence context and surface accessibility [J].
Hansen, JE ;
Lund, O ;
Tolstrup, N ;
Gooley, AA ;
Williams, KL ;
Brunak, S .
GLYCOCONJUGATE JOURNAL, 1998, 15 (02) :115-130
[20]   Intracellular functions of N-linked glycans [J].
Helenius, A ;
Aebi, M .
SCIENCE, 2001, 291 (5512) :2364-2369