共 24 条
Prediction of subcellular location of apoptosis proteins combining tri-gram encoding based on PSSM and recursive feature elimination
被引:17
作者:
Liu, Taigang
[1
]
Tao, Peiying
[2
]
Li, Xiaowei
[2
]
Qin, Yufang
[1
]
Wang, Chunhua
[1
]
机构:
[1] Shanghai Ocean Univ, Coll Informat Technol, Shanghai 201306, Peoples R China
[2] Shanghai Ocean Univ, Coll Food Sci & Technol, Shanghai 201306, Peoples R China
基金:
中国国家自然科学基金;
关键词:
Feature selection;
Position-specific score matrix;
Protein sequence representation;
Support vector machine;
AMINO-ACID-COMPOSITION;
LOCALIZATION;
D O I:
10.1016/j.jtbi.2014.11.010
中图分类号:
Q [生物科学];
学科分类号:
07 ;
0710 ;
09 ;
摘要:
Knowledge of apoptosis proteins plays an important role in understanding the mechanism of programmed cell death. Obtaining information on subcellular location of apoptosis proteins is very helpful to reveal the apoptosis mechanism and understand the function of apoptosis proteins. Because of the cost in time and labor associated with large-scale wet-bench experiments, computational prediction of apoptosis proteins subcellular location becomes very important and many computational tools have been developed in the recent decades. Existing methods differ in the protein sequence representation techniques and classification algorithms adopted. In this study, we firstly introduce a sequence encoding scheme based on tri-grams computed directly from position-specific score matrices, which incorporates evolution information represented in the PSI-BLAST profile and sequence-order information. Then SVM-RFE algorithm is applied for feature selection and reduced vectors are input to a support vector machine classifier to predict subcellular location of apoptosis proteins. Jackknife tests on three widely used datasets show that our method provides the state-of-the-art performance in comparison with other existing methods. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:8 / 12
页数:5
相关论文