A Novel Feature Fusion Method for Predicting Protein Subcellular Localization with Multiple Sites

Cited by: 0
Authors
Wang, Dong [1]
Han, Shiyuan [1]
Qu, Xumi [1]
Bao, Wenzheng [1]
Chen, Yuehui [1]
Fan, Yuling [1]
Zhou, Jin [1]
Affiliation
[1] Univ Jinan, Sch Informat Sci & Engn, Jinan, Peoples R China
Source
2015 INTERNATIONAL CONFERENCE ON INFORMATIVE AND CYBERNETICS FOR COMPUTATIONAL SOCIAL SYSTEMS (ICCSS) | 2015
Keywords
pseudo amino acid composition; Gpos-mPloc; stereochemical properties; multi-label k nearest neighbor; evaluation metrics; LABEL; CLASSIFIER;
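DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract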
This paper proposes a novel feature fusion method for predicting protein subcellular localization at multiple sites. The protein encoding method combines several types of features. The first is amino acid composition. The second is pseudo amino acid composition, which mainly extracts the position information of each amino acid residue in the protein sequence. Finally, local sequence information of the amino acids is taken into account. k nearest neighbor, support vector machines, and other methods have commonly been used for protein subcellular localization prediction. In this work, the multi-label k nearest neighbor algorithm is employed as the classification model. The overall accuracy can reach 66.7304% on the Gpos-mPloc dataset.
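As an illustration of the first feature type named in the abstract, amino acid composition is conventionally the 20-dimensional vector of residue frequencies in a sequence. The sketch below is a minimal version under that assumption; the function name `amino_acid_composition`, the handling of non-standard residues, and the example sequence are hypothetical and are not taken from the paper.

```python
from collections import Counter
from typing import List

# The 20 standard amino acids in one-letter code (fixed order for the vector).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def amino_acid_composition(sequence: str) -> List[float]:
    """Return the 20-dimensional frequency vector of standard residues.

    Assumption: characters outside the 20 standard residues (e.g. 'X')
    are ignored rather than counted.
    """
    seq = sequence.upper()
    counts = Counter(ch for ch in seq if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [counts[aa] / total for aa in AMINO_ACIDS]


if __name__ == "__main__":
    # Hypothetical toy sequence, used only to show the call.
    vector = amino_acid_composition("MKTAYIAKQR")
    print(len(vector), round(sum(vector), 6))
```

A vector of this kind could be concatenated with the pseudo amino acid composition and local sequence features before being passed to a multi-label classifier; how the paper actually fuses the features is not stated in the abstract.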
Pages: 15-19
Number of pages: 5
Related papers
8 items in total
[1]   Some remarks on predicting multi-label attributes in molecular biosystems [J].
Chou, Kuo-Chen .
MOLECULAR BIOSYSTEMS, 2013, 9 (06) :1092-1100
[2]   Pseudo Amino Acid Composition and its Applications in Bioinformatics, Proteomics and System Biology [J].
Chou, Kuo-Chen .
CURRENT PROTEOMICS, 2009, 6 (04) :262-274
[3]  
Du PF, 2013, EXPERT REV PROTEOMIC, V10, P227, DOI: 10.1586/epr.13.16
[4]   Predicting subcellular localization of proteins based on their N-terminal amino acid sequence [J].
Emanuelsson, O ;
Nielsen, H ;
Brunak, S ;
von Heijne, G .
JOURNAL OF MOLECULAR BIOLOGY, 2000, 300 (04) :1005-1016
[5]   mPLR-Loc: An adaptive decision multi-label classifier based on penalized logistic regression for protein subcellular localization prediction [J].
Wan, Shibiao ;
Mak, Man-Wai ;
Kung, Sun-Yuan .
ANALYTICAL BIOCHEMISTRY, 2015, 473 :14-27
[6]   iLoc-Virus: A multi-label learning classifier for identifying the subcellular localization of virus proteins with both single and multiple sites [J].
Xiao, Xuan ;
Wu, Zhi-Cheng ;
Chou, Kuo-Chen .
JOURNAL OF THEORETICAL BIOLOGY, 2011, 284 (01) :42-51
[7]   ML-KNN: A lazy learning approach to multi-label learning [J].
Zhang, Min-Ling ;
Zhou, Zhi-Hua .
PATTERN RECOGNITION, 2007, 40 (07) :2038-2048
[8]   MSLoc-DT: A new method for predicting the protein subcellular location of multispecies based on decision templates [J].
Zhang, Shao-Wu ;
Liu, Yan-Fang ;
Yu, Yong ;
Zhang, Ting-He ;
Fan, Xiao-Nan .
ANALYTICAL BIOCHEMISTRY, 2014, 449 :164-171