A Homology and Pseudo Amino Acid Composition-based Multi-label Model for Predicting Human Membrane Protein Types

Cited by: 1
Authors
Huang, Yanjun [1]
Huang, Guohua [2, 3]
Affiliations
[1] Shaoyang Univ, Coll Sport, Shaoyang 422000, Hunan, Peoples R China
[2] Shaoyang Univ, Prov Key Lab Informat Serv Rural Area Southwester, Shaoyang 422000, Hunan, Peoples R China
[3] Shaoyang Univ, Coll Informat Engn, Shaoyang 422000, Hunan, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
BLAST; membrane protein type; multiple label; nearest neighbor algorithm; pseudo amino acid composition; sequence homology; PHYSICOCHEMICAL PROPERTIES; RESOURCE UNIPROT; GENERAL-FORM; CLASSIFIER; SEQUENCES; TOPOLOGY; FEATURES; DATABASE; PSSM; SVM;
DOI
10.2174/1570164614666171030162205
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Background: Membrane proteins are embedded in biological membranes and interact with them, playing a wide range of roles in cellular processes, from transporting materials to catalyzing interactions. The function of a membrane protein is closely associated with the type or types it belongs to. A membrane protein can belong to more than one type simultaneously, but most computational predictors can handle only one type. Objective and Method: To bridge this gap, we proposed a multi-label method based on sequence homology and pseudo amino acid composition for predicting human membrane protein types. The method makes a two-step decision. The uncharacterized membrane protein is first aligned against a database of membrane proteins with known types, and the types of the most homologous membrane protein are transferred to it. If it has no homologous membrane protein, the pseudo amino acid composition-based method is used to predict its types. Results: In leave-one-out cross-validation tests on three benchmark datasets, the predictive accuracies were 0.8817, 0.8206 and 0.7276, respectively, better than those of our previous algorithm. We collected 5752 manually reviewed human membrane proteins with annotated types as the training set and developed a program, MemPred, for predicting multi-label types of membrane proteins. Conclusion: We have proposed a multi-label computational method for predicting membrane protein types that achieves better performance. Its advantage is that it can predict more than one type simultaneously.
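The two-step decision described in the abstract can be illustrated with a minimal Python sketch. It is not the MemPred implementation, and several parts are assumptions made for illustration only: `difflib.SequenceMatcher` stands in for a BLAST alignment, the `threshold` value deciding whether a homolog exists is hypothetical, the pseudo amino acid composition uses a single property (Kyte-Doolittle hydropathy) with illustrative `lam` and `weight` settings, the fallback is a 1-nearest-neighbor search with Euclidean distance, and `TOY_TRAINING` is made-up data.

```python
import math
from difflib import SequenceMatcher

# Kyte-Doolittle hydropathy values; using this single property is an assumption.
KD = {"A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5, "Q": -3.5, "E": -3.5,
      "G": -0.4, "H": -3.2, "I": 4.5, "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8,
      "P": -1.6, "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2}
AMINO_ACIDS = sorted(KD)
_MEAN = sum(KD.values()) / len(KD)
_SD = math.sqrt(sum((v - _MEAN) ** 2 for v in KD.values()) / len(KD))
H = {aa: (v - _MEAN) / _SD for aa, v in KD.items()}  # standardized property


def pse_aac(seq, lam=2, weight=0.05):
    """Return a (20 + lam)-dimensional pseudo amino acid composition vector."""
    residues = [r for r in seq.upper() if r in H]
    n = len(residues)
    if n <= lam:
        raise ValueError("sequence must be longer than lam")
    freqs = [residues.count(aa) / n for aa in AMINO_ACIDS]
    # Sequence-order correlation factor for each gap k = 1..lam.
    thetas = [
        sum((H[residues[i]] - H[residues[i + k]]) ** 2 for i in range(n - k)) / (n - k)
        for k in range(1, lam + 1)
    ]
    denom = sum(freqs) + weight * sum(thetas)
    return [f / denom for f in freqs] + [weight * t / denom for t in thetas]


def similarity(a, b):
    """Toy stand-in for a BLAST score: similarity ratio in [0, 1]."""
    return SequenceMatcher(None, a, b, autojunk=False).ratio()


def predict_types(query, training, threshold=0.8, lam=2):
    """Two-step decision: homology transfer first, PseAAC nearest neighbor otherwise.

    `training` is a list of (sequence, set_of_types) pairs.
    Returns (predicted_types, name_of_step_used).
    """
    # Step 1: transfer all types of the most similar known protein, if similar enough.
    scored = [(similarity(query, seq), labels) for seq, labels in training]
    best_score, best_labels = max(scored, key=lambda item: item[0])
    if best_score >= threshold:
        return set(best_labels), "homology"
    # Step 2: no homolog found, so use the nearest neighbor in PseAAC space.
    q_vec = pse_aac(query, lam)
    _, nn_labels = min(training, key=lambda item: math.dist(q_vec, pse_aac(item[0], lam)))
    return set(nn_labels), "pseaac"


# Made-up training data; each protein may carry more than one type label.
TOY_TRAINING = [
    ("LLLLIIIIVVVVAAAA", {"Multi-pass membrane protein", "Lipid-anchor"}),
    ("KKKKDDDDEEEERRRR", {"Peripheral membrane protein"}),
]
```

Calling `predict_types("LLLLIIIIVVVVAAAA", TOY_TRAINING)` takes the homology branch because the query matches a training sequence exactly, whereas a query such as `"VAILVAILVAILVAIL"` falls below the toy threshold and is resolved by the nearest neighbor in composition space. Because the whole label set of the matched protein is returned in either branch, the prediction is naturally multi-label.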
Pages: 135-141
Page count: 7