A Homology and Pseudo Amino Acid Composition-based Multi-label Model for Predicting Human Membrane Protein Types

Cited by: 1
Authors
Huang, Yanjun [1]
Huang, Guohua [2, 3]
Affiliations
[1] Shaoyang Univ, Coll Sport, Shaoyang 422000, Hunan, Peoples R China
[2] Shaoyang Univ, Prov Key Lab Informat Serv Rural Area Southwester, Shaoyang 422000, Hunan, Peoples R China
[3] Shaoyang Univ, Coll Informat Engn, Shaoyang 422000, Hunan, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
BLAST; membrane protein type; multiple label; nearest neighbor algorithm; pseudo amino acid composition; sequence homology; PHYSICOCHEMICAL PROPERTIES; RESOURCE UNIPROT; GENERAL-FORM; CLASSIFIER; SEQUENCES; TOPOLOGY; FEATURES; DATABASE; PSSM; SVM;
DOI
10.2174/1570164614666171030162205
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: Membrane proteins are embedded in biological membranes and interact with them, playing a wide range of roles in cellular processes, from transporting materials to catalyzing interactions. The function of a membrane protein is closely associated with the types it belongs to. A membrane protein can belong to more than one type simultaneously, but most computational predictors can handle only one type.
Objective and Method: To bridge this gap, we proposed a multi-label method based on sequence homology and pseudo amino acid composition for predicting human membrane protein types. The method makes a two-step decision. An uncharacterized membrane protein was first aligned against a database of membrane proteins with known types, and the types of the most homologous membrane protein were transferred to it. If it had no homologous membrane protein, the pseudo amino acid composition-based method was used to predict its types.
Results: In the leave-one-out cross-validation test on three benchmark datasets, the predictive accuracies were 0.8817, 0.8206 and 0.7276, respectively, better than those of our previous algorithm. We collected 5752 manually reviewed human membrane proteins with annotated types as the training set and developed a program, MemPred, for predicting multi-label types of membrane proteins.
Conclusion: We have proposed a multi-label computational method for predicting membrane protein types that achieves better performance than our previous algorithm. Its advantage is that it can predict more than one type simultaneously.
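The two-step decision described in the abstract can be illustrated with a minimal Python sketch. It is not the MemPred implementation. Several parts are assumptions made only for illustration: the position-wise `identity` score stands in for a BLAST alignment, the plain 20-dimensional amino acid composition stands in for the full pseudo amino acid composition, the `min_identity` threshold is hypothetical, and the type labels in the usage note are made up. The fallback uses a nearest neighbor rule because the keywords list a nearest neighbor algorithm.

```python
from collections import Counter
from math import sqrt

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(seq):
    """20-dimensional amino acid composition (simplified stand-in for PseAAC)."""
    counts = Counter(seq)
    total = sum(counts[a] for a in AMINO_ACIDS) or 1
    return [counts[a] / total for a in AMINO_ACIDS]


def identity(a, b):
    """Toy homology score: fraction of matching positions (stand-in for BLAST)."""
    if not a or not b:
        return 0.0
    matches = sum(x == y for x, y in zip(a, b))
    return matches / max(len(a), len(b))


def predict_types(query, training, min_identity=0.4):
    """Return (set of predicted types, name of the step that produced them).

    training: non-empty list of (sequence, set_of_types) pairs.
    min_identity: hypothetical cutoff for calling a hit homologous.
    """
    # Step 1: transfer all types of the most homologous known protein.
    best_seq, best_types = max(training, key=lambda item: identity(query, item[0]))
    if identity(query, best_seq) >= min_identity:
        return set(best_types), "homology"

    # Step 2: no homolog found, so fall back to the nearest neighbor
    # in composition space (Euclidean distance).
    q = composition(query)

    def distance(item):
        v = composition(item[0])
        return sqrt(sum((x - y) ** 2 for x, y in zip(q, v)))

    _, nn_types = min(training, key=distance)
    return set(nn_types), "composition"
```

With a hypothetical training set `[("MKTLLLAVAV", {"single-pass"}), ("GGGGSSSSGG", {"multi-pass", "peripheral"})]`, the query `"MKTLLLAVAA"` is resolved by the homology step, while `"SSSSSGGGGG"` falls through to the composition step and receives both labels of its nearest neighbor. Because each step copies a whole label set, a single call can return more than one type.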
Pages: 135-141
Number of pages: 7