A Homology and Pseudo Amino Acid Composition-based Multi-label Model for Predicting Human Membrane Protein Types

被引:1
|
作者
Huang, Yanjun [1 ]
Huang, Guohua [2 ,3 ]
机构
[1] Shaoyang Univ, Coll Sport, Shaoyang 422000, Hunan, Peoples R China
[2] Shaoyang Univ, Prov Key Lab Informat Serv Rural Area Southwester, Shaoyang 422000, Hunan, Peoples R China
[3] Shaoyang Univ, Coll Informat Engn, Shaoyang 422000, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
BLAST; membrane protein type; multiple label; nearest neighbor algorithm; pseudo amino acid composition; sequence homology; PHYSICOCHEMICAL PROPERTIES; RESOURCE UNIPROT; GENERAL-FORM; CLASSIFIER; SEQUENCES; TOPOLOGY; FEATURES; DATABASE; PSSM; SVM;
D O I
10.2174/1570164614666171030162205
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Membrane proteins are embedded into biological membranes and interact with them, playing a large range of roles from transporting materials to catalyzing interactions in the cellular processes. The functions of membrane proteins are closely associated with types they belong to. Membrane proteins have simultaneously more than one type, but most of the computational predictions can deal with only one type. Objective and Method: To bridge the gap, we proposed a multi-label method based on the sequence homology and pseudo amino acid composition for predicting human membrane protein types. The method is a two-step decision. The uncharacterized membrane protein firstly was aligned against the database consisting of membrane proteins with known types and types of the most homological membrane protein were transferred to it. If it had no homological membrane protein, the pseudo amino acid composition-based method was used to predict its types. Results: The predictive accuracies of the leave-one-out cross-validation test on these three benchmark datasets are 0.8817, 0.8206 and 0.7276, respectively, better than our previous algorithm. We collected 5752 manually reviewed human membrane proteins with annotated types as the training set, and developed a program MemPred for predicting multi-label types of membrane proteins. Conclusion: We have proposed a multi-label computational method for predicting membrane protein types and achieved a better performance. The advantage of the proposed method is that it can predict simultaneously more than one type.
引用
收藏
页码:135 / 141
页数:7
相关论文
共 50 条
  • [1] Predicting membrane protein types by fusing composite protein sequence features into pseudo amino acid composition
    Hayat, Maqsood
    Khan, Asifullah
    JOURNAL OF THEORETICAL BIOLOGY, 2011, 271 (01) : 10 - 17
  • [2] Fuzzy KNN for predicting membrane protein types from pseudo-amino acid composition
    Shen, Hong-Bin
    Yang, Jie
    Chou, Kuo-Chen
    JOURNAL OF THEORETICAL BIOLOGY, 2006, 240 (01) : 9 - 13
  • [3] ProClusEnsem: Predicting membrane protein types by fusing different modes of pseudo amino acid composition
    Wang, Jingyan
    Li, Yongping
    Wang, Quanquan
    You, Xinge
    Man, Jiaju
    Wang, Chao
    Gao, Xin
    COMPUTERS IN BIOLOGY AND MEDICINE, 2012, 42 (05) : 564 - 574
  • [4] MultiP-SChlo: multi-label protein subchloroplast localization prediction with Chou's pseudo amino acid composition and a novel multi-label classifier
    Wang, Xiao
    Zhang, Weiwei
    Zhang, Qiuwen
    Li, Guo-Zheng
    BIOINFORMATICS, 2015, 31 (16) : 2639 - 2645
  • [5] Weighted-support vector machines for predicting membrane protein types based on pseudo-amino acid composition
    Wang, M
    Yang, J
    Liu, GP
    Xu, ZJ
    Chou, KC
    PROTEIN ENGINEERING DESIGN & SELECTION, 2004, 17 (06) : 509 - 516
  • [6] A Multilabel Model Based on Chou's Pseudo-Amino Acid Composition for Identifying Membrane Proteins with Both Single and Multiple Functional Types
    Huang, Chao
    Yuan, Jing-Qi
    JOURNAL OF MEMBRANE BIOLOGY, 2013, 246 (04) : 327 - 334
  • [7] Classification of membrane protein types using Voting Feature Interval in combination with Chou's Pseudo Amino Acid Composition
    Ali, Farman
    Hayat, Maqsood
    JOURNAL OF THEORETICAL BIOLOGY, 2015, 384 : 78 - 83
  • [8] Predicting protein-protein interactions by weighted pseudo amino acid composition
    Goktepe, Yunus Emre
    Ilhan, Ilhan
    Kahramanli, Sirzat
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2016, 15 (03) : 272 - 290
  • [9] Predicting Viral Protein Subcellular Localization with Chou's Pseudo Amino Acid Composition and Imbalance-Weighted Multi-Label K-Nearest Neighbor Algorithm
    Cao, Jun-Zhe
    Liu, Wen-Qi
    Gu, Hong
    PROTEIN AND PEPTIDE LETTERS, 2012, 19 (11) : 1163 - 1169
  • [10] A Multi-label Classifier for Prediction Membrane Protein Functional Types in Animal
    Zou, Hong-Liang
    JOURNAL OF MEMBRANE BIOLOGY, 2014, 247 (11) : 1141 - 1148