A Homology and Pseudo Amino Acid Composition-based Multi-label Model for Predicting Human Membrane Protein Types

被引：1

作者：

Huang, Yanjun ^{[1
]}

Huang, Guohua ^{[2
,3
]}

机构：

[1] Shaoyang Univ, Coll Sport, Shaoyang 422000, Hunan, Peoples R China

[2] Shaoyang Univ, Prov Key Lab Informat Serv Rural Area Southwester, Shaoyang 422000, Hunan, Peoples R China

[3] Shaoyang Univ, Coll Informat Engn, Shaoyang 422000, Hunan, Peoples R China

来源：

CURRENT PROTEOMICS | 2018年 / 15卷 / 02期

基金：

中国国家自然科学基金;

关键词：

BLAST; membrane protein type; multiple label; nearest neighbor algorithm; pseudo amino acid composition; sequence homology; PHYSICOCHEMICAL PROPERTIES; RESOURCE UNIPROT; GENERAL-FORM; CLASSIFIER; SEQUENCES; TOPOLOGY; FEATURES; DATABASE; PSSM; SVM;

D O I：

10.2174/1570164614666171030162205

中图分类号：

Q5 [生物化学];

学科分类号：

071010 ; 081704 ;

摘要：

Background: Membrane proteins are embedded into biological membranes and interact with them, playing a large range of roles from transporting materials to catalyzing interactions in the cellular processes. The functions of membrane proteins are closely associated with types they belong to. Membrane proteins have simultaneously more than one type, but most of the computational predictions can deal with only one type. Objective and Method: To bridge the gap, we proposed a multi-label method based on the sequence homology and pseudo amino acid composition for predicting human membrane protein types. The method is a two-step decision. The uncharacterized membrane protein firstly was aligned against the database consisting of membrane proteins with known types and types of the most homological membrane protein were transferred to it. If it had no homological membrane protein, the pseudo amino acid composition-based method was used to predict its types. Results: The predictive accuracies of the leave-one-out cross-validation test on these three benchmark datasets are 0.8817, 0.8206 and 0.7276, respectively, better than our previous algorithm. We collected 5752 manually reviewed human membrane proteins with annotated types as the training set, and developed a program MemPred for predicting multi-label types of membrane proteins. Conclusion: We have proposed a multi-label computational method for predicting membrane protein types and achieved a better performance. The advantage of the proposed method is that it can predict simultaneously more than one type.

引用

页码：135 / 141

页数：7

共 50 条

[21] Predicting membrane protein types by incorporating protein topology, domains, signal peptides, and physicochemical properties into the general form of Chou's pseudo amino acid composition
Chen, Yen-Kuang
Li, Kuo-Bin
JOURNAL OF THEORETICAL BIOLOGY, 2013, 318 : 1 - 12
[22] Protein Remote Homology Detection by Combining Chou's Pseudo Amino Acid Composition and Profile-Based Protein Representation
Liu, Bin
Wang, Xiaolong
Zou, Quan
Dong, Qiwen
Chen, Qingcai
MOLECULAR INFORMATICS, 2013, 32 (9-10) : 775 - 782
[23] Protein remote homology detection by combining Chou's distance-pair pseudo amino acid composition and principal component analysis
Liu, Bin
Chen, Junjie
Wang, Xiaolong
MOLECULAR GENETICS AND GENOMICS, 2015, 290 (05) : 1919 - 1931
[24] Predicting Human Enzyme Family Classes by Using Pseudo Amino Acid Composition
Wu, Yun
Tang, Hua
Chen, Wei
Lin, Hao
CURRENT PROTEOMICS, 2016, 13 (02) : 99 - 104
[25] Predicting the Subcellular Localization of Multi-site Protein Based on Fusion Feature and Multi-label Deep Forest Model
Yang, Hongri
Meng, Qingfang
Chen, Yuehui
Zhong, Lianxin
INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2022, PT II, 2022, 13394 : 334 - 344
[26] A Multilabel Model Based on Chou’s Pseudo–Amino Acid Composition for Identifying Membrane Proteins with Both Single and Multiple Functional Types
Chao Huang
Jing-Qi Yuan
The Journal of Membrane Biology, 2013, 246 : 327 - 334
[27] Chou's pseudo amino acid composition improves sequence-based antifreeze protein prediction
Mondal, Sukanta
Pai, Priyadarshini P.
JOURNAL OF THEORETICAL BIOLOGY, 2014, 356 : 30 - 35
[28] PrESOgenesis: A two-layer multi-label predictor for identifying fertility-related proteins using support vector machine and pseudo amino acid composition approach
Bakhtiarizadeh, Mohammad Reza
Rahimi, Maryam
Mohammadi-Sangcheshmeh, Abdollah
Shariati, Vahid J.
Salami, Seyed Alireza
SCIENTIFIC REPORTS, 2018, 8
[29] Analyzes of the similarities of protein sequences based on the pseudo amino acid composition
Zhang, Yan-ping
Ruan, Ji-shuo
He, Ping-an
CHEMICAL PHYSICS LETTERS, 2013, 590 : 239 - 244
[30] Protein structural classification based on pseudo amino acid composition using SVM classifier
Krajewski, Zbigniew
Tkacz, Ewaryst
BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2013, 33 (02) : 77 - 87

← 1 2 3 4 5 →