Classification of enzyme function from protein sequence based on feature representation

被引:0
|
作者
Lee, Bum Ju [1 ]
Lee, Jong Yun [2 ]
Lee, Heon Gu [1 ]
Ryu, Keun Ho [1 ]
机构
[1] Chungbuk Natl Univ, Database Bioinformat Lab, Chungju, South Korea
[2] Chungbuk Natl Univ, Dept Comp Educ, Chungju, South Korea
关键词
enzyme function; function classification; feature extraction; amino acid composition; attribute selection; machine learning; protein classification; feature analysis;
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Enzymes are the proteins that accelerate the rate of chemical reaction, and both their structures and dynamics may be important to their function of catalyzing biochemical reactions. For the function prediction and classification of enzymes, many methods based on sequence similarity to detect similar proteins have been developed. However, these methods often miscarry in the case of the absence of similar sequences or poor similarity among proteins. Therefore, many researchers have been developing alternative approaches that assign function from protein features without consideration of sequence similarity. In this paper, we propose a method of sequence-driven feature extraction and enzyme functional classification using only the features of protein sequence, excluding predicted secondary structures and annotation information of protein databases. Our experimental results demonstrate that the enzyme classification based on the Chi-Squared ranking method among various attribute selection methods is efficient. Also, we find that amino acid composition of specific enzyme differs from composition of other enzymes.
引用
收藏
页码:741 / +
页数:3
相关论文
共 50 条
  • [21] MANIFOLD: protein fold recognition based on secondary structure, sequence similarity and enzyme classification
    Bindewald, E
    Cestaro, A
    Hesser, J
    Heiler, M
    Tosatto, SCE
    PROTEIN ENGINEERING, 2003, 16 (11): : 785 - 789
  • [22] Efficient Feature Selection and Classification of Protein Sequence Data in Bioinformatics
    Iqbal, Muhammad Javed
    Faye, Ibrahima
    Samir, Brahim Belhaouari
    Said, Abas Md
    SCIENTIFIC WORLD JOURNAL, 2014,
  • [23] A feature-based trust sequence classification algorithm
    Yahyaoui, Hamdi
    Al-Mutairi, Aisha
    INFORMATION SCIENCES, 2016, 328 : 455 - 484
  • [24] From protein sequence to function
    Danchin, A
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 1999, 9 (03) : 363 - 367
  • [25] Conjoint Feature Representation of GO and Protein Sequence for PPI Prediction Based on an Inception RNN Attention Network
    Zhao, Lingling
    Wang, Junjie
    Hu, Yang
    Cheng, Liang
    MOLECULAR THERAPY NUCLEIC ACIDS, 2020, 22 : 198 - 208
  • [26] Estimation of Position Specific Energy as a Feature of Protein Residues from Sequence Alone for Structural Classification
    Iqbal, Sumaiya
    Hoque, Md Tamjidul
    PLOS ONE, 2016, 11 (09):
  • [27] Classification of sequence signatures: a guide to Hox protein function
    Merabet, Samir
    Hudry, Bruno
    Saadaoui, Mehdi
    Graba, Yacine
    BIOESSAYS, 2009, 31 (05) : 500 - 511
  • [28] Cluster Based Symbolic Representation and Feature Selection for Text Classification
    Harish, B. S.
    Guru, D. S.
    Manjunath, S.
    Dinesh, R.
    ADVANCED DATA MINING AND APPLICATIONS (ADMA 2010), PT II, 2010, 6441 : 158 - 166
  • [29] Hashing Based Hierarchical Feature Representation for Hyperspectral Imagery Classification
    Pan, Bin
    Shi, Zhenwei
    Xu, Xia
    Yang, Yi
    REMOTE SENSING, 2017, 9 (11)
  • [30] FEATURE EXTRACTION AND CLASSIFICATION OF POLSAR IMAGES BASED ON SPARSE REPRESENTATION
    Zhang, Lamei
    Sun, Liangjie
    Moon, Wooil M.
    2014 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2014,