Protein fold families prediction based on graph representations and machine learning methods

被引:0
作者
Areiza-Laverde, H. J. [1 ]
Mercado-Diaz, L. R. [1 ]
Castro-Ospina, A. E. [1 ]
Jaramillo-Garzon, J. A. [1 ]
机构
[1] Inst Tecnol Metropolitano, Medellin, Antioquia, Colombia
来源
2016 XXI SYMPOSIUM ON SIGNAL PROCESSING, IMAGES AND ARTIFICIAL VISION (STSIVA) | 2016年
关键词
STRUCTURAL CLASSIFICATION; STRUCTURE ALIGNMENT; DATABASE; SCOP;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Prediction of protein fold families remains an existing challenge in molecular biology and bioinformatics, mainly because proteins form a broad range of complex three-dimensional configurations and because the number of proteins registered in datasets has dramatically increased in the recent years. Computational alternatives must then be designed for substituting experimental methods. However, implementations of computational methods have found a problem to extract features that involve the physical-chemical attributes and spatial features of the protein to improve the accuracy in predictions. In this paper, we propose the use of graph theory for representing position of amino acids of the protein as graph nodes, and graph edges connect amino acids that are close to each other under a given threshold. In this way we can get very descriptive features related to spatial and physical-chemical properties of the proteins to describe their three-dimensional structure and so predict the protein fold families with a good accuracy.
引用
收藏
页数:6
相关论文
共 36 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] [Anonymous], SCI REPORTS
  • [3] A machine learning approach to predicting protein-ligand binding affinity with applications to molecular docking
    Ballester, Pedro J.
    Mitchell, John B. O.
    [J]. BIOINFORMATICS, 2010, 26 (09) : 1169 - 1175
  • [4] Announcing the worldwide Protein Data Bank
    Berman, H
    Henrick, K
    Nakamura, H
    [J]. NATURE STRUCTURAL BIOLOGY, 2003, 10 (12) : 980 - 980
  • [5] Analysis of microarray data using Z score transformation
    Cheadle, C
    Vawter, MP
    Freed, WJ
    Becker, KG
    [J]. JOURNAL OF MOLECULAR DIAGNOSTICS, 2003, 5 (02) : 73 - 81
  • [6] Csardi G., 2006, Inter Journal, Complex Systems, P1695
  • [7] DENNIS EA, 1994, J BIOL CHEM, V269, P13057
  • [8] Dhifli W., 2015, ARXIV E PRINTS
  • [9] Drug discovery: A historical perspective
    Drews, J
    [J]. SCIENCE, 2000, 287 (5460) : 1960 - 1964
  • [10] C-type lectin-like domains
    Drickamer, K
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 1999, 9 (05) : 585 - 590