Nonlinear dimensionality reduction on 3-D protein image analysis

被引:0
作者
Wang, Guisong [1 ]
Kinser, Jason [1 ]
机构
[1] George Mason Univ, Dept Bioinformat & Computat Biol, Fairfax, VA 22030 USA
来源
2006 FORTIETH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1-5 | 2006年
关键词
D O I
10.1109/ACSSC.2006.354901
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A new perspective to investigate protein structures was introduced by applying Isomap algorithm on the protein 3-D structures. Through representing the location of each amino acid in the protein by its C alpha atom, the protein 3D structure was described as a set of points in 3D space. The reasonable nearest neighbors were defined through statistical amino acids contacts information and protein structure analysis interests. The lsomap algorithm on the 3D protein structure generated the meaningful protein geodesic structure. The residual variance analysis demonstrated that the best dimensionality for representing protein geodesic structure is 3D, but the computational results also showed that there is a nearly linear relationship between the geodesic shortest distance in original 3D and the Euclidean distance in the 2D constructed by the first two of Isomap's coordinates. The 2D protein geodesic representations can be used to measure the protein similarity and the analysis of the 2D protein geodesic representation image reveals the different protein secondary structures having different patterns. The lsomap's global coordinates provided a new approach to analyze the protein structures.
引用
收藏
页码:994 / +
页数:2
相关论文
共 22 条
  • [1] [Anonymous], 2 BIOKDD WORKSH DAT
  • [2] Laplacian eigenmaps for dimensionality reduction and data representation
    Belkin, M
    Niyogi, P
    [J]. NEURAL COMPUTATION, 2003, 15 (06) : 1373 - 1396
  • [3] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [4] Cox T., 2000, MULTIDIMENSIONAL SCA
  • [5] Low-dimensional, free-energy landscapes of protein-folding reactions by nonlinear dimensionality reduction
    Das, Payel
    Moll, Mark
    Stamati, Hernan
    Kavraki, Lydia E.
    Clementi, Cecilia
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (26) : 9885 - 9890
  • [6] Sample phenotype clusters in high-density oligonucleotide microarray data sets are revealed using Isomap, a nonlinear algorithm
    Dawson, K
    Rodriguez, RL
    Malyj, W
    [J]. BMC BIOINFORMATICS, 2005, 6 (1)
  • [7] Topological determinants of protein folding
    Dokholyan, NV
    Li, L
    Ding, F
    Shakhnovich, EI
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (13) : 8637 - 8641
  • [8] Nonlinear dimensionality reduction in climate data
    Gámez, AJ
    Zhou, CS
    Timmermann, A
    Kurths, J
    [J]. NONLINEAR PROCESSES IN GEOPHYSICS, 2004, 11 (03) : 393 - 398
  • [9] Jackson JE, 1991, A user's guide to principal components
  • [10] SOLUTION FOR BEST ROTATION TO RELATE 2 SETS OF VECTORS
    KABSCH, W
    [J]. ACTA CRYSTALLOGRAPHICA SECTION A, 1976, 32 (SEP1): : 922 - 923