Similarity/Dissimilarity Studies of Protein Sequences Based on a New 2D Graphical Representation

被引:50
作者
Yao, Yu-Hua [1 ]
Dai, Qi [2 ]
Li, Ling [3 ]
Nan, Xu-Ying [1 ]
He, Ping-An [1 ]
Zhang, Yao-Zhou [1 ]
机构
[1] Zhejiang Sci Tech Univ, Coll Life Sci, Hangzhou 310018, Peoples R China
[2] Hangzhou Dianzi Univ, Inst Biomed Engn & Instrumentat, Hangzhou 310018, Peoples R China
[3] Zhejiang Shuren Univ, Basic Courses Dept, Hangzhou 310015, Zhejiang, Peoples R China
关键词
similarity/dissimilarity; protein; graphical representation; descriptor; coefficient of determination; HYDROPHOBIC CLUSTER-ANALYSIS; CHAOS GAME REPRESENTATIONS; DNA PRIMARY SEQUENCES; NUMERICAL CHARACTERIZATION; DISTANCE MATRICES; LOW DEGENERACY; NUCLEOTIDE; DESCRIPTORS; INFORMATION; PROTEOMICS;
D O I
10.1002/jcc.21391
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
A (two-dimensional) 21) graphical representation of protein sequences based oil six physicochemical properties of amino acids is Outlined. The numerical characterization of protein graphs is given its descriptors of protein sequences. It is not Only useful for comparative Study of proteins but also for encoding innate information about the structure of Proteins. The Coefficient of determination is proposed as a new similarity/dissimilarity measure. Finally, a simple example is taken to highlight the behavior of the new similarity/dissimilarity measure on protein sequences taken from the ND6 (NADH dehydrogenase subunit 6) proteins for eight different species. The results demonstrate the approach is convenient, fast, and efficient. (C) 2009 Wiley Periodicals, Inc. J Comput Chem 31: 1045-1052, 2010
引用
收藏
页码:1045 / 1052
页数:8
相关论文
共 56 条
  • [1] On graphical and numerical representation of protein sequences
    Bai, FL
    Wang, TM
    [J]. JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2006, 23 (05) : 537 - 545
  • [2] BARANIDHARAN S, 1994, INT J GENOME RES, V1, P309
  • [3] Chaos game representation of proteins
    Basu, S
    Pan, A
    Dutta, C
    Das, J
    [J]. JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 1997, 15 (05) : 279 - 289
  • [4] A novel 2D graphical representation of DNA sequences and its application
    Dai, Qi
    Liu, Xiaoqing
    Wang, Tianming
    [J]. JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2006, 25 (03) : 340 - 344
  • [5] Characterization of protein primary sequences based on partial ordering
    Feng, Jie
    Wang, Tian-Ming
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2008, 254 (04) : 752 - 755
  • [6] HYDROPHOBIC CLUSTER-ANALYSIS - AN EFFICIENT NEW WAY TO COMPARE AND ANALYZE AMINO-ACID-SEQUENCES
    GABORIAUD, C
    BISSERY, V
    BENCHETRIT, T
    MORNON, JP
    [J]. FEBS LETTERS, 1987, 224 (01): : 149 - 155
  • [7] A SIMPLE WAY TO LOOK AT DNA
    GATES, MA
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 1986, 119 (03) : 319 - 328
  • [8] NUCLEOTIDE, DINUCLEOTIDE AND TRINUCLEOTIDE FREQUENCIES EXPLAIN PATTERNS OBSERVED IN CHAOS GAME REPRESENTATIONS OF DNA-SEQUENCES
    GOLDMAN, N
    [J]. NUCLEIC ACIDS RESEARCH, 1993, 21 (10) : 2487 - 2491
  • [9] Proteomics, networks and connectivity indices
    Gonzalez-Diaz, Humberto
    Gonzalez-Diaz, Yenny
    Santana, Lourdes
    Ubeira, Florencio M.
    Uriarte, Eugenio
    [J]. PROTEOMICS, 2008, 8 (04) : 750 - 778
  • [10] Medicinal chemistry and bioinformatics -: Current trends in drugs discovery with networks topological indices
    Gonzalez-Diaz, Humberto
    Vilar, Santiago
    Santana, Lourdes
    Uriarte, Eugenio
    [J]. CURRENT TOPICS IN MEDICINAL CHEMISTRY, 2007, 7 (10) : 1015 - 1029