Primary structure similarity analysis of proteins sequences by a new graphical representation

被引:7
作者
Xu, S. C. [1 ]
Li, Z. [1 ]
Zhang, S. P. [2 ]
Hu, J. L. [1 ]
机构
[1] Zhejiang Sci Tech Univ, Coll Sci, Hangzhou, Zhejiang, Peoples R China
[2] Univ Nebraska, Dept Stat, Lincoln, NE 68583 USA
关键词
curvature; shape analysis; torsion; protein sequence; cubic B-spline curve; DNA-SEQUENCES; PHYSICOCHEMICAL PROPERTIES; SIMILARITY/DISSIMILARITY;
D O I
10.1080/1062936X.2014.955055
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
A new graphical description of the primary structure of protein sequences is introduced. First, a three-dimensional space discrete point set of a protein sequence is created based on the three main physicochemical properties of the amino acids. Secondly, a continuous cubic B-spline curve interpolating the amino acid points is constructed to represent the shape of the protein sequence. Then the geometric properties (curvature and torsion) of the continuous curve are extracted for the purpose of analyzing the similarity between protein sequences. Finally, an improved Canberra distance comparison is introduced for the similarity analysis of protein sequences with different lengths. Experimental results show that our method is effective for the similarity comparison of protein sequences.
引用
收藏
页码:791 / 803
页数:13
相关论文
共 50 条
  • [21] 3D graphical representation of protein sequences and their statistical characterization
    el Maaty, Moheb I. Abo
    Abo-Elkhier, Mervat M.
    Abd Elwahaab, Marwa A.
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2010, 389 (21) : 4668 - 4676
  • [22] P-H Curve, a Graphical Representation of Protein Sequences for Similarities Analysis
    Liu, Yuxin
    Li, Dan
    Lu, Kebo
    Jiao, Yandong
    He, Ping-An
    MATCH-COMMUNICATIONS IN MATHEMATICAL AND IN COMPUTER CHEMISTRY, 2013, 70 (01) : 451 - 466
  • [23] A novel graphical representation of proteins and its application
    He, Ping-an
    Wei, Jinzhou
    Yao, Yuhua
    Tie, Zhixin
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2012, 391 (1-2) : 93 - 99
  • [24] On graphical and numerical representation of protein sequences
    Bai, FL
    Wang, TM
    JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2006, 23 (05) : 537 - 545
  • [25] Novel graphical representation of genome sequence and its applications in similarity analysis
    Yu, Hong-Jie
    Huang, De-Shuang
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2012, 391 (23) : 6128 - 6136
  • [26] Analysis of Similarities/Dissimilarities of DNA Sequences Based on a Novel Graphical Representation
    Yu, Jia-Feng
    Wang, Ji-Hua
    Sun, Xiao
    MATCH-COMMUNICATIONS IN MATHEMATICAL AND IN COMPUTER CHEMISTRY, 2010, 63 (02) : 493 - 512
  • [27] A time series representation of protein sequences for similarity comparison
    Li, Cancan
    Dai, Qi
    He, Ping-an
    JOURNAL OF THEORETICAL BIOLOGY, 2022, 538
  • [28] A new graphical representation of protein sequences based on dual-vector model
    Zhang, Zhujin, 1600, Springer Verlag (472): : 629 - 632
  • [29] A New Graphical Representation of Protein Sequences Based on Dual-Vector Model
    Zhang, Zhujin
    Zeng, Xiangxiang
    Chen, Zhihua
    Wang, Eric Ke
    BIO-INSPIRED COMPUTING - THEORIES AND APPLICATIONS, BIC-TA 2014, 2014, 472 : 629 - 632
  • [30] A 2D Graphical Representation of Protein Sequence and Their Similarity Analysis with Probabilistic Method
    Gupta, Manoj Kumar
    Niyogi, Rajdeep
    Misra, Manoj
    MATCH-COMMUNICATIONS IN MATHEMATICAL AND IN COMPUTER CHEMISTRY, 2014, 72 (02) : 519 - 532