Primary structure similarity analysis of proteins sequences by a new graphical representation

Cited by: 7
Authors
Xu, S. C. [1]
Li, Z. [1]
Zhang, S. P. [2]
Hu, J. L. [1]
Affiliations
[1] Zhejiang Sci Tech Univ, Coll Sci, Hangzhou, Zhejiang, Peoples R China
[2] Univ Nebraska, Dept Stat, Lincoln, NE 68583 USA
Keywords
curvature; shape analysis; torsion; protein sequence; cubic B-spline curve; DNA-SEQUENCES; PHYSICOCHEMICAL PROPERTIES; SIMILARITY/DISSIMILARITY
DOI
10.1080/1062936X.2014.955055
Chinese Library Classification (CLC) number
O6 [Chemistry]
Discipline classification code
0703
Abstract
A new graphical description of the primary structure of protein sequences is introduced. First, a discrete set of points in three-dimensional space is created for a protein sequence, based on three main physicochemical properties of the amino acids. Second, a continuous cubic B-spline curve interpolating the amino acid points is constructed to represent the shape of the protein sequence. Third, the geometric properties of the continuous curve (curvature and torsion) are extracted in order to analyze the similarity between protein sequences. Finally, an improved Canberra distance is introduced for the similarity analysis of protein sequences of different lengths. Experimental results show that the method is effective for comparing the similarity of protein sequences.
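The two geometric quantities and the distance named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not describe the B-spline construction or the "improved" Canberra distance, so the code below uses only the textbook formulas for curvature and torsion of a space curve and the standard Canberra distance for equal-length vectors. The helper names (`curvature_torsion`, `canberra`, `helix_derivatives`) are hypothetical, and a circular helix stands in for the interpolating curve because its curvature and torsion are known in closed form.

```python
import math


def cross(u, v):
    """Cross product of two 3D vectors given as tuples."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def dot(u, v):
    """Dot product of two vectors of equal length."""
    return sum(a * b for a, b in zip(u, v))


def curvature_torsion(d1, d2, d3):
    """Curvature and torsion of a curve r(t) from r'(t), r''(t), r'''(t).

    kappa = |r' x r''| / |r'|^3
    tau   = (r' x r'') . r''' / |r' x r''|^2
    Torsion is reported as 0.0 where r' x r'' vanishes (assumption made
    here to avoid division by zero; it is undefined at such points).
    """
    c = cross(d1, d2)
    c_norm = math.sqrt(dot(c, c))
    speed = math.sqrt(dot(d1, d1))
    kappa = c_norm / speed ** 3
    tau = dot(c, d3) / c_norm ** 2 if c_norm > 0 else 0.0
    return kappa, tau


def canberra(x, y):
    """Standard Canberra distance between two equal-length vectors.

    Terms with |x_i| + |y_i| == 0 are skipped, a common convention.
    """
    total = 0.0
    for a, b in zip(x, y):
        denom = abs(a) + abs(b)
        if denom > 0:
            total += abs(a - b) / denom
    return total


def helix_derivatives(t, a=2.0, b=1.0):
    """First three derivatives of the helix (a cos t, a sin t, b t).

    Stand-in test curve only; the paper's curve is a cubic B-spline.
    """
    d1 = (-a * math.sin(t), a * math.cos(t), b)
    d2 = (-a * math.cos(t), -a * math.sin(t), 0.0)
    d3 = (a * math.sin(t), -a * math.cos(t), 0.0)
    return d1, d2, d3


if __name__ == "__main__":
    kappa, tau = curvature_torsion(*helix_derivatives(0.7))
    print(kappa, tau)
    print(canberra([1.0, 3.0], [3.0, 1.0]))
```

For a helix with radius a and pitch parameter b, the curvature is a / (a² + b²) and the torsion is b / (a² + b²), which gives a quick sanity check on `curvature_torsion`. In a pipeline like the one the abstract outlines, the derivative triples would instead come from the fitted spline, and curvature or torsion profiles sampled along two curves would be the vectors passed to a distance function.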
Pages: 791-803
Number of pages: 13
Related papers
50 in total
  • [1] Similarity/Dissimilarity Analysis of Protein Sequences by a New Graphical Representation
    Huang, Guohua
    Hu, Jerry
    CURRENT BIOINFORMATICS, 2013, 8 (05) : 539 - 544
  • [2] A new graphical representation of similarity/dissimilarity studies of protein sequences
    He, P.
    SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2010, 21 (5-6) : 571 - 580
  • [3] Similarity analysis of protein sequences based on a new graphical representation method
    Zhang, Yuyan
    Wen, Jia
    COMMUNICATIONS IN INFORMATION AND SYSTEMS, 2018, 18 (03) : 193 - 208
  • [4] A novel graphical representation and similarity analysis of protein sequences based on physicochemical properties
    Mahmoodi-Reihani, Mehri
    Abbasitabar, Fatemeh
    Zare-Shahabadi, Vahid
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2018, 510 : 477 - 485
  • [5] A Novel Method of 3D Graphical Representation and Similarity Analysis for Proteins
    Li, Zhong
    Geng, Changchun
    He, Pingan
    Yao, Yuhua
    MATCH-COMMUNICATIONS IN MATHEMATICAL AND IN COMPUTER CHEMISTRY, 2014, 71 (01) : 213 - 226
  • [6] Similarity/Dissimilarity Analysis of Protein Sequences Based on a New Spectrum-Like Graphical Representation
    Yao, Yuhua
    Yan, Shoujiang
    Xu, Huimin
    Han, Jianning
    Nan, Xuying
    He, Ping-an
    Dai, Qi
    EVOLUTIONARY BIOINFORMATICS, 2014, 10 : 87 - 96
  • [7] A new graphical representation and its application in similarity/dissimilarity analysis of DNA sequences
    Luo, Jiawei
    Guo, Jiachen
    Li, Yang
    2010 4TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING (ICBBE 2010), 2010,
  • [8] Graphical Representation and Similarity Analysis of Protein Sequences Based on Fractal Interpolation
    Hu, Hailong
    Li, Zhong
    Dong, Hongwei
    Zhou, Tianhe
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2017, 14 (01) : 182 - 192
  • [9] Graphical Representation and Similarity Analysis of DNA Sequences Based on Trigonometric Functions
    Xie, Guo-Sen
    Jin, Xiao-Bo
    Yang, Chunlei
    Pu, Jiexin
    Mo, Zhongxi
    ACTA BIOTHEORETICA, 2018, 66 (02) : 113 - 133
  • [10] 3D-PAF Curve: A Novel Graphical Representation of Protein Sequences for Similarity Analysis
    Mu, Zengchao
    Li, Guojun
    Wu, Haiyan
    Qi, Xingqin
    MATCH-COMMUNICATIONS IN MATHEMATICAL AND IN COMPUTER CHEMISTRY, 2016, 75 (02) : 447 - 462