Primary structure similarity analysis of proteins sequences by a new graphical representation

Cited by: 7
Authors
Xu, S. C. [1 ]
Li, Z. [1 ]
Zhang, S. P. [2 ]
Hu, J. L. [1 ]
Affiliations
[1] Zhejiang Sci Tech Univ, Coll Sci, Hangzhou, Zhejiang, Peoples R China
[2] Univ Nebraska, Dept Stat, Lincoln, NE 68583 USA
Keywords
curvature; shape analysis; torsion; protein sequence; cubic B-spline curve; DNA-SEQUENCES; PHYSICOCHEMICAL PROPERTIES; SIMILARITY/DISSIMILARITY;
DOI
10.1080/1062936X.2014.955055
Chinese Library Classification
O6 [Chemistry];
Discipline classification code
0703 ;
Abstract
A new graphical description of the primary structure of protein sequences is introduced. First, a discrete point set in three-dimensional space is created for a protein sequence, based on the three main physicochemical properties of the amino acids. Second, a continuous cubic B-spline curve interpolating the amino acid points is constructed to represent the shape of the protein sequence. Then the geometric properties of the continuous curve (curvature and torsion) are extracted in order to analyze the similarity between protein sequences. Finally, an improved Canberra distance is introduced for the similarity analysis of protein sequences of different lengths. Experimental results show that the method is effective for the similarity comparison of protein sequences.
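The two geometric quantities and the distance named in the abstract can be illustrated with a minimal sketch. It assumes the textbook formulas for the curvature and torsion of a space curve and the standard Canberra distance; the abstract does not specify the authors' improved variant or the three physicochemical properties, so neither is reproduced here. The function names (`curvature_torsion`, `canberra`) are hypothetical, and the derivative vectors are taken as given rather than computed from a fitted B-spline.

```python
import math


def cross(u, v):
    """Cross product of two 3-vectors."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def dot(u, v):
    """Dot product of two vectors of equal length."""
    return sum(a * b for a, b in zip(u, v))


def curvature_torsion(d1, d2, d3):
    """Curvature and torsion of a space curve at one parameter value.

    d1, d2, d3 are the first, second and third derivative vectors r', r'', r'''.
    kappa = |r' x r''| / |r'|^3
    tau   = (r' x r'') . r''' / |r' x r''|^2
    """
    c = cross(d1, d2)
    c_norm2 = dot(c, c)
    speed = math.sqrt(dot(d1, d1))
    kappa = math.sqrt(c_norm2) / speed ** 3
    # Torsion is undefined where the curvature vanishes; return 0.0 there (a convention).
    tau = dot(c, d3) / c_norm2 if c_norm2 > 0 else 0.0
    return kappa, tau


def canberra(x, y):
    """Standard Canberra distance between two equal-length feature vectors.

    This is NOT the paper's improved variant for sequences of different lengths.
    """
    if len(x) != len(y):
        raise ValueError("standard Canberra distance needs equal-length vectors")
    total = 0.0
    for a, b in zip(x, y):
        denom = abs(a) + abs(b)
        if denom > 0:  # terms with 0/0 are skipped by convention
            total += abs(a - b) / denom
    return total


# Check on a circular helix r(t) = (a cos t, a sin t, b t), whose curvature
# a / (a^2 + b^2) and torsion b / (a^2 + b^2) are constant.
a, b, t = 2.0, 1.0, 0.7
d1 = (-a * math.sin(t), a * math.cos(t), b)
d2 = (-a * math.cos(t), -a * math.sin(t), 0.0)
d3 = (a * math.sin(t), -a * math.cos(t), 0.0)
kappa, tau = curvature_torsion(d1, d2, d3)
print(kappa, tau)
print(canberra([1.0, 2.0, 3.0], [1.0, 4.0, 0.0]))
```

In a full pipeline the derivative vectors would come from the interpolating cubic B-spline, and the curvature and torsion values sampled along each curve would form the feature vectors that are compared.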
Pages: 791-803
Number of pages: 13