4D Graphical representation research of DNA sequences

被引:10
作者
Tan, Chengjie [1 ]
Li, Shanshan [1 ]
Zhu, Ping [1 ]
机构
[1] Jiangnan Univ, Sch Sci, Wuxi 214122, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Euclidean distance; graphical representation; geometric center; similarity analysis; AMINO-ACID-COMPOSITION; CELLULAR-AUTOMATA; PROTEOMICS; CURVES;
D O I
10.1142/S1793524515500047
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Graphical representation of DNA sequences is a key component in studying biological problems. In order to gain new insights in DNA sequences, this paper combined the digitized methods of single-base, base pairs and coding in triplet bases with the times of base appearing, and then a novel 4D graphical representation method of DNA sequences was put forward. It was a one-to-one correspondence of the arbitrary DNA sequence and 4D graphical representation, that avoided causing non-unique 4D graphical representation and overlapping lines. The method could reflect the biological information features of DNA sequence more comprehensively and effectively without any losses. Based on the 4D graphical representation, we used the geometric center of 4D graphical representation as eigenvalue of DNA sequences analyses, which kept the original features of the data, and then established the Euclidean distances and included angles between vectors' terminal point for similarity analyses of the first extron of the beta-globulin gene among 11 species. Finally, we established the graph of systematic hierarchical cluster analysis of 11 species to observe more easily the relationship between species. A positive outcome was reached, and the results were in accord with biological taxonomy, which also supported the rationality and effectiveness of the novel 4D graphical representation.
引用
收藏
页数:12
相关论文
共 29 条
[1]   Proteomics, networks and connectivity indices [J].
Gonzalez-Diaz, Humberto ;
Gonzalez-Diaz, Yenny ;
Santana, Lourdes ;
Ubeira, Florencio M. ;
Uriarte, Eugenio .
PROTEOMICS, 2008, 8 (04) :750-778
[2]   3D-QSAR study for DNA cleavage proteins with a potential anti-tumor ATCUN-like motif [J].
Gonzalez-Diaz, Humberto ;
Sanchez-Gonzalez, Angeles ;
Gonzalez-Diaz, Yenny .
JOURNAL OF INORGANIC BIOCHEMISTRY, 2006, 100 (07) :1290-1297
[3]  
HAMORI E, 1983, J BIOL CHEM, V258, P1318
[4]   NOVEL DNA-SEQUENCE REPRESENTATIONS [J].
HAMORI, E .
NATURE, 1985, 314 (6012) :585-585
[5]  
Li G., 2008, SCI TECHNOL ENG, V6, P1405
[6]   A 4D representation of DNA sequences and its application [J].
Liao, B ;
Tan, MS ;
Ding, KQ .
CHEMICAL PHYSICS LETTERS, 2005, 402 (4-6) :380-383
[7]   New 2D graphical representation of DNA sequences [J].
Liao, B ;
Wang, TM .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2004, 25 (11) :1364-1368
[8]   3-D graphical representation of DNA sequences and their numerical characterization [J].
Liao, B ;
Wang, TM .
JOURNAL OF MOLECULAR STRUCTURE-THEOCHEM, 2004, 681 (1-3) :209-212
[9]   A 2D graphical representation of DNA sequence [J].
Liao, B .
CHEMICAL PHYSICS LETTERS, 2005, 401 (1-3) :196-199
[10]   Analysis of similarity/dissimilarity of DNA sequences based on a condensed curve representation [J].
Liao, B ;
Zhang, Y ;
Ding, KQ ;
Wang, TM .
JOURNAL OF MOLECULAR STRUCTURE-THEOCHEM, 2005, 717 (1-3) :199-203