DUC-Curve, a highly compact 2D graphical representation of DNA sequences and its application in sequence alignment

被引:13
作者
Li, Yushuang [1 ]
Liu, Qian [1 ]
Zheng, Xiaoqi [2 ]
机构
[1] Yanshan Univ, Coll Sci, Qinhuangdao 066004, Peoples R China
[2] Shanghai Normal Univ, Dept Math, Shanghai 200234, Peoples R China
基金
中国国家自然科学基金;
关键词
Graphical representation; DNA sequence; DUC-Curve; Sequence alignment; Sequence comparison; CHAOS-GAME REPRESENTATION; NUMERICAL CHARACTERIZATION; PLACENTAL MAMMALS; SIMILARITY ANALYSIS; DUAL NUCLEOTIDES; INTERORDINAL RELATIONSHIPS; PROTEIN SEQUENCES; MAJOR CLADES; 2-D; MAP;
D O I
10.1016/j.physa.2016.03.061
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
A highly compact and simple 2D graphical representation of DNA sequences, named DUC-Curve, is constructed through mapping four nucleotides to a unit circle with a cyclic order. DUC-Curve could directly detect nucleotide, di-nucleotide compositions and microsatellite structure from DNA sequences. Moreover, it also could be used for DNA sequence alignment. Taking geometric center vectors of DUC-Curves as sequence descriptor, we perform similarity analysis on the first exons of beta-globin genes of 11 species, oncogene TP53 of 27 species and twenty-four Influenza A viruses, respectively. The obtained reasonable results illustrate that the proposed method is very effective in sequence comparison problems, and will at least play a complementary role in classification and clustering problems. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:256 / 270
页数:15
相关论文
共 66 条
[1]   Comparative analysis of amino acid repeats in rodents and humans [J].
Albà, MM ;
Guigó, R .
GENOME RESEARCH, 2004, 14 (04) :549-554
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   Vector representation and its application of DNA sequences based on nucleotide triplet codons [J].
Bai, FengLan ;
Zhang, JiHong ;
Zheng, JunSheng ;
Li, Chao ;
Liu, LiWei .
JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2015, 62 :150-156
[4]   Effective Encoding for DNA Sequence Visualization Based on Nucleotide's Ring Structure [J].
Bari, A. T. M. Golam ;
Reaz, Rokeya ;
Islam, A. K. M. Tauhidul ;
Choi, Ho-Jin ;
Jeong, Byeong-Soo .
EVOLUTIONARY BIOINFORMATICS, 2013, 9 :251-261
[5]   2D-dynamic representation of DNA sequences [J].
Bielinska-Waz, Dorota ;
Clark, Timothy ;
Waz, Piotr ;
Nowak, Wieslaw ;
Nandy, Ashesh .
CHEMICAL PHYSICS LETTERS, 2007, 442 (1-3) :140-144
[6]   Graphical and numerical representations of DNA sequences: statistical aspects of similarity [J].
Bielinska-Waz, Dorota .
JOURNAL OF MATHEMATICAL CHEMISTRY, 2011, 49 (10) :2345-2407
[7]   Four-component spectral representation of DNA sequences [J].
Bielinska-Waz, Dorota .
JOURNAL OF MATHEMATICAL CHEMISTRY, 2010, 47 (01) :41-51
[8]   A 3D Graphical Representation of DNA Sequence Based on Numerical Coding Method [J].
Cao, Zhi ;
Li, Renfa ;
Chen, Weiyang .
INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY, 2010, 110 (05) :975-980
[9]   A novel 2D graphical representation of DNA sequences and its application [J].
Dai, Qi ;
Liu, Xiaoqing ;
Wang, Tianming .
JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2006, 25 (03) :340-344
[10]   SIMPLER DNA-SEQUENCE REPRESENTATIONS [J].
GATES, MA .
NATURE, 1985, 316 (6025) :219-219