Similarity analysis of DNA sequences based on the generalized LZ complexity of (0,1)-sequences

被引:10
作者
Li, Chun [1 ,2 ]
Wang, Jun [3 ]
机构
[1] Bohai Univ, Dept Math, Jinzhou 121000, Peoples R China
[2] Dalian Univ Technol, Dept Appl Math, Dalian 116024, Peoples R China
[3] Dalian Univ Technol, Coll Adv Sci & Technol, Dept Appl Math, Dalian 116024, Peoples R China
基金
中国国家自然科学基金;
关键词
DNA; complexity; (0,1)-sequence; permutation;
D O I
10.1007/s10910-006-9176-8
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Based on the permutation of a binary alphabet, four generalized LZ complexities of a (0,1)-sequence are introduced. Since the logical representation of a DNA primary sequence includes four logical sequences, a DNA primary sequence can be characterized by a 16-D vector whose entries are the complexities corresponding to the logical sequences. The utility of the new quantitative characterization of DNA sequences is illustrated by an examination of the similarity among the full beta-globin genes of 11 different species.
引用
收藏
页码:26 / 31
页数:6
相关论文
共 29 条
[1]   Investigating extended regulatory regions of genomic DNA sequences [J].
Babenko, VN ;
Kosarev, PS ;
Vishnevsky, OV ;
Levitsky, VG ;
Basin, VV ;
Frolov, AS .
BIOINFORMATICS, 1999, 15 (7-8) :644-653
[2]   DNA invariants based on nonoverlapping triplets of nucleotide bases [J].
Balaban, AT ;
Plavsic, D ;
Randic, M .
CHEMICAL PHYSICS LETTERS, 2003, 379 (1-2) :147-154
[3]  
Cover T., 2003, ELEMENTS INFORM THEO
[4]  
Gusev VD, 1999, BIOINFORMATICS, V15, P994
[5]  
He P.-an, 2002, INTERNET ELECT J MOL, V1, P668
[6]   Characteristic sequences for DNA primary sequence [J].
He, PA ;
Wang, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (05) :1080-1085
[7]  
JI M, 2006, IN PRESS J MATH CHEM, V40
[8]  
JIANG T, 2002, CURRENT TOPICS COMPU, P157
[9]   COMPLEXITY OF FINITE SEQUENCES [J].
LEMPEL, A ;
ZIV, J .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1976, 22 (01) :75-81
[10]  
Li C, 2005, J CHEM INF MODEL, V45, P115, DOI 10.1021/ci0498741