Local scaling and multifractal spectrum analyses of DNA sequences - GenBank data analysis

被引:9
作者
Su, Zhi-Yuan [1 ]
Wu, Tzuyin [2 ]
Wang, Shu-Yin [3 ]
机构
[1] Chia Nan Univ Pharm & Sci, Dept Informat Management, Tainan 717, Taiwan
[2] Natl Taiwan Univ, Dept Mech Engn, Taipei 106, Taiwan
[3] Chinese Culture Univ, Dept Anim Sci, Taipei 111, Taiwan
关键词
PROTEIN-CODING REGIONS; RANGE CORRELATION MEASURES; TIME-SERIES MODEL; NONCODING DNA; COMPLETE GENOMES; GENERALIZED DIMENSIONS; STATISTICAL PROPERTIES; SINGULARITY SPECTRUM; BLOODED VERTEBRATES; STRANGE ATTRACTORS;
D O I
10.1016/j.chaos.2007.09.078
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Base sequences of deoxyribonucleic acid (DNA) in all organism carry all the instructions regarding its growth and development. Oil the surface, such sequences seem irregular; yet in reality, they are symbolic sequences with all organized structure. This study investigates the characteristics of base arrangement and distribution in DNA sequences from the fractal theory viewpoint. In addition to multifractal features demonstrated by the DNA sequence, this study also compares the multifractal spectra derived from a particular family of gene among several different species. The results reveal that a considerable correlation exists between base distribution and evolutionary order. Furthermore, local scaling exponent (Holder exponent) differences between coding segments (exon) and non-coding segments (intron) are also examined. It is suggested that such differences in the local distribution of bases can be applied to find coding segments within the DNA sequence that is to be translated into protein. This local scaling analysis is feasible and has the potential to become an effective tool for rapid location of possible coding sites in DNA sequences. The authors hope that future studies using more complicated bioinformatics methods for analyzing DNA sequences call benefit from this study. (C) 2007 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1750 / 1765
页数:16
相关论文
共 51 条
[1]   Long- and short-range correlations in genome organization [J].
Almirantis, Y ;
Provata, A .
JOURNAL OF STATISTICAL PHYSICS, 1999, 97 (1-2) :233-262
[2]   Multifractal characterization of complete genomes [J].
Anh, V ;
Lau, KS ;
Yu, ZG .
JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 2001, 34 (36) :7127-7139
[3]   Recognition of an organism from fragments of its complete genome [J].
Anh, V.V. ;
Lau, K.S. ;
Yu, Z.G. .
Physical Review E - Statistical, Nonlinear, and Soft Matter Physics, 2002, 66 (03) :1-031910
[4]  
[Anonymous], 1983, FRACTAL GEOMETRY NAT
[5]   COMPOSITIONAL TRANSITIONS IN THE NUCLEAR GENOMES OF COLD-BLOODED VERTEBRATES [J].
BERNARDI, G ;
BERNARDI, G .
JOURNAL OF MOLECULAR EVOLUTION, 1990, 31 (04) :282-293
[6]   THE MOSAIC GENOME OF WARM-BLOODED VERTEBRATES [J].
BERNARDI, G ;
OLOFSSON, B ;
FILIPSKI, J ;
ZERIAL, M ;
SALINAS, J ;
CUNY, G ;
MEUNIERROTIVAL, M ;
RODIER, F .
SCIENCE, 1985, 228 (4702) :953-958
[7]   Multifractal and probabilistic properties of DNA sequences [J].
Bershadskii, A .
PHYSICS LETTERS A, 2001, 284 (2-3) :136-140
[8]   Generalized entropy and multifractality of time-series: relationship between order and intermittency [J].
Bickel, DR .
CHAOS SOLITONS & FRACTALS, 2002, 13 (03) :491-497
[9]   FRACTAL LANDSCAPES AND MOLECULAR EVOLUTION - MODELING THE MYOSIN HEAVY-CHAIN GENE FAMILY [J].
BULDYREV, SV ;
GOLDBERGER, AL ;
HAVLIN, S ;
PENG, CK ;
STANLEY, HE ;
STANLEY, MHR ;
SIMONS, M .
BIOPHYSICAL JOURNAL, 1993, 65 (06) :2673-2679
[10]   LONG-RANGE CORRELATION-PROPERTIES OF CODING AND NONCODING DNA-SEQUENCES - GENBANK ANALYSIS [J].
BULDYREV, SV ;
GOLDBERGER, AL ;
HAVLIN, S ;
MANTEGNA, RN ;
MATSA, ME ;
PENG, CK ;
SIMONS, M ;
STANLEY, HE .
PHYSICAL REVIEW E, 1995, 51 (05) :5084-5091