Analysis of human DNA through power-law statistics

被引:21
作者
Costa, M. O. [1 ]
Silva, R. [1 ,2 ]
Anselmo, D. H. A. L. [2 ]
Silva, J. R. P. [1 ]
机构
[1] Univ Estado Rio Grande do Norte, Dept Fis, BR-59610210 Mossoro, Brazil
[2] Univ Fed Rio Grande do Norte, Dept Fis, BR-59072970 Natal, RN, Brazil
关键词
LONG-RANGE CORRELATIONS; GENERALIZED ENTROPIES; SIZE DISTRIBUTION; SEQUENCES; DISTRIBUTIONS;
D O I
10.1103/PhysRevE.99.022112
中图分类号
O35 [流体力学]; O53 [等离子体物理学];
学科分类号
070204 ; 080103 ; 080704 ;
摘要
We report an analysis of Homo sapiens DNA through the formalism of kappa statistics, which encompasses power-law correlations and provides an optimization principle that permits us to model distinct physical systems; i.e., the power-law distribution of the length of DNA bases is calculated from a general model which follows arguments similar to those proposed in Maxwell's deduction of statistical distributions. The viability of the model is tested using a data set from a catalog of proteins collected from the Ensembl Project. The results indicate that the short-range correlations, always present in coding DNA sequences, are appropriately captured through the Kaniadakis power-law distribution, adequately describing the cumulative length distribution of DNA bases, in contrast with the case of the traditional exponential statistical model.
引用
收藏
页数:8
相关论文
共 46 条
[1]   DNA-based nanobiostructured devices: The role of quasiperiodicity and correlation effects [J].
Albuquerque, E. L. ;
Fulco, U. L. ;
Freire, V. N. ;
Caetano, E. W. S. ;
Lyra, M. L. ;
de Moura, F. A. B. F. .
PHYSICS REPORTS-REVIEW SECTION OF PHYSICS LETTERS, 2014, 535 (04) :139-209
[2]   Long- and short-range correlations in genome organization [J].
Almirantis, Y ;
Provata, A .
JOURNAL OF STATISTICAL PHYSICS, 1999, 97 (1-2) :233-262
[3]   CHARACTERIZING LONG-RANGE CORRELATIONS IN DNA-SEQUENCES FROM WAVELET ANALYSIS [J].
ARNEODO, A ;
BACRY, E ;
GRAVES, PV ;
MUZY, JF .
PHYSICAL REVIEW LETTERS, 1995, 74 (16) :3293-3296
[4]   Long-range correlations in genomic DNA: A signature of the nucleosomal structure [J].
Audit, B ;
Thermes, C ;
Vaillant, C ;
d'Aubenton-Carafa, Y ;
Muzy, JF ;
Ameodo, A .
PHYSICAL REVIEW LETTERS, 2001, 86 (11) :2471-2474
[5]   Generalised information and entropy measures in physics [J].
Beck, Christian .
CONTEMPORARY PHYSICS, 2009, 50 (04) :495-510
[6]   Universal Internucleotide Statistics in Full Genomes: A Footprint of the DNA Structure and Packaging? [J].
Bogachev, Mikhail I. ;
Kayumov, Airat R. ;
Bunde, Armin .
PLOS ONE, 2014, 9 (12)
[7]   Information Measure for Long-Range Correlated Sequences: the Case of the 24 Human Chromosomes [J].
Carbone, A. .
SCIENTIFIC REPORTS, 2013, 3
[8]   NON-GAUSSIAN STATISTICS AND STELLAR ROTATIONAL VELOCITIES OF MAIN-SEQUENCE FIELD STARS [J].
Carvalho, J. C. ;
do Nascimento, J. D., Jr. ;
Silva, R. ;
De Medeiros, J. R. .
ASTROPHYSICAL JOURNAL LETTERS, 2009, 696 (01) :L48-L51
[9]   The κ-generalized distribution:: A new descriptive model for the size distribution of incomes [J].
Clementi, F. ;
Di Matteo, T. ;
Gallegati, M. ;
Kaniadakis, G. .
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2008, 387 (13) :3201-3208
[10]   κ-Generalized statistics in personal income distribution [J].
Clementi, F. ;
Gallegati, M. ;
Kaniadakis, G. .
EUROPEAN PHYSICAL JOURNAL B, 2007, 57 (02) :187-193