DNA-sequence analysis using Markov chain models

被引:0
作者
Ryabko, Boris [1 ]
Usotskaya, Natalie [2 ]
机构
[1] Siberian Univ Telecommun Informat, Inst Computat Technol, RAS, Siberian Branch, Novosibirsk, Russia
[2] Novosibirsk State Univ, Novosibirsk, Russia
来源
2008 IEEE INFORMATION THEORY WORKSHOP | 2008年
基金
俄罗斯基础研究基金会;
关键词
D O I
10.1109/ITW.2008.4578634
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The statistical structure of DNA-sequences is of a great interest to molecular biology, genetics and the theory of evolution (see Chen and others, GIW-99, 1999, Aktulga and others, EURASIP J. of Bioinformatics and Systems Biology, 2007, Li, Computers and Chemistry, 1997). One of the approaches is a sequence modeling using Markov processes of different orders, and further statistical estimation of their parameters (see Simons and others, JSPI, 2005). In this paper we use firstly the test for the serial independence from Ryabko, Astola (Slat. Methodology, 2006) to estimate the "memory" (or connectivity) of genetic texts and secondly we apply the homogeneity test for solving the DNA-based problem connected to the phylogenetic system of various organisms.
引用
收藏
页码:119 / +
页数:2
相关论文
共 10 条
[1]   Identifying Statistical Dependence in Genomic Sequences via Mutual Information Estimates [J].
Aktulga, Hasan Metin ;
Kontoyiannis, Ioannis ;
Lyznik, L. Alex ;
Szpankowski, Lukasz ;
Grama, Ananth Y. ;
Szpankowski, Wojciech .
EURASIP JOURNAL ON BIOINFORMATICS AND SYSTEMS BIOLOGY, 2007, (01)
[2]  
Chen X., 1999, GENOME INFORM, V10, P51
[3]  
FARACH M, 1994, P 6 ANN ACM SIAM S D, P48
[4]  
Hagenauer J, 2004, 2004 IEEE INFORMATION THEORY WORKSHOP, PROCEEDINGS, P55
[5]  
KARP RM, 2002, NOT AM MATH SOC, V49, P544
[6]   The study of correlation structures of DNA sequences: a critical review [J].
Li, WT .
COMPUTERS & CHEMISTRY, 1997, 21 (04) :257-271
[7]  
OPREA I, 2004, LEONARDO ELECT J PRA, V3, P53
[8]  
RYABKO B, 2006, STAT METHODOL, V3, P375, DOI DOI 10.1016/J.STAMET.2005.10.004
[9]   Global Markov models for eukaryote nucleotide data [J].
Simons, G ;
Yao, YC ;
Morton, G .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2005, 130 (1-2) :251-275
[10]   A SEQUENTIAL ALGORITHM FOR THE UNIVERSAL CODING OF FINITE MEMORY SOURCES [J].
WEINBERGER, MJ ;
LEMPEL, A ;
ZIV, J .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1992, 38 (03) :1002-1014