BASE COMPOSITIONAL STRUCTURE OF GENOMES

Cited by: 108
Authors
FICKETT, JW [1 ]
TORNEY, DC [1 ]
WOLF, DR [1 ]
Affiliations
[1] LOS ALAMOS NATL LAB, COMPLEX SYST GRP, LOS ALAMOS, NM 87545
DOI
10.1016/0888-7543(92)90019-O
Chinese Library Classification number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification codes
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
We model the base compositional structure of the human and Escherichia coli genomes. Three particular properties are first quantified: (1) There is a significant tendency for any region of either genome to have a strand-symmetric base composition. (2) The variation in base composition from region to region, within each genome, is very much larger than expected from common homogeneous stochastic models. (3) A given local base composition tends to persist over a scale of at least kilobases (E. coli) or tens of kilobases (human). Multidomain stochastic models from the literature are reviewed and sharpened. In particular, quantitative measurements of the third property lead us to suggest a significant shift in the style of domain models, in which the variation of A + T content with position is modeled by a random walk with frequent small steps rather than with large quantum jumps. As an application, we suggest a way to reduce the amount of computation in the assembly of large sequences from sequences of randomly chosen fragments. © 1992.
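As a rough illustration of properties (1) to (3) and of the random-walk style of domain model described in the abstract, the sketch below measures windowed A + T content and strand asymmetry on a synthetic sequence. It is not the authors' method or code: the function names, the window size, the step size, and the clipping bounds are all assumptions chosen only to make the idea concrete.

```python
import random


def at_content(seq):
    """Fraction of A and T bases in seq (case-insensitive)."""
    seq = seq.upper()
    return (seq.count("A") + seq.count("T")) / len(seq)


def windowed_at(seq, window):
    """A + T fraction in consecutive non-overlapping windows."""
    return [at_content(seq[i:i + window])
            for i in range(0, len(seq) - window + 1, window)]


def strand_asymmetry(seq):
    """Return (|A - T|, |C - G|) as frequency differences on one strand.

    Values near zero indicate a strand-symmetric base composition.
    """
    seq = seq.upper()
    n = len(seq)
    return (abs(seq.count("A") - seq.count("T")) / n,
            abs(seq.count("C") - seq.count("G")) / n)


def simulate(n_windows, window, step, rng, p0=0.5, lo=0.2, hi=0.8):
    """Synthetic sequence whose A + T probability drifts by a small
    uniform step per window (hypothetical parameters).

    step=0.0 reduces to a homogeneous model with constant A + T content.
    """
    p = p0
    out = []
    for _ in range(n_windows):
        # Small random-walk step, clipped to stay within [lo, hi].
        p = min(hi, max(lo, p + rng.uniform(-step, step)))
        for _ in range(window):
            # Choosing A/T and C/G with equal odds keeps strands symmetric.
            out.append(rng.choice("AT") if rng.random() < p else rng.choice("CG"))
    return "".join(out)


def variance(xs):
    """Population variance of a list of numbers."""
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)


if __name__ == "__main__":
    rng = random.Random(0)
    homogeneous = simulate(200, 500, 0.0, rng)
    walk = simulate(200, 500, 0.05, rng)
    print("homogeneous variance:", variance(windowed_at(homogeneous, 500)))
    print("random-walk variance:", variance(windowed_at(walk, 500)))
    print("strand asymmetry (walk):", strand_asymmetry(walk))
```

Under these assumptions, the random-walk sequence should show much larger window-to-window variance in A + T content than the homogeneous one, while both remain close to strand-symmetric. Real genomic data would replace the `simulate` output.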
Pages: 1056-1064
Page count: 9