Regression trees for analysis of mutational spectra in nucleotide sequences

被引:12
作者
Berikov, VB [1 ]
Rogozin, IB
机构
[1] Russian Acad Sci, Inst Math, Siberian Branch, Novosibirsk 630090, Russia
[2] Russian Acad Sci, Inst Cytol & Genet, Siberian Branch, Novosibirsk 630090, Russia
[3] Penn State Univ, Inst Mol Evolutionary Genet, University Pk, PA 16802 USA
[4] Penn State Univ, Dept Biol, University Pk, PA 16802 USA
关键词
D O I
10.1093/bioinformatics/15.7.553
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The study and comparison of mutational spectra is an important problem in molecular biology, because these spectra often reveal important features of the action of various mutagens and the functioning of repair/replication enzymes. As is known, mutability varies significantly along nucleotide sequences: mutations often concentrate at certain positions in a sequence, otherwise termed 'hotspots'. Results: Herein, we propose a regression analysis method based on the use of regression trees in order to analyse the influence of nucleotide context on the occurrence of such hotspots. The REGRT programme developed has been tested on simulated and real mutational spectra. For the G:C-->T:A mutational spectra induced by Sn1 alkylating agents (nine spectra), the prediction accuracy was 0.99. Availability: The REGRT programme is available upon request from V.Berikov. Contact: berikov@ math.nsc.ru.
引用
收藏
页码:553 / 562
页数:10
相关论文
共 28 条
[1]  
BABENKO VN, 1999, IN PRESS BIOFIZIKA
[3]   PASSENGER TRANSGENES REVEAL INTRINSIC SPECIFICITY OF THE ANTIBODY HYPERMUTATION MECHANISM - CLUSTERING, POLARITY, AND SPECIFIC HOT-SPOTS [J].
BETZ, AG ;
RADA, C ;
PANNELL, R ;
MILSTEIN, C ;
NEUBERGER, MS .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1993, 90 (06) :2385-2388
[4]  
BOULIKAS T, 1992, J MOL EVOL, V35, P156, DOI 10.1007/BF00183227
[5]  
BROWN BW, 1993, LIB C ROUTINES RANDO
[6]  
BURNS PA, 1988, CANCER RES, V48, P4455
[7]   INFLUENCE OF NEIGHBORING BASE SEQUENCE ON N-METHYL-N'-NITRO-N-NITROSOGUANIDINE MUTAGENESIS IN THE LACI GENE OF ESCHERICHIA-COLI [J].
BURNS, PA ;
GORDON, AJE ;
GLICKMAN, BW .
JOURNAL OF MOLECULAR BIOLOGY, 1987, 194 (03) :385-390
[8]   NOMENCLATURE FOR INCOMPLETELY SPECIFIED BASES IN NUCLEIC-ACID SEQUENCES - RECOMMENDATIONS 1984 [J].
CORNISHBOWDEN, A .
NUCLEIC ACIDS RESEARCH, 1985, 13 (09) :3021-3030
[9]   MOLECULAR-BASIS OF BASE SUBSTITUTION HOTSPOTS IN ESCHERICHIA-COLI [J].
COULONDRE, C ;
MILLER, JH ;
FARABAUGH, PJ ;
GILBERT, W .
NATURE, 1978, 274 (5673) :775-780
[10]   The logic of inductive inference [J].
Fisher, RA .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY, 1935, 98 :39-82