A NEW METHOD FOR FINDING LONG CONSENSUS PATTERNS IN NUCLEIC-ACID SEQUENCES

被引:0
作者
TAYLOR, P
ROSENBERG, P
SAMSONOVA, MG
机构
[1] LENINGRAD STATE UNIV,DEPT GENET,LENINGRAD 199034,USSR
[2] UNIV GLASGOW,COMP SERV,GLASGOW G12 8QQ,SCOTLAND
来源
COMPUTER APPLICATIONS IN THE BIOSCIENCES | 1991年 / 7卷 / 04期
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We describe a fast computer algorithm for identifying consensus patterns in DNA sequences. The method requires no prior assumptions about the consensus pattern other than its length. In particular no previous knowledge of the frequency or spacing of consensus patterns is required. However, a priori information about the shape of the consensus pattern, or invariability of individual positions, or the overall conservation level, can be utilized to enhance the selectivity and sensitivity of search. As the number of all possible consensus words increases very rapidly with length, comprehensive searches have usually been restricted to a maximum of 10-12 nucleotides, even when large mainframes are used. Our algorithm enables searching for consensus patterns of this order on current mid-range and powerful microcomputers. Searches may be conducted on single, long sequences or a set of possibly aligned shorter sequences. We give examples of identified consensus patterns in both prokaryotic and eukaryotic DNA sequences, along with some typical program timings.
引用
收藏
页码:495 / 500
页数:6
相关论文
共 46 条
[1]   2 GENES FOR RIBOSOMAL PROTEIN-51 OF SACCHAROMYCES-CEREVISIAE COMPLEMENT AND CONTRIBUTE TO THE RIBOSOMES [J].
ABOVICH, N ;
ROSBASH, M .
MOLECULAR AND CELLULAR BIOLOGY, 1984, 4 (09) :1871-1879
[2]   GCN4 PROTEIN, A POSITIVE TRANSCRIPTION FACTOR IN YEAST, BINDS GENERAL CONTROL PROMOTERS AT ALL 5' TGACTC 3' SEQUENCES [J].
ARNDT, K ;
FINK, GR .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1986, 83 (22) :8516-8520
[3]   THE SEQUENCE OF THE DNAS CODING FOR THE MATING-TYPE LOCI OF SACCHAROMYCES-CEREVISIAE [J].
ASTELL, CR ;
AHLSTROMJONASSON, L ;
SMITH, M ;
TATCHELL, K ;
NASMYTH, KA ;
HALL, BD .
CELL, 1981, 27 (01) :15-23
[4]   MULTIPLE SEQUENCE ALIGNMENT [J].
BACON, DJ ;
ANDERSON, WF .
JOURNAL OF MOLECULAR BIOLOGY, 1986, 191 (02) :153-161
[5]  
DABEVA MD, 1987, J BIOL CHEM, V262, P16055
[6]   THE COMPLETE DNA-SEQUENCE OF VARICELLA-ZOSTER VIRUS [J].
DAVISON, AJ ;
SCOTT, JE .
JOURNAL OF GENERAL VIROLOGY, 1986, 67 :1759-1816
[7]   NUMERICAL-METHODS FOR INFERRING EVOLUTIONARY TREES [J].
FELSENSTEIN, J .
QUARTERLY REVIEW OF BIOLOGY, 1982, 57 (04) :379-404
[8]   RIGOROUS PATTERN-RECOGNITION METHODS FOR DNA-SEQUENCES - ANALYSIS OF PROMOTER SEQUENCES FROM ESCHERICHIA-COLI [J].
GALAS, DJ ;
EGGERT, M ;
WATERMAN, MS .
JOURNAL OF MOLECULAR BIOLOGY, 1985, 186 (01) :117-128
[9]   STRUCTURE OF A SPLIT YEAST GENE - COMPLETE NUCLEOTIDE-SEQUENCE OF THE ACTIN GENE IN SACCHAROMYCES-CEREVISIAE [J].
GALLWITZ, D ;
SURES, I .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA-BIOLOGICAL SCIENCES, 1980, 77 (05) :2546-2550
[10]   CONSISTENCY OF OPTIMAL SEQUENCE ALIGNMENTS [J].
GOTOH, O .
BULLETIN OF MATHEMATICAL BIOLOGY, 1990, 52 (04) :509-525