A STREAMLINED RANDOM SEQUENCING STRATEGY FOR FINDING CODING EXONS

被引:24
作者
CLAVERIE, JM
机构
[1] National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894
关键词
D O I
10.1006/geno.1994.1545
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The random (shotgun) DNA sequencing strategy is used for most large-scale sequencing projects, including the identification of human disease genes after positional cloning. The principle of the method-sequence assembly from overlap-requires the candidate gene region to be partitioned into 15- to 20-kb pieces (usually lambda inserts), themselves randomly subcloned into M13 prior to sequencing with a 6- to 8-fold redundancy. Most often, a time-consuming directed strategy must be invoked to close the remaining gaps. Ultimately, computer-based methods are invoked to locate putative coding exons within the finished genomic sequence. Given the small average size of vertebrate exons, I show here that they can be detected from the computer analysis of the individual runs, much before completion of contiguity. However, the successful assessment of coding potential from the raw data depends on a combination of new sequence masking techniques. When the identification of coding exons is the primary goal, the usual random sequencing strategy can thus be greatly optimized. The streamlined approach requires only a 2- to 2.5-fold sequencing re dundancy, can dispense with the subcloning in lambda and the closure of gaps, and can be fully automated. The feasibility of this strategy is demonstrated using data from the X-linked Kallmann syndrome gene region. (C) 1994 Academic Press, Inc.
引用
收藏
页码:575 / 581
页数:7
相关论文
共 39 条
  • [1] SEQUENCE IDENTIFICATION OF 2,375 HUMAN BRAIN GENES
    ADAMS, MD
    DUBNICK, M
    KERLAVAGE, AR
    MORENO, R
    KELLEY, JM
    UTTERBACK, TR
    NAGLE, JW
    FIELDS, C
    VENTER, JC
    [J]. NATURE, 1992, 355 (6361) : 632 - 634
  • [2] DNA-SEQUENCE AND EXPRESSION OF THE B95-8 EPSTEIN-BARR VIRUS GENOME
    BAER, R
    BANKIER, AT
    BIGGIN, MD
    DEININGER, PL
    FARRELL, PJ
    GIBSON, TJ
    HATFULL, G
    HUDSON, GS
    SATCHWELL, SC
    SEGUIN, C
    TUFFNELL, PS
    BARRELL, BG
    [J]. NATURE, 1984, 310 (5974) : 207 - 211
  • [3] THE RISE AND FALL OF POSITIONAL CLONING
    BALLABIO, A
    [J]. NATURE GENETICS, 1993, 3 (04) : 277 - 279
  • [4] Bankier A T, 1991, DNA Seq, V2, P1, DOI 10.3109/10425179109008433
  • [5] EXON AMPLIFICATION - A STRATEGY TO ISOLATE MAMMALIAN GENES BASED ON RNA SPLICING
    BUCKLER, AJ
    CHANG, DD
    GRAW, SL
    BROOK, JD
    HABER, DA
    SHARP, PA
    HOUSMAN, DE
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1991, 88 (09) : 4005 - 4009
  • [6] Genome sequence of the nematode C-elegans:: A platform for investigating biology
    不详
    [J]. SCIENCE, 1998, 282 (5396) : 2012 - 2018
  • [7] ORDERED SHOTGUN SEQUENCING, A STRATEGY FOR INTEGRATED MAPPING AND SEQUENCING OF YAC CLONES
    CHEN, EY
    SCHLESSINGER, D
    KERE, J
    [J]. GENOMICS, 1993, 17 (03) : 651 - 656
  • [8] CLAVERIE JM, 1990, METHOD ENZYMOL, V183, P237
  • [9] CLAVERIE JM, 1993, NATURE, V364, P19
  • [10] INFORMATION ENHANCEMENT METHODS FOR LARGE-SCALE SEQUENCE-ANALYSIS
    CLAVERIE, JM
    STATES, DJ
    [J]. COMPUTERS & CHEMISTRY, 1993, 17 (02): : 191 - 201