Algorithms for sequence analysis via mutagenesis

被引:5
作者
Keith, JM [1 ]
Adams, P
Bryant, D
Cochran, DAE
Lala, GH
Mitchelson, KR
机构
[1] Univ Queensland, Dept Math, St Lucia, Qld 4072, Australia
[2] Univ Queensland, Australian Genome Res Facil, St Lucia, Qld 4072, Australia
[3] Univ Queensland, Inst Mol Biosci, St Lucia, Qld 4072, Australia
关键词
D O I
10.1093/bioinformatics/bth258
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Despite many successes of conventional DNA sequencing methods, some DNAs remain difficult or impossible to sequence. Unsequenceable regions occur in the genomes of many biologically important organisms, including the human genome. Such regions range in length from tens to millions of bases, and may contain valuable information such as the sequences of important genes. The authors have recently developed a technique that renders a wide range of problematic DNAs amenable to sequencing. The technique is known as sequence analysis via mutagenesis (SAM). This paper presents a number of algorithms for analysing and interpreting data generated by this technique. Results: The essential idea of SAM is to infer the target sequence using the sequences of mutants derived from the target. We describe three algorithms used in this process. The first algorithm predicts the number of mutants that will be required to infer the target sequence with a desired level of accuracy. The second algorithm infers the target sequence itself, using the mutant sequences. The third algorithm assigns quality values to each inferred base. The algorithms are illustrated using mutant sequences generated in the laboratory.
引用
收藏
页码:2401 / 2410
页数:10
相关论文
共 26 条
[1]   Comparison of the base pairing properties of a series of nitroazole nucleobase analogs in the oligodeoxyribonucleotide sequence 5'-d(CGCXAATTYGCG)-3' [J].
Bergstrom, DE ;
Zhang, PM ;
Johnson, WT .
NUCLEIC ACIDS RESEARCH, 1997, 25 (10) :1935-1942
[2]   Novel sulfoxides facilitate GC-rich template amplification [J].
Chakrabarti, R ;
Schutt, CE .
BIOTECHNIQUES, 2002, 32 (04) :866-+
[3]   INCORPORATION OF DITP OR 7-DEAZA DGTP DURING PCR IMPROVES SEQUENCING OF THE PRODUCT [J].
DIERICK, H ;
STUL, M ;
DEKELVER, W ;
MARYNEN, P ;
CASSIMAN, JJ .
NUCLEIC ACIDS RESEARCH, 1993, 21 (18) :4427-4428
[4]   Base-calling of automated sequencer traces using phred.: II.: Error probabilities [J].
Ewing, B ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :186-194
[5]   Base-calling of automated sequencer traces using phred.: I.: Accuracy assessment [J].
Ewing, B ;
Hillier, L ;
Wendl, MC ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :175-185
[6]  
Fernandez-Rachubinski F, 1990, DNA Seq, V1, P137, DOI 10.3109/10425179009016041
[7]   Sequence and analysis of chromosome 2 of Dictyostelium discoideum [J].
Glöckner, G ;
Eichinger, L ;
Szafranski, K ;
Pachebat, JA ;
Bankier, AT ;
Dear, PH ;
Lehmann, D ;
Baumgart, C ;
Parra, G ;
Abril, JF ;
Guigó, R ;
Kumpf, K ;
Tunggal, B ;
Cox, E ;
Quail, MA ;
Platzer, M ;
Rosenthal, A ;
Noegel, AA .
NATURE, 2002, 418 (6893) :79-85
[8]  
Hunt AR, 2003, GENOME MAPPING AND SEQUENCING, P315
[9]   STUDIES ON NUCLEIC-ACID INTERACTIONS .1. STABILITIES OF MINI-DUPLEXES (DG2A4XA4G2.DC2T4YT4C2) AND SELF-COMPLEMENTARY D(GGGAAXYTTCCC) CONTAINING DEOXYINOSINE AND OTHER MISMATCHED BASES [J].
KAWASE, Y ;
IWAI, S ;
INOUE, H ;
MIURA, K ;
OHTSUKA, E .
NUCLEIC ACIDS RESEARCH, 1986, 14 (19) :7727-7736
[10]   Unlocking hidden genomic sequence [J].
Keith, JM ;
Cochran, DAE ;
Lala, GH ;
Adams, P ;
Bryant, D ;
Mitchelson, KR .
NUCLEIC ACIDS RESEARCH, 2004, 32 (03) :e35