DNAfan: a software tool for automated extraction and analysis of user-defined sequence regions

被引:5
作者
Gisel, A
Panetta, M
Grillo, G
Licciulli, VF
Liuni, S
Saccone, C
Pesole, G
机构
[1] Univ Milan, Dipartimento Sci Biomol & Biotechnol, I-20133 Milan, Italy
[2] CNR, Sez Bioinformat & Genom Bari, Ist Tecnol Biomed, I-70126 Bari, Italy
[3] Univ Bari, Dipartimento Biochim & Biol Mol, I-70125 Bari, Italy
关键词
D O I
10.1093/bioinformatics/bth420
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
DNAfan (DNA Feature ANalyzer) is a tool combining sequence-filtering and pattern searching. DNAfan automatically extracts user-defined sets of sequence fragments from large sequence sets. Fragments are defined by annotated gene feature keys and co- or non-occurring patterns within the feature or close to it. A gene feature parser and a pattern-based filter tool localizes and extracts the specific subset of sequences. The selected sequence data can subsequently be retrieved for analyses or further processed with DNAfan to find the occurrence of specific patterns or structural motifs. DNAfan is a powerful tool for pattern analysis. Its filter features restricts the pattern search to a well-defined set of sequences, allowing drastic reduction in false positive hits.
引用
收藏
页码:3676 / 3679
页数:4
相关论文
共 10 条
[1]   Visualizing the competitive recognition of TATA-boxes in vertebrate promoters [J].
Audic, S ;
Claverie, JM .
TRENDS IN GENETICS, 1998, 14 (01) :10-11
[2]   Ensembl 2004 [J].
Birney, E ;
Andrews, D ;
Bevan, P ;
Caccamo, M ;
Cameron, G ;
Chen, Y ;
Clarke, L ;
Coates, G ;
Cox, T ;
Cuff, J ;
Curwen, V ;
Cutts, T ;
Down, T ;
Durbin, R ;
Eyras, E ;
Fernandez-Suarez, XM ;
Gane, P ;
Gibbins, B ;
Gilbert, J ;
Hammond, M ;
Hotz, H ;
Iyer, V ;
Kahari, A ;
Jekosch, K ;
Kasprzyk, A ;
Keefe, D ;
Keenan, S ;
Lehvaslaiho, H ;
McVicker, G ;
Melsopp, C ;
Meidl, P ;
Mongin, E ;
Pettett, R ;
Potter, S ;
Proctor, G ;
Rae, M ;
Searle, S ;
Slater, G ;
Smedley, D ;
Smith, J ;
Spooner, W ;
Stabenau, A ;
Stalker, J ;
Storey, R ;
Ureta-Vidal, A ;
Woodwark, C ;
Clamp, M ;
Hubbard, T .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D468-D470
[3]   WEIGHT MATRIX DESCRIPTIONS OF 4 EUKARYOTIC RNA POLYMERASE-II PROMOTER ELEMENTS DERIVED FROM 502 UNRELATED PROMOTER SEQUENCES [J].
BUCHER, P .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 212 (04) :563-578
[4]   PatSearch:: a program for the detection of patterns and structural motifs in nucleotide sequences [J].
Grillo, G ;
Licciulli, F ;
Liuni, S ;
Sbisà, E ;
Pesole, G .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3608-3612
[5]   EnsMart: A generic system for fast and flexible access to biological data [J].
Kasprzyk, A ;
Keefe, D ;
Smedley, D ;
London, D ;
Spooner, W ;
Melsopp, C ;
Hammond, M ;
Rocca-Serra, P ;
Cox, T ;
Birney, E .
GENOME RESEARCH, 2004, 14 (01) :160-169
[6]   The EMBL nucleotide sequence database [J].
Kulikova, T ;
Aldebert, P ;
Althorpe, N ;
Baker, W ;
Bates, K ;
Browne, P ;
van den Broek, A ;
Cochrane, G ;
Duggan, K ;
Eberhardt, R ;
Faruque, N ;
Garcia-Pastor, M ;
Harte, N ;
Kanz, C ;
Leinonen, R ;
Lin, Q ;
Lombard, V ;
Lopez, R ;
Mancuso, R ;
McHale, M ;
Nardone, F ;
Silventoinen, V ;
Stoehr, P ;
Stoesser, G ;
Tuli, MA ;
Tzouvara, K ;
Vaughan, R ;
Wu, D ;
Zhu, WM ;
Apweiler, R .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D27-D30
[7]   The downstream promoter element DPE appears to be as widely used as the TATA box in Drosophila core promoters [J].
Kutach, AK ;
Kadonaga, JT .
MOLECULAR AND CELLULAR BIOLOGY, 2000, 20 (13) :4754-4764
[8]   Untranslated regions of mRNAs [J].
Mignone, Flavio ;
Gissi, Carmela ;
Liuni, Sabino ;
Pesole, Graziano .
GENOME BIOLOGY, 2002, 3 (03)
[9]   Position specific variation in the rate of evolution in transcription factor binding sites [J].
Moses, AM ;
Chiang, DY ;
Kellis, M ;
Lander, ES ;
Eisen, MB .
BMC EVOLUTIONARY BIOLOGY, 2003, 3 (1)
[10]   Transcription regulation of human chemokine receptor CCR3:: Evidence for a rare TATA-less promoter structure conserved between Drosophila and humans [J].
Vijh, S ;
Dayhoff, DE ;
Wang, CE ;
Imam, Z ;
Ehrenberg, PK ;
Michael, NL .
GENOMICS, 2002, 80 (01) :86-95