An alignment-free method to identify candidate orthologous enhancers in multiple Drosophila genomes

被引:17
作者
Arunachalam, Manonmani [1 ,2 ]
Jayasurya, Karthik [2 ]
Tomancak, Pavel [1 ]
Ohler, Uwe [2 ]
机构
[1] Max Planck Inst Mol Cell Biol & Genet, Dresden, Germany
[2] Duke Univ, Inst Genome Sci & Policy, Durham, NC USA
基金
美国国家卫生研究院;
关键词
CIS-REGULATORY MODULES; SEQUENCES; ELEMENTS; MOTIFS; EVOLUTION; DISCOVERY; GENES; CONSERVATION; MELANOGASTER; EXPRESSION;
D O I
10.1093/bioinformatics/btq358
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Evolutionarily conserved non-coding genomic sequences represent a potentially rich source for the discovery of gene regulatory region such as transcriptional enhancers. However, detecting orthologous enhancers using alignment-based methods in higher eukaryotic genomes is particularly challenging, as regulatory regions can undergo considerable sequence changes while maintaining their functionality. Results: We have developed an alignment-free method which identifies conserved enhancers in multiple diverged species. Our method is based on similarity metrics between two sequences based on the co-occurrence of sequence patterns regardless of their order and orientation, thus tolerating sequence changes observed in non-coding evolution. We show that our method is highly successful in detecting orthologous enhancers in distantly related species without requiring additional information such as knowledge about transcription factors involved, or predicted binding sites. By estimating the significance of similarity scores, we are able to discriminate experimentally validated functional enhancers from seemingly equally conserved candidates without function. We demonstrate the effectiveness of this approach on a wide range of enhancers in Drosophila, and also present encouraging results to detect conserved functional regions across large evolutionary distances. Our work provides encouraging steps on the way to oh initio unbiased enhancer prediction to complement ongoing experimental efforts.
引用
收藏
页码:2109 / 2115
页数:7
相关论文
共 32 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]  
ASHBURNER M, 1994, DEVELOPMENT, V120, P2077
[3]   Computational identification of developmental enhancers:: conservation and function of transcription factor binding-site clusters in Drosophila melanogaster and Drosophila pseudoobscura -: art. no. R61 [J].
Berman, BP ;
Pfeiffer, BD ;
Laverty, TR ;
Salzberg, SL ;
Rubin, GM ;
Eisen, MB ;
Celniker, SE .
GENOME BIOLOGY, 2004, 5 (09)
[4]   Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genome [J].
Berman, BP ;
Nibu, Y ;
Pfeiffer, BD ;
Tomancak, P ;
Celniker, SE ;
Levine, M ;
Rubin, GM ;
Eisen, MB .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (02) :757-762
[5]   Discovery of regulatory elements by a computational method for phylogenetic footprinting [J].
Blanchette, M ;
Tompa, M .
GENOME RESEARCH, 2002, 12 (05) :739-748
[6]   Using hexamers to predict cis-regulatory motifs in Drosophila [J].
Chan, BY ;
Kibler, D .
BMC BIOINFORMATICS, 2005, 6 (1)
[7]   Surveying Saccharomyces genomes to identify functional elements by comparative DNA sequence analysis [J].
Cliften, PF ;
Hillier, LW ;
Fulton, L ;
Graves, T ;
Miner, T ;
Gish, WR ;
Waterston, RH ;
Johnston, M .
GENOME RESEARCH, 2001, 11 (07) :1175-1186
[8]   Footer:: A quantitative comparative genomics method for efficient recognition of cis-regulatory elements [J].
Corcoran, DL ;
Feingold, E ;
Dominick, J ;
Wright, M ;
Harnaha, J ;
Trucco, M ;
Giannoukakis, N ;
Benos, PV .
GENOME RESEARCH, 2005, 15 (06) :840-847
[9]   Coordinate enhancers share common organizational features in the Drosophila genome [J].
Erives, A ;
Levine, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (11) :3851-3856
[10]   REDfly:: A regulatory element database for Drosophila [J].
Gallo, SM ;
Li, L ;
Hu, Z ;
Halfon, MS .
BIOINFORMATICS, 2006, 22 (03) :381-383