Approaches to comparative sequence analysis: towards a functional view of vertebrate genomes

被引:46
作者
Margulies, Elliott H. [1 ]
Birney, Ewan [2 ]
机构
[1] NHGRI, Genome Technol Branch, Genome Informat Sect, NIH, Bethesda, MD 20892 USA
[2] European Bioinformat Inst, Cambridge CB10 1SD, England
关键词
D O I
10.1038/nrg2185
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The comparison of genomic sequences is now a common approach to identifying and characterizing functional regions in vertebrate genomes. However, for theoretical reasons and because of practical issues, the generation of these data sets is non-trivial and can have many pitfalls. We are currently seeing an explosion of comparative sequence data, the benefits and limitations of which need to be disseminated to the scientific community. This Review provides a critical overview of the different types of sequence data that are available for analysis and of contemporary comparative sequence analysis methods, highlighting both their strengths and limitations. Approaches to determining the biological significance of constrained sequence are also explored.
引用
收藏
页码:303 / 313
页数:11
相关论文
共 65 条
  • [1] Whole-genome re-sequencing
    Bentley, David R.
    [J]. CURRENT OPINION IN GENETICS & DEVELOPMENT, 2006, 16 (06) : 545 - 552
  • [2] Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project
    Birney, Ewan
    Stamatoyannopoulos, John A.
    Dutta, Anindya
    Guigo, Roderic
    Gingeras, Thomas R.
    Margulies, Elliott H.
    Weng, Zhiping
    Snyder, Michael
    Dermitzakis, Emmanouil T.
    Stamatoyannopoulos, John A.
    Thurman, Robert E.
    Kuehn, Michael S.
    Taylor, Christopher M.
    Neph, Shane
    Koch, Christoph M.
    Asthana, Saurabh
    Malhotra, Ankit
    Adzhubei, Ivan
    Greenbaum, Jason A.
    Andrews, Robert M.
    Flicek, Paul
    Boyle, Patrick J.
    Cao, Hua
    Carter, Nigel P.
    Clelland, Gayle K.
    Davis, Sean
    Day, Nathan
    Dhami, Pawandeep
    Dillon, Shane C.
    Dorschner, Michael O.
    Fiegler, Heike
    Giresi, Paul G.
    Goldy, Jeff
    Hawrylycz, Michael
    Haydock, Andrew
    Humbert, Richard
    James, Keith D.
    Johnson, Brett E.
    Johnson, Ericka M.
    Frum, Tristan T.
    Rosenzweig, Elizabeth R.
    Karnani, Neerja
    Lee, Kirsten
    Lefebvre, Gregory C.
    Navas, Patrick A.
    Neri, Fidencio
    Parker, Stephen C. J.
    Sabo, Peter J.
    Sandstrom, Richard
    Shafer, Anthony
    [J]. NATURE, 2007, 447 (7146) : 799 - 816
  • [3] An intermediate grade of finished genomic sequence suitable for comparative analyses
    Blakesley, RW
    Hansen, NF
    Mullikin, JC
    Thomas, PJ
    McDowell, JC
    Maskeri, B
    Young, AC
    Benjamin, B
    Brooks, SY
    Coleman, BI
    Gupta, J
    Ho, SL
    Karlins, EM
    Maduro, QL
    Stantripop, S
    Tsurgeon, C
    Vogt, JL
    Walker, MA
    Masiello, CA
    Guan, XB
    Bouffared, GG
    Green, ED
    [J]. GENOME RESEARCH, 2004, 14 (11) : 2235 - 2244
  • [4] Aligning multiple genomic sequences with the threaded blockset aligner
    Blanchette, M
    Kent, WJ
    Riemer, C
    Elnitski, L
    Smit, AFA
    Roskin, KM
    Baertsch, R
    Rosenbloom, K
    Clawson, H
    Green, ED
    Haussler, D
    Miller, W
    [J]. GENOME RESEARCH, 2004, 14 (04) : 708 - 715
  • [5] Discovery of regulatory elements by a computational method for phylogenetic footprinting
    Blanchette, M
    Tompa, M
    [J]. GENOME RESEARCH, 2002, 12 (05) : 739 - 748
  • [6] A framework for collaborative analysis of ENCODE data: Making large-scale analyses biologist-friendly
    Blankenberg, Daniel
    Taylor, James
    Schenck, Ian
    He, Jianbin
    Zhang, Yi
    Ghent, Matthew
    Veeraraghavan, Narayanan
    Albert, Istvan
    Miller, Webb
    Makova, Kateryna D.
    Hardison, Ross C.
    Nekrutenko, Anton
    [J]. GENOME RESEARCH, 2007, 17 (06) : 960 - 964
  • [7] Phylogenetic shadowing of primate sequences to find functional regions of the human genome
    Boffelli, D
    McAuliffe, J
    Ovcharenko, D
    Lewis, KD
    Ovcharenko, I
    Pachter, L
    Rubin, EM
    [J]. SCIENCE, 2003, 299 (5611) : 1391 - 1394
  • [8] MAVID: Constrained ancestral alignment of multiple sequences
    Bray, N
    Pachter, L
    [J]. GENOME RESEARCH, 2004, 14 (04) : 693 - 699
  • [9] LAGAN and Multi-LAGAN: Efficient tools for large-scale multiple alignment of genomic DNA
    Brudno, M
    Do, CB
    Cooper, GM
    Kim, MF
    Davydov, E
    Green, ED
    Sidow, A
    Batzoglou, S
    [J]. GENOME RESEARCH, 2003, 13 (04) : 721 - 731
  • [10] Evolution of genes and genomes on the Drosophila phylogeny
    Clark, Andrew G.
    Eisen, Michael B.
    Smith, Douglas R.
    Bergman, Casey M.
    Oliver, Brian
    Markow, Therese A.
    Kaufman, Thomas C.
    Kellis, Manolis
    Gelbart, William
    Iyer, Venky N.
    Pollard, Daniel A.
    Sackton, Timothy B.
    Larracuente, Amanda M.
    Singh, Nadia D.
    Abad, Jose P.
    Abt, Dawn N.
    Adryan, Boris
    Aguade, Montserrat
    Akashi, Hiroshi
    Anderson, Wyatt W.
    Aquadro, Charles F.
    Ardell, David H.
    Arguello, Roman
    Artieri, Carlo G.
    Barbash, Daniel A.
    Barker, Daniel
    Barsanti, Paolo
    Batterham, Phil
    Batzoglou, Serafim
    Begun, Dave
    Bhutkar, Arjun
    Blanco, Enrico
    Bosak, Stephanie A.
    Bradley, Robert K.
    Brand, Adrianne D.
    Brent, Michael R.
    Brooks, Angela N.
    Brown, Randall H.
    Butlin, Roger K.
    Caggese, Corrado
    Calvi, Brian R.
    de Carvalho, A. Bernardo
    Caspi, Anat
    Castrezana, Sergio
    Celniker, Susan E.
    Chang, Jean L.
    Chapple, Charles
    Chatterji, Sourav
    Chinwalla, Asif
    Civetta, Alberto
    [J]. NATURE, 2007, 450 (7167) : 203 - 218