Approaches to comparative sequence analysis: towards a functional view of vertebrate genomes

Cited by: 46
Authors
Margulies, Elliott H. [1]
Birney, Ewan [2]
Affiliations
[1] NHGRI, Genome Technol Branch, Genome Informat Sect, NIH, Bethesda, MD 20892 USA
[2] European Bioinformat Inst, Cambridge CB10 1SD, England
DOI
10.1038/nrg2185
Chinese Library Classification (CLC) number
Q3 [Genetics]
Subject classification code
071007; 090102
Abstract
The comparison of genomic sequences is now a common approach to identifying and characterizing functional regions in vertebrate genomes. However, for theoretical reasons and because of practical issues, the generation of these data sets is non-trivial and can have many pitfalls. We are currently seeing an explosion of comparative sequence data, the benefits and limitations of which need to be disseminated to the scientific community. This Review provides a critical overview of the different types of sequence data that are available for analysis and of contemporary comparative sequence analysis methods, highlighting both their strengths and limitations. Approaches to determining the biological significance of constrained sequence are also explored.
Pages: 303-313
Page count: 11
Related papers
65 in total
  • [11] Finishing the euchromatic sequence of the human genome
    Collins, FS
    Lander, ES
    Rogers, J
    Waterston, RH
    [J]. NATURE, 2004, 431 (7011) : 931 - 945
  • [12] Distribution and intensity of constraint in mammalian genomic sequence
    Cooper, GM
    Stone, EA
    Asimenos, G
    Green, ED
    Batzoglou, S
    Sidow, A
    [J]. GENOME RESEARCH, 2005, 15 (07) : 901 - 913
  • [13] Fast algorithms for large-scale genome alignment and comparison
    Delcher, AL
    Phillippy, A
    Carlton, J
    Salzberg, SL
    [J]. NUCLEIC ACIDS RESEARCH, 2002, 30 (11) : 2478 - 2483
  • [14] Dewey Colin N., 2007, V395, P221
  • [15] ProbCons: Probabilistic consistency-based multiple sequence alignment
    Do, CB
    Mahabhashyam, MSP
    Brudno, M
    Batzoglou, S
    [J]. GENOME RESEARCH, 2005, 15 (02) : 330 - 340
  • [16] Durbin R., 1998, Biological sequence analysis: Probabilistic models of proteins and nucleic acids
  • [17] A model of the statistical power of comparative genome sequence analysis
    Eddy, SR
    [J]. PLOS BIOLOGY, 2005, 3 (01) : 95 - 102
  • [18] Conservation of RET regulatory function from human to zebrafish without sequence similarity
    Fisher, S
    Grice, EA
    Vinton, RM
    Bessling, SL
    McCallion, AS
    [J]. SCIENCE, 2006, 312 (5771) : 276 - 279
  • [19] Ensembl 2008
    Flicek, P.
    Aken, B. L.
    Beal, K.
    Ballester, B.
    Caccamo, M.
    Chen, Y.
    Clarke, L.
    Coates, G.
    Cunningham, F.
    Cutts, T.
    Down, T.
    Dyer, S. C.
    Eyre, T.
    Fitzgerald, S.
    Fernandez-Banet, J.
    Graf, S.
    Haider, S.
    Hammond, M.
    Holland, R.
    Howe, K. L.
    Howe, K.
    Johnson, N.
    Jenkinson, A.
    Kahari, A.
    Keefe, D.
    Kokocinski, F.
    Kulesha, E.
    Lawson, D.
    Longden, I.
    Megy, K.
    Meidl, P.
    Overduin, B.
    Parker, A.
    Pritchard, B.
    Prlic, A.
    Rice, S.
    Rios, D.
    Schuster, M.
    Sealy, I.
    Slater, G.
    Smedley, D.
    Spudich, G.
    Trevanion, S.
    Vilella, A. J.
    Vogel, J.
    White, S.
    Wood, M.
    Birney, E.
    Cox, T.
    Curwen, V.
    [J]. NUCLEIC ACIDS RESEARCH, 2008, 36 : D707 - D714
  • [20] Galaxy: A platform for interactive large-scale genome analysis
    Giardine, B
    Riemer, C
    Hardison, RC
    Burhans, R
    Elnitski, L
    Shah, P
    Zhang, Y
    Blankenberg, D
    Albert, I
    Taylor, J
    Miller, W
    Kent, WJ
    Nekrutenko, A
    [J]. GENOME RESEARCH, 2005, 15 (10) : 1451 - 1455