Genome reannotation of the lizard Anolis carolinensis based on 14 adult and embryonic deep transcriptomes

被引:48
作者
Eckalbar, Walter L. [1 ]
Hutchins, Elizabeth D. [1 ]
Markov, Glenn J. [1 ]
Allen, April N. [2 ]
Corneveaux, Jason J. [2 ]
Lindblad-Toh, Kerstin [3 ,4 ]
Di Palma, Federica [3 ]
Alfoeldi, Jessica [3 ]
Huentelman, Matthew J. [2 ]
Kusumi, Kenro [1 ,2 ]
机构
[1] Arizona State Univ, Sch Life Sci, Tempe, AZ 85287 USA
[2] Translat Genom Res Inst, Neurogen Div, Phoenix, AZ 85004 USA
[3] Broad Inst MIT & Harvard, Cambridge, MA 02142 USA
[4] Uppsala Univ, Dept Med Biochem & Microbiol, Sci Life Lab Uppsala, Uppsala, Sweden
基金
美国国家卫生研究院;
关键词
Annotation; Lizard; Anolis carolinensis; Transcriptome; Genome; RNA-Seq; Gene; Vertebrate; Embryo; Tissue-specific; FUNCTIONAL GENOMICS; CD-HIT; ANNOTATION; ALIGNMENT; GENE; CONVERGENCE; EVOLUTION; BLAST2GO; PROGRAM; SIGNALS;
D O I
10.1186/1471-2164-14-49
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: The green anole lizard, Anolis carolinensis, is a key species for both laboratory and field-based studies of evolutionary genetics, development, neurobiology, physiology, behavior, and ecology. As the first non-avian reptilian genome sequenced, A. carolinesis is also a prime reptilian model for comparison with other vertebrate genomes. The public databases of Ensembl and NCBI have provided a first generation gene annotation of the anole genome that relies primarily on sequence conservation with related species. A second generation annotation based on tissue-specific transcriptomes would provide a valuable resource for molecular studies. Results: Here we provide an annotation of the A. carolinensis genome based on de novo assembly of deep transcriptomes of 14 adult and embryonic tissues. This revised annotation describes 59,373 transcripts, compared to 16,533 and 18,939 currently for Ensembl and NCBI, and 22,962 predicted protein-coding genes. A key improvement in this revised annotation is coverage of untranslated region (UTR) sequences, with 79% and 59% of transcripts containing 5' and 3' UTRs, respectively. Gaps in genome sequence from the current A. carolinensis build (Anocar2.0) are highlighted by our identification of 16,542 unmapped transcripts, representing 6,695 orthologues, with less than 70% genomic coverage. Conclusions: Incorporation of tissue-specific transcriptome sequence into the A. carolinensis genome annotation has markedly improved its utility for comparative and functional studies. Increased UTR coverage allows for more accurate predicted protein sequence and regulatory analysis. This revised annotation also provides an atlas of gene expression specific to adult and embryonic tissues.
引用
收藏
页数:10
相关论文
共 51 条
[1]   The genome of the green anole lizard and a comparative analysis with birds and mammals [J].
Alfoeldi, Jessica ;
Di Palma, Federica ;
Grabherr, Manfred ;
Williams, Christina ;
Kong, Lesheng ;
Mauceli, Evan ;
Russell, Pamela ;
Lowe, Craig B. ;
Glor, Richard E. ;
Jaffe, Jacob D. ;
Ray, David A. ;
Boissinot, Stephane ;
Shedlock, Andrew M. ;
Botka, Christopher ;
Castoe, Todd A. ;
Colbourne, John K. ;
Fujita, Matthew K. ;
Moreno, Ricardo Godinez ;
ten Hallers, Boudewijn F. ;
Haussler, David ;
Heger, Andreas ;
Heiman, David ;
Janes, Daniel E. ;
Johnson, Jeremy ;
de Jong, Pieter J. ;
Koriabine, Maxim Y. ;
Lara, Marcia ;
Novick, Peter A. ;
Organ, Chris L. ;
Peach, Sally E. ;
Poe, Steven ;
Pollock, David D. ;
de Queiroz, Kevin ;
Sanger, Thomas ;
Searle, Steve ;
Smith, Jeremy D. ;
Smith, Zachary ;
Swofford, Ross ;
Turner-Maier, Jason ;
Wade, Juli ;
Young, Sarah ;
Zadissa, Amonida ;
Edwards, Scott V. ;
Glenn, Travis C. ;
Schneider, Christopher J. ;
Losos, Jonathan B. ;
Lander, Eric S. ;
Breen, Matthew ;
Ponting, Chris P. ;
Lindblad-Toh, Kerstin .
NATURE, 2011, 477 (7366) :587-591
[2]   De novo transcriptome assembly with ABySS [J].
Birol, Inanc ;
Jackman, Shaun D. ;
Nielsen, Cydney B. ;
Qian, Jenny Q. ;
Varhol, Richard ;
Stazyk, Greg ;
Morin, Ryan D. ;
Zhao, Yongjun ;
Hirst, Martin ;
Schein, Jacqueline E. ;
Horsman, Doug E. ;
Connors, Joseph M. ;
Gascoyne, Randy D. ;
Marra, Marco A. ;
Jones, Steven J. M. .
BIOINFORMATICS, 2009, 25 (21) :2872-2877
[3]   MAKER: An easy-to-use annotation pipeline designed for emerging model organism genomes [J].
Cantarel, Brandi L. ;
Korf, Ian ;
Robb, Sofia M. C. ;
Parra, Genis ;
Ross, Eric ;
Moore, Barry ;
Holt, Carson ;
Alvarado, Alejandro Sanchez ;
Yandell, Mark .
GENOME RESEARCH, 2008, 18 (01) :188-196
[4]   Sequencing the genome of the Burmese python']python (Python']Python molurus bivittatus) as a model for studying extreme adaptations in snakes [J].
Castoe, Todd A. ;
de Koning, A. P. Jason ;
Hall, Kathryn T. ;
Yokoyama, Ken D. ;
Gu, Wanjun ;
Smith, Eric N. ;
Feschotte, Cedric ;
Uetz, Peter ;
Ray, David A. ;
Dobry, Jason ;
Bogden, Robert ;
Mackessy, Stephen P. ;
Bronikowski, Anne M. ;
Warren, Wesley C. ;
Secor, Stephen M. ;
Pollock, David D. .
GENOME BIOLOGY, 2011, 12 (07)
[5]   Blast2GO:: a universal tool for annotation, visualization and analysis in functional genomics research [J].
Conesa, A ;
Götz, S ;
García-Gómez, JM ;
Terol, J ;
Talón, M ;
Robles, M .
BIOINFORMATICS, 2005, 21 (18) :3674-3676
[6]  
Conesa Ana, 2008, Int J Plant Genomics, V2008, P619832, DOI 10.1155/2008/619832
[7]   Somitogenesis in the anole lizard and alligator reveals evolutionary convergence and divergence in the amniote segmentation clock [J].
Eckalbar, Walter L. ;
Lasku, Eris ;
Infante, Carlos R. ;
Elsey, Ruth M. ;
Markov, Glenn J. ;
Allen, April N. ;
Corneveaux, Jason J. ;
Losos, Jonathan B. ;
DeNardo, Dale F. ;
Huentelman, Matthew J. ;
Wilson-Rawls, Jeanne ;
Rawls, Alan ;
Kusumi, Kenro .
DEVELOPMENTAL BIOLOGY, 2012, 363 (01) :308-319
[8]   Ensembl 2012 [J].
Flicek, Paul ;
Amode, M. Ridwan ;
Barrell, Daniel ;
Beal, Kathryn ;
Brent, Simon ;
Carvalho-Silva, Denise ;
Clapham, Peter ;
Coates, Guy ;
Fairley, Susan ;
Fitzgerald, Stephen ;
Gil, Laurent ;
Gordon, Leo ;
Hendrix, Maurice ;
Hourlier, Thibaut ;
Johnson, Nathan ;
Kaehaeri, Andreas K. ;
Keefe, Damian ;
Keenan, Stephen ;
Kinsella, Rhoda ;
Komorowska, Monika ;
Koscielny, Gautier ;
Kulesha, Eugene ;
Larsson, Pontus ;
Longden, Ian ;
McLaren, William ;
Muffato, Matthieu ;
Overduin, Bert ;
Pignatelli, Miguel ;
Pritchard, Bethan ;
Riat, Harpreet Singh ;
Ritchie, Graham R. S. ;
Ruffier, Magali ;
Schuster, Michael ;
Sobral, Daniel ;
Tang, Y. Amy ;
Taylor, Kieron ;
Trevanion, Stephen ;
Vandrovcova, Jana ;
White, Simon ;
Wilson, Mark ;
Wilder, Steven P. ;
Aken, Bronwen L. ;
Birney, Ewan ;
Cunningham, Fiona ;
Dunham, Ian ;
Durbin, Richard ;
Fernandez-Suarez, Xose M. ;
Harrow, Jennifer ;
Herrero, Javier ;
Hubbard, Tim J. P. .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D84-D90
[9]   The Anolis Lizard Genome: An Amniote Genome without Isochores [J].
Fujita, Matthew K. ;
Edwards, Scott V. ;
Ponting, Chris P. .
GENOME BIOLOGY AND EVOLUTION, 2011, 3 :974-984
[10]   High-throughput functional annotation and data mining with the Blast2GO suite [J].
Gotz, Stefan ;
Garcia-Gomez, Juan Miguel ;
Terol, Javier ;
Williams, Tim D. ;
Nagaraj, Shivashankar H. ;
Nueda, Maria Jose ;
Robles, Montserrat ;
Talon, Manuel ;
Dopazo, Joaquin ;
Conesa, Ana .
NUCLEIC ACIDS RESEARCH, 2008, 36 (10) :3420-3435