Augmented Annotation of the Schizosaccharomyces pombe Genome Reveals Additional Genes Required for Growth and Viability

Cited by: 23
Authors
Bitton, Danny A. [1]
Wood, Valerie [4]
Scutt, Paul J. [1]
Grallert, Agnes [2]
Yates, Tim [1]
Smith, Duncan L. [3]
Hagan, Iain M. [2]
Miller, Crispin J. [1]
Affiliations
[1] Univ Manchester, Canc Res UK Appl Computat Biol & Bioinformat Grp, Manchester M20 4BX, Lancs, England
[2] Univ Manchester, Canc Res UK Cell Div Grp, Manchester M20 4BX, Lancs, England
[3] Univ Manchester, Biol Mass Spectrometry Facil, Canc Res UK, Paterson Inst Canc Res, Manchester M20 4BX, Lancs, England
[4] London Res Inst, Canc Res UK, London WC2A 3PX, England
Funding
Wellcome Trust (UK);
Keywords
FALSE DISCOVERY RATES; FISSION YEAST; MASS-SPECTROMETRY; CELL-DIVISION; DATABASE; TRANSCRIPTOME; SPORULATION; GENERATION; INHIBITOR; PEPTIDES;
DOI
10.1534/genetics.110.123497
Chinese Library Classification
Q3 [Genetics];
Discipline classification codes
071007 ; 090102 ;
Abstract
Genome annotation is a synthesis of computational prediction and experimental evidence. Small genes are notoriously difficult to detect because the patterns used to identify them are often indistinguishable from chance occurrences, leading to an arbitrary cutoff threshold for the length of a protein-coding gene identified solely by in silico analysis. We report a systematic reappraisal of the Schizosaccharomyces pombe genome that ignores thresholds. A complete six-frame translation was compared to a proteome data set, the Pfam domain database, and the genomes of six other fungi. Thirty-nine novel loci were identified. RT-PCR and RNA-Seq confirmed transcription at 38 loci; 33 novel gene structures were delineated by 5′ and 3′ RACE. Expression levels of 14 transcripts fluctuated during meiosis. Translational evidence for 10 genes, evolutionary conservation data supporting 35 predictions, and distinct phenotypes upon ORF deletion (one essential, four slow-growth, two delayed-division phenotypes) suggest that all 39 predictions encode functional proteins. The popularity of S. pombe as a model organism suggests that this augmented annotation will be of interest in diverse areas of molecular and cellular biology, while the generality of the approach suggests widespread applicability to other genomes.
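The six-frame translation mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names (`six_frame_translation`, `open_reading_frames`) and the `min_length` parameter are hypothetical, and the standard nuclear genetic code is assumed. It translates both strands in all three reading frames and keeps every stop-free stretch, with no length cutoff by default.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons only; unknown codons (e.g. with N) become 'X'."""
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def six_frame_translation(dna):
    """Map frame labels '+1'..'+3' and '-1'..'-3' to protein strings."""
    dna = dna.upper()
    frames = {}
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(seq[offset:])
    return frames


def open_reading_frames(dna, min_length=1):
    """Yield (frame, peptide) for each stop-free stretch of at least min_length residues."""
    for frame, protein in six_frame_translation(dna).items():
        for peptide in protein.split("*"):
            if len(peptide) >= min_length:
                yield frame, peptide
```

Calling `list(open_reading_frames(sequence))` with the default `min_length=1` keeps even very short candidates, which mirrors the idea of ignoring length thresholds; deciding which candidates are real would still depend on external evidence of the kind the abstract lists.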
Pages: 1207 / U369
Page count: 24
Related papers
39 in total
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] Bähler J, 1998, YEAST, V14, P943, DOI 10.1002/(SICI)1097-0061(199807)14:10<943::AID-YEA292>3.0.CO;2-Y
  • [4] TATA BOX MUTATIONS IN THE SCHIZOSACCHAROMYCES-POMBE NMT-1 PROMOTER AFFECT TRANSCRIPTION EFFICIENCY BUT NOT THE TRANSCRIPTION START POINT OR THIAMINE REPRESSIBILITY
    BASI, G
    SCHMID, E
    MAUNDRELL, K
    [J]. GENE, 1993, 123 (01) : 131 - 136
  • [5] An Integrated Mass-Spectrometry Pipeline Identifies Novel Protein Coding-Regions in the Human Genome
    Bitton, Danny A.
    Smith, Duncan L.
    Connolly, Yvonne
    Scutt, Paul J.
    Miller, Crispin J.
    [J]. PLOS ONE, 2010, 5 (01):
  • [6] Discovery and revision of Arabidopsis genes by proteogenomics
    Castellana, Natalie E.
    Payne, Samuel H.
    Shen, Zhouxin
    Stanke, Mario
    Bafna, Vineet
    Briggs, Steven P.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (52) : 21034 - 21038
  • [7] Dynamic transcriptome of Schizosaccharomyces pombe shown by RNA-DNA hybrid mapping
    Dutrow, Natalie
    Nix, David A.
    Holt, Derick
    Milash, Brett
    Dalley, Brian
    Westbroek, Erick
    Parnell, Timothy J.
    Cairns, Bradley R.
    [J]. NATURE GENETICS, 2008, 40 (08) : 977 - 986
  • [8] Novel gene and gene model detection using a whole genome open reading frame analysis in proteomics
    Fermin, Damian
    Allen, Baxter B.
    Blackwell, Thomas W.
    Menon, Rajasree
    Adamski, Marcin
    Xu, Yin
    Ulintz, Peter
    Omenn, Gilbert S.
    States, David J.
    [J]. GENOME BIOLOGY, 2006, 7 (04)
  • [9] The Pfam protein families database
    Finn, Robert D.
    Mistry, Jaina
    Tate, John
    Coggill, Penny
    Heger, Andreas
    Pollington, Joanne E.
    Gavin, O. Luke
    Gunasekaran, Prasad
    Ceric, Goran
    Forslund, Kristoffer
    Holm, Liisa
    Sonnhammer, Erik L. L.
    Eddy, Sean R.
    Bateman, Alex
    [J]. NUCLEIC ACIDS RESEARCH, 2010, 38 : D211 - D222
  • [10] Saccharomyces cerevisiae S288C genome annotation: a working hypothesis
    Fisk, Dianna G.
    Ball, Catherine A.
    Dolinski, Kara
    Engel, Stacia R.
    Hong, Eurie L.
    Issel-Tarver, Laurie
    Schwartz, Katja
    Sethuraman, Anand
    Botstein, David
    Cherry, J. Michael
    [J]. YEAST, 2006, 23 (12) : 857 - 865