What is a gene, post-ENCODE? History and updated definition

Cited by: 380
Authors
Gerstein, Mark B. [1]
Bruce, Can
Rozowsky, Joel S.
Zheng, Deyou
Du, Jiang
Korbel, Jan O.
Emanuelsson, Olof
Zhang, Zhengdong D.
Weissman, Sherman
Snyder, Michael
Affiliations
[1] Yale Univ, Program Computat Biol & Bioinformat, New Haven, CT 06511 USA
[2] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT 06511 USA
[3] Yale Univ, Dept Comp Sci, New Haven, CT 06511 USA
[4] Yale Univ, Ctr Med Informat, New Haven, CT 06511 USA
[5] European Mol Biol Lab, D-69117 Heidelberg, Germany
[6] Stockholm Univ, Stockholm Bioinformat Ctr, Albanova Univ Ctr, SE-10691 Stockholm, Sweden
[7] Yale Univ, Dept Genet, New Haven, CT 06511 USA
[8] Yale Univ, Dept Mol Cellular & Dev Biol, New Haven, CT 06511 USA
DOI
10.1101/gr.6339607
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010; 081704;
Abstract
While sequencing of the human genome surprised us with how many protein-coding genes there are, it did not fundamentally change our perspective on what a gene is. In contrast, the complex patterns of dispersed regulation and pervasive transcription uncovered by the ENCODE project, together with non-genic conservation and the abundance of noncoding RNA genes, have challenged the notion of the gene. To illustrate this, we review the evolution of operational definitions of a gene over the past century, from the abstract elements of heredity of Mendel and Morgan to the present-day ORFs enumerated in the sequence databanks. We then summarize the current ENCODE findings and provide a computational metaphor for the complexity. Finally, we propose a tentative update to the definition of a gene: A gene is a union of genomic sequences encoding a coherent set of potentially overlapping functional products. Our definition sidesteps the complexities of regulation and transcription by removing the former altogether from the definition and arguing that final, functional gene products (rather than intermediate transcripts) should be used to group together entities associated with a single gene. It also manifests how integral the concept of biological function is in defining genes.
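The "union of genomic sequences" in the proposed definition can be pictured as a union of coordinate intervals. The following is only a minimal sketch under stated assumptions, not the authors' method: it assumes all products lie on one chromosome and one strand, uses half-open `[start, end)` coordinates, and the names `union_of_intervals`, `isoform_A`, and `isoform_B`, as well as all coordinates, are hypothetical.

```python
from typing import Dict, List, Tuple

# Assumption: half-open [start, end) coordinates on a single chromosome and strand.
Interval = Tuple[int, int]


def union_of_intervals(intervals: List[Interval]) -> List[Interval]:
    """Merge overlapping or touching intervals into a minimal sorted list."""
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps (or touches) the previous interval: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Hypothetical functional products whose encoding segments partly overlap.
products: Dict[str, List[Interval]] = {
    "isoform_A": [(100, 200), (300, 400)],
    "isoform_B": [(150, 250), (300, 350)],
}

# The illustrative "gene" is the union of all segments encoding these products.
gene = union_of_intervals(
    [segment for segments in products.values() for segment in segments]
)
print(gene)  # [(100, 250), (300, 400)]
```

The sketch leaves regulatory regions out, in line with the abstract's statement that regulation is removed from the definition; deciding which products form a "coherent set" is a biological judgment that the code does not attempt.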
Pages: 669-681
Number of pages: 13