The status of the human gene catalogue

被引:68
作者
Amaral, Paulo [1 ]
Carbonell-Sala, Silvia [2 ]
De La Vega, Francisco M. [3 ,4 ]
Faial, Tiago [5 ]
Frankish, Adam [6 ]
Gingeras, Thomas [7 ]
Guigo, Roderic [8 ]
Harrow, Jennifer L. [9 ]
Hatzigeorgiou, Artemis G. [10 ,11 ]
Johnson, Rory [12 ,13 ,14 ,15 ]
Murphy, Terence D. [16 ]
Pertea, Mihaela [17 ,18 ]
Pruitt, Kim D. [16 ]
Pujar, Shashikant [16 ]
Takahashi, Hazuki [20 ]
Ulitsky, Igor [21 ,22 ]
Varabyou, Ales [17 ,19 ]
Wells, Christine A. [23 ]
Yandell, Mark [24 ]
Carninci, Piero [20 ,25 ]
Salzberg, Steven L. [17 ,18 ,19 ,26 ]
机构
[1] INSPER, Inst Educ & Res, Sao Paulo, Brazil
[2] Ctr Genom Regulat CRG, Barcelona, Spain
[3] Stanford Univ, Sch Med, Dept Biomed Data Sci, Stanford, CA 94305 USA
[4] Tempus Labs, Chicago, IL USA
[5] Nat Genet, San Francisco, CA USA
[6] European Mol Biol Lab, European Bioinformat Inst, Wellcome Genome Campus, Hinxton, England
[7] Cold Spring Harbor Lab, Dept Funct Gen, Cold Spring Harbor, NY USA
[8] Univ Pompeu Fabra UPF, Barcelona, Spain
[9] AstraZeneca, Ctr Genom Res Discovery Sci, Royston, England
[10] Univ Thessaly, Dept Comp Sci & Biomed Informat, Lamia, Greece
[11] Hellenic Pasteur Inst, Athens, Greece
[12] Univ Coll Dublin, Sch Biol & Environm Sci, Dublin, Ireland
[13] Univ Coll Dublin, Conway Inst Biomed & Biomol Res, Dublin, Ireland
[14] Univ Bern, Univ Hosp Bern, Inselspital, Dept Med Oncol, Bern, Switzerland
[15] Univ Bern, Dept Biomed Res, Bern, Switzerland
[16] Natl Lib Med, Nat Ctr Biotechnol Informat, NIH, Bethesda, MD USA
[17] Johns Hopkins Univ, Computat Biol, Baltimore, MD 21218 USA
[18] Johns Hopkins Univ, Dept Biomed Engn, Baltimore, MD 21218 USA
[19] Johns Hopkins Univ, Dept Comp Sci, Baltimore, MD 21218 USA
[20] RIKEN Ctr Integrat Med Sci, Lab Transcriptome Technol, Yokohama, Kanagawa, Japan
[21] Weizmann Inst Sci, Dept Immunol & Regenerat Biol, Rehovot, Israel
[22] Weizmann Inst Sci, Dept Mol Neurosci, Rehovot, Israel
[23] Univ Melbourne, Dept Anat & Physiol, Stem Cell Syst, Fac Med Dent & Hlth Sci, Parkville, Vic, Australia
[24] Univ Utah, Utah Ctr Genet Discovery, Dept Human Genet, Salt Lake City, UT USA
[25] Human Technopole, Milan, Italy
[26] Johns Hopkins Univ, Dept Biostat, Baltimore, MD USA
基金
美国国家卫生研究院; 英国医学研究理事会; 英国惠康基金; 爱尔兰科学基金会; 美国国家科学基金会;
关键词
LONG NONCODING RNAS; SEQUENCE; ANNOTATION; LANDSCAPE; ELEMENTS; DATABASE;
D O I
10.1038/s41586-023-06490-x
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Scientists have been trying to identify every gene in the human genome since the initial draft was published in 2001. In the years since, much progress has been made in identifying protein-coding genes, currently estimated to number fewer than 20,000, with an ever-expanding number of distinct protein-coding isoforms. Here we review the status of the human gene catalogue and the efforts to complete it in recent years. Beside the ongoing annotation of protein-coding genes, their isoforms and pseudogenes, the invention of high-throughput RNA sequencing and other technological breakthroughs have led to a rapid growth in the number of reported non-coding RNA genes. For most of these non-coding RNAs, the functional relevance is currently unclear; we look at recent advances that offer paths forward to identifying their functions and towards eventually completing the human gene catalogue. Finally, we examine the need for a universal annotation standard that includes all medically significant genes and maintains their relationships with different reference genomes for the use of the human gene catalogue in clinical settings. Although the catalogue of human protein-coding genes is nearing completion, the number of non-coding RNA genes remains highly uncertain, and for all genes much work remains to be done to understand their functions.
引用
收藏
页码:41 / 47
页数:7
相关论文
共 80 条
[21]   Developmental dynamics of RNA translation in the human brain [J].
Duffy, Erin E. ;
Finander, Benjamin ;
Choi, GiHun ;
Carter, Ava C. ;
Pritisanac, Iva ;
Alam, Aqsa ;
Luria, Victor ;
Karger, Amir ;
Phu, William ;
Sherman, Maxwell A. ;
Assad, Elena G. ;
Pajarillo, Naomi ;
Khitun, Alexandra ;
Crouch, Elizabeth E. ;
Ganesh, Sanika ;
Chen, Jin ;
Berger, Bonnie ;
Sestan, Nenad ;
O'Donnell-Luria, Anne ;
Huang, Eric J. ;
Griffith, Eric C. ;
Forman-Kay, Julie D. ;
Moses, Alan M. ;
Kalish, Brian T. ;
Greenberg, Michael E. .
NATURE NEUROSCIENCE, 2022, 25 (10) :1353-+
[22]   An integrated encyclopedia of DNA elements in the human genome [J].
Dunham, Ian ;
Kundaje, Anshul ;
Aldred, Shelley F. ;
Collins, Patrick J. ;
Davis, CarrieA. ;
Doyle, Francis ;
Epstein, Charles B. ;
Frietze, Seth ;
Harrow, Jennifer ;
Kaul, Rajinder ;
Khatun, Jainab ;
Lajoie, Bryan R. ;
Landt, Stephen G. ;
Lee, Bum-Kyu ;
Pauli, Florencia ;
Rosenbloom, Kate R. ;
Sabo, Peter ;
Safi, Alexias ;
Sanyal, Amartya ;
Shoresh, Noam ;
Simon, Jeremy M. ;
Song, Lingyun ;
Trinklein, Nathan D. ;
Altshuler, Robert C. ;
Birney, Ewan ;
Brown, James B. ;
Cheng, Chao ;
Djebali, Sarah ;
Dong, Xianjun ;
Dunham, Ian ;
Ernst, Jason ;
Furey, Terrence S. ;
Gerstein, Mark ;
Giardine, Belinda ;
Greven, Melissa ;
Hardison, Ross C. ;
Harris, Robert S. ;
Herrero, Javier ;
Hoffman, Michael M. ;
Iyer, Sowmya ;
Kellis, Manolis ;
Khatun, Jainab ;
Kheradpour, Pouya ;
Kundaje, Anshul ;
Lassmann, Timo ;
Li, Qunhua ;
Lin, Xinying ;
Marinov, Georgi K. ;
Merkel, Angelika ;
Mortazavi, Ali .
NATURE, 2012, 489 (7414) :57-74
[23]   HOW MANY GENES IN THE HUMAN GENOME [J].
FIELDS, C ;
ADAMS, MD ;
WHITE, O ;
VENTER, JC .
NATURE GENETICS, 1994, 7 (03) :345-346
[24]   A promoter-level mammalian expression atlas [J].
Forrest, Alistair R. R. ;
Kawaji, Hideya ;
Rehli, Michael ;
Baillie, J. Kenneth ;
de Hoon, Michiel J. L. ;
Haberle, Vanja ;
Lassmann, Timo ;
Kulakovskiy, Ivan V. ;
Lizio, Marina ;
Itoh, Masayoshi ;
Andersson, Robin ;
Mungall, Christopher J. ;
Meehan, Terrence F. ;
Schmeier, Sebastian ;
Bertin, Nicolas ;
Jorgensen, Mette ;
Dimont, Emmanuel ;
Arner, Erik ;
Schmidl, Christian ;
Schaefer, Ulf ;
Medvedeva, Yulia A. ;
Plessy, Charles ;
Vitezic, Morana ;
Severin, Jessica ;
Semple, Colin A. ;
Ishizu, Yuri ;
Young, Robert S. ;
Francescatto, Margherita ;
Alam, Intikhab ;
Albanese, Davide ;
Altschuler, Gabriel M. ;
Arakawa, Takahiro ;
Archer, John A. C. ;
Arner, Peter ;
Babina, Magda ;
Rennie, Sarah ;
Balwierz, Piotr J. ;
Beckhouse, Anthony G. ;
Pradhan-Bhatt, Swati ;
Blake, Judith A. ;
Blumenthal, Antje ;
Bodega, Beatrice ;
Bonetti, Alessandro ;
Briggs, James ;
Brombacher, Frank ;
Burroughs, A. Maxwell ;
Califano, Andrea ;
Cannistraci, Carlo V. ;
Carbajo, Daniel ;
Chen, Yun .
NATURE, 2014, 507 (7493) :462-+
[25]   GENCODE: reference annotation for the human and mouse genomes in 2023 [J].
Frankish, Adam ;
Carbonell-Sala, Silvia ;
Diekhans, Mark ;
Jungreis, Irwin ;
Loveland, Jane E. ;
Mudge, Jonathan M. ;
Sisu, Cristina ;
Wright, James C. ;
Arnan, Carme ;
Barnes, If ;
Banerjee, Abhimanyu ;
Bennett, Ruth ;
Berry, Andrew ;
Bignell, Alexandra ;
Boix, Carles ;
Calvet, Ferriol ;
Cerdan-Velez, Daniel ;
Cunningham, Fiona ;
Davidson, Claire ;
Donaldson, Sarah ;
Dursun, Cagatay ;
Fatima, Reham ;
Giorgetti, Stefano ;
Giron, Carlos Garcia ;
Gonzalez, Jose Manuel ;
Hardy, Matthew ;
Harrison, Peter W. ;
Hourlier, Thibaut ;
Hollis, Zoe ;
Hunt, Toby ;
James, Benjamin ;
Jiang, Yunzhe ;
Johnson, Rory ;
Kay, Mike ;
Lagarde, Julien ;
Martin, Fergal J. ;
Gomez, Laura Martinez ;
Nair, Surag ;
Ni, Pengyu ;
Pozo, Fernando ;
Ramalingam, Vivek ;
Ruffier, Magali ;
Schmitt, Bianca M. ;
Schreiber, Jacob M. ;
Steed, Emily ;
Suner, Marie-Marthe ;
Sumathipala, Dulika ;
Sycheva, Irina ;
Uszczynska-Ratajczak, Barbara ;
Wass, Elizabeth .
NUCLEIC ACIDS RESEARCH, 2023, 51 (D1) :D942-D949
[26]   Transcriptome variation in human tissues revealed by long-read sequencing [J].
Glinos, Dafni A. ;
Garborcauskas, Garrett ;
Hoffman, Paul ;
Ehsan, Nava ;
Jiang, Lihua ;
Gokden, Alper ;
Dai, Xiaoguang ;
Aguet, Francois ;
Brown, Kathleen L. ;
Garimella, Kiran ;
Bowers, Tera ;
Costello, Maura ;
Ardlie, Kristin ;
Jian, Ruiqi ;
Tucker, Nathan R. ;
Ellinor, Patrick T. ;
Harrington, Eoghan D. ;
Tang, Hua ;
Snyder, Michael ;
Juul, Sissel ;
Mohammadi, Pejman ;
MacArthur, Daniel G. ;
Lappalainen, Tuuli ;
Cummings, Beryl .
NATURE, 2022, 608 (7922) :353-+
[27]   Transcriptome analysis of human tissues and cell lines reveals one dominant transcript per gene [J].
Gonzalez-Porta, Mar ;
Frankish, Adam ;
Rung, Johan ;
Harrow, Jennifer ;
Brazma, Alvis .
GENOME BIOLOGY, 2013, 14 (07)
[28]   Discovery of widespread transcription initiation at microsatellites predictable by sequence-based deep neural network [J].
Grapotte, Mathys ;
Saraswat, Manu ;
Bessiere, Chloe ;
Menichelli, Christophe ;
Ramilowski, Jordan A. ;
Severin, Jessica ;
Hayashizaki, Yoshihide ;
Itoh, Masayoshi ;
Tagami, Michihira ;
Murata, Mitsuyoshi ;
Kojima-Ishiyamas, Miki ;
Noma, Shohei ;
Noguchi, Shuhei ;
Kasukawa, Takeya ;
Hasegawa, Akira ;
Suzuki, Harukazu ;
Nishiyori-Sueki, Hiromi ;
Frith, Martin C. ;
Chatelain, Clement ;
Carninci, Piero ;
de Hoom, Michiel J. L. ;
Wasserman, Wyeth W. ;
Brehelin, Laurent ;
Lecellieree, Charles-Henri .
NATURE COMMUNICATIONS, 2021, 12 (01)
[29]   Transcriptional-Readthrough RNAs Reflect the Phenomenon of "A Gene Contains Gene(s)" or "Gene(s) within a Gene" in the Human Genome, and Thus Are Not Chimeric RNAs [J].
He, Yan ;
Yuan, Chengfu ;
Chen, Lichan ;
Lei, Mingjuan ;
Zellmer, Lucas ;
Huang, Hai ;
Liao, Dezhong Joshua .
GENES, 2018, 9 (01)
[30]   An atlas of human long non-coding RNAs with accurate 5′ ends [J].
Hon, Chung-Chau ;
Ramilowski, Jordan A. ;
Harshbarger, Jayson ;
Bertin, Nicolas ;
Rackham, Owen J. L. ;
Gough, Julian ;
Denisenko, Elena ;
Schmeier, Sebastian ;
Poulsen, Thomas M. ;
Severin, Jessica ;
Lizio, Marina ;
Kawaji, Hideya ;
Kasukawa, Takeya ;
Itoh, Masayoshi ;
Burroughs, A. Maxwell ;
Noma, Shohei ;
Djebali, Sarah ;
Alam, Tanvir ;
Medvedeva, Yulia A. ;
Testa, Alison C. ;
Lipovich, Leonard ;
Yip, Chi-Wai ;
Abugessaisa, Imad ;
Mendez, Mickael ;
Hasegawa, Akira ;
Tang, Dave ;
Lassmann, Timo ;
Heutink, Peter ;
Babina, Magda ;
Wells, Christine A. ;
Kojima, Soichi ;
Nakamura, Yukio ;
Suzuki, Harukazu ;
Daub, Carsten O. ;
de Hoon, Michiel J. L. ;
Arner, Erik ;
Hayashizaki, Yoshihide ;
Carninci, Piero ;
Forrest, Alistair R. R. .
NATURE, 2017, 543 (7644) :199-+