Enriched atlas of lncRNA and protein-coding genes for the GRCg7b chicken assembly and its functional annotation across 47 tissues

Cited by: 6
Authors
Degalez, Fabien [1]
Charles, Mathieu [2, 3]
Foissac, Sylvain [4]
Zhou, Huaijun [5]
Guan, Dailu [5]
Fang, Lingzhao [6]
Klopp, Christophe [2]
Allain, Coralie [1]
Lagoutte, Laetitia [1]
Lecerf, Frederic [1]
Acloque, Herve [3]
Giuffra, Elisabetta [3]
Pitel, Frederique [4]
Lagarrigue, Sandrine [1]
Affiliations
[1] Inst Agro, PEGASE, INRAE, F-35590 St Gilles, France
[2] Univ Fed Toulouse, INRAE, Bioinf, GenoToul Bioinformat Facil, Sigenae, F-31326 Castanet Tolosan, France
[3] Paris Saclay Univ, INRAE, AgroParisTech, GABI, F-78350 Jouy En Josas, France
[4] Univ Toulouse, GenPhySE, INRAE, ENVT, F-31326 Castanet Tolosan, France
[5] Univ Calif Davis, Davis, CA USA
[6] Aarhus Univ, Aarhus, Denmark
Funding
EU Horizon 2020
Keywords
Gene atlas; Long non coding RNAs; Chicken; Genome annotation; Tissue specificity; Co-expression; miRNA; LONG NONCODING RNAS; EXPRESSION; TRANSCRIPTION; EVOLUTION; MECHANISMS; MUTATION; REVEALS; PACKAGE;
DOI
10.1038/s41598-024-56705-y
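Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09
Abstract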
Gene atlases for livestock are steadily improving thanks to new genome assemblies and new expression data that refine gene annotation. However, gene content varies across databases because of differences in RNA sequencing data and bioinformatics pipelines, especially for long non-coding RNAs (lncRNAs), which have higher tissue and developmental specificity and are harder to identify consistently than protein-coding genes (PCGs). As done previously in 2020 for the chicken assemblies galgal5 and GRCg6a, we provide a new lncRNA-enriched gene atlas for the latest GRCg7b chicken assembly, integrating the "NCBI RefSeq" and "EMBL-EBI Ensembl/GENCODE" reference annotations with other resources such as FAANG and NONCODE. As a result, the number of PCGs increases from 18,022 (RefSeq) and 17,007 (Ensembl) to 24,102, and that of lncRNAs from 5,789 (RefSeq) and 11,944 (Ensembl) to 44,428. Using 1,400 public RNA-seq transcriptomes representing 47 tissues, we provided expression evidence for 35,257 (79%) lncRNAs and 22,468 (93%) PCGs, supporting the relevance of this atlas. Further characterization, including tissue specificity, sex-differential expression and gene configurations, is provided. We also identified conserved miRNA-hosting genes with human counterparts, suggesting a common function. The annotated atlas is available at gega.sigenae.org
Pages: 18