Pangenome and pantranscriptome as the new reference for gene-family characterization: A case study of basic helix-loop-helix (bHLH) genes in barley

被引:1
作者
Tong, Cen [1 ,2 ]
Jia, Yong [1 ,2 ]
Hu, Haifei [1 ,2 ]
Zeng, Zhanghui [3 ]
Chapman, Brett [1 ,2 ]
Li, Chengdao [1 ,2 ,4 ,5 ]
机构
[1] Murdoch Univ, Western Crop Genet Alliance, Murdoch, WA 6150, Australia
[2] Murdoch Univ, Coll Sci Hlth Engn & Educ, State Agr Biotechnol Ctr SABC, Murdoch, WA 6150, Australia
[3] Hangzhou Normal Univ, Coll Life & Environm Sci, Hangzhou 311121, Peoples R China
[4] Govt Western Australia, Dept Primary Ind & Reg Dev, South Perth, WA 6155, Australia
[5] Shandong Agr Univ, Coll Agr, Tai An, Peoples R China
关键词
bHLH; barley pangenome; core and dispensable genes; genome-wide gene- family evolution; orthologous gene group; pantranscriptome; TRANSCRIPTION FACTOR; EVOLUTIONARY; ARABIDOPSIS; RICE; PROTEIN; IDENTIFICATION; ELONGATION; REGULATOR;
D O I
10.1016/j.xplc.2024.101190
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Genome-wide identification and comparative gene-family analyses have commonly been performed to investigate species- specific evolution linked to various traits and molecular pathways. However, most previous studies have been limited to gene screening in a single reference genome, failing to account for the gene presence/absence variations (gPAVs) in a species. Here, we propose an innovative pangenome-based approach for gene-family analyses based on orthologous gene groups (OGGs). Using the basic helix-loop-helix (bHLH) transcription factor family in barley as an example, we identified 161-176 bHLHs in 20 barley genomes, which can be classified into 201 OGGs. These 201 OGGs were further classified into 140 core, 12 softcore, 29 shell, and 20 line-specific/cloud bHLHs, revealing the complete profile of bHLH genes in barley. Using a genome-scanning approach, we overcame the genome annotation bias and identified an average of 1.5 un-annotated core bHLHs per barley genome. We found that whole-genome/segmental duplicates are predominant mechanisms contributing to the expansion of most core/softcore bHLHs, whereas dispensable bHLHs are more likely to result from small-scale duplication events. Interestingly, we noticed that the dispensable bHLHs tend to be enriched in the specific subfamilies SF13, SF27, and SF28, implying the potentially biased expansion of specific bHLHs in barley. We found that 50% of the bHLHs contain at least 1 intact transposon element (TE) within the 2-kb upstream-to-downstream region. bHLHs with copy-number variations (CNVs) have 1.48 TEs on average, significantly more than core bHLHs without CNVs (1.36), supporting a potential role of TEs in bHLH expansion. Analyses of selection pressure showed that dispensable bHLHs have experienced clear relaxation of selection compared with core bHLHs, consistent with their conservation patterns. We also integrated the pangenome data with recently available barley pantranscriptome data from 5 tissues and discovered apparent transcriptional divergence within and across bHLH subfamilies. We conclude that pangenome-based gene-family analyses can better describe the previously untapped, genuine evolutionary status of bHLHs and provide novel insights into bHLH evolution in barley. We expect that this study will inspire similar analyses in many other gene families and species.
引用
收藏
页数:20
相关论文
共 67 条
  • [1] Update on the basic helix-loop-helix transcription factor gene family in Arabidopsis thaliana
    Bailey, PC
    Martin, C
    Toledo-Ortiz, G
    Quail, PH
    Huq, E
    Heim, MA
    Jakoby, M
    Werber, M
    Weisshaar, B
    [J]. PLANT CELL, 2003, 15 (11) : 2497 - 2501
  • [2] Plant pan-genomes are the new reference
    Bayer, Philipp E.
    Golicz, Agnieszka A.
    Scheben, Armin
    Batley, Jacqueline
    Edwards, David
    [J]. NATURE PLANTS, 2020, 6 (08) : 914 - 920
  • [3] Near-optimal probabilistic RNA-seq quantification (vol 34, pg 525, 2016)
    Bray, Nicolas L.
    Pimentel, Harold
    Melsted, Pall
    Pachter, Lior
    [J]. NATURE BIOTECHNOLOGY, 2016, 34 (08) : 888 - 888
  • [4] Genome-Wide Classification and Evolutionary Analysis of the bHLH Family of Transcription Factors in Arabidopsis, Poplar, Rice, Moss, and Algae
    Carretero-Paulet, Lorenzo
    Galstyan, Anahit
    Roig-Villanova, Irma
    Martinez-Garcia, Jaime F.
    Bilbao-Castro, Jose R.
    Robertson, David L.
    [J]. PLANT PHYSIOLOGY, 2010, 153 (03) : 1398 - 1412
  • [5] TBtools: An Integrative Toolkit Developed for Interactive Analyses of Big Biological Data
    Chen, Chengjie
    Chen, Hao
    Zhang, Yi
    Thomas, Hannah R.
    Frank, Margaret H.
    He, Yehua
    Xia, Rui
    [J]. MOLECULAR PLANT, 2020, 13 (08) : 1194 - 1202
  • [6] How the pan-genome is changing crop genomics and improvement
    Della Coletta, Rafael
    Qiu, Yinjie
    Ou, Shujun
    Hufford, Matthew B.
    Hirsch, Candice N.
    [J]. GENOME BIOLOGY, 2021, 22 (01)
  • [7] MUSCLE: multiple sequence alignment with high accuracy and high throughput
    Edgar, RC
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 (05) : 1792 - 1797
  • [8] Genome-wide identification and expression analysis of the bHLH transcription factor family and its response to abiotic stress in foxtail millet (Setaria italica L.)
    Fan, Yu
    Lai, Dili
    Yang, Hao
    Xue, Guoxing
    He, Ailing
    Chen, Long
    Feng, Liang
    Ruan, Jingjun
    Xiang, Dabing
    Yan, Jun
    Cheng, Jianping
    [J]. BMC GENOMICS, 2021, 22 (01)
  • [9] Evolutionary and comparative analysis of MYB and bHLH plant transcription factors
    Feller, Antje
    Machemer, Katja
    Braun, Edward L.
    Grotewold, Erich
    [J]. PLANT JOURNAL, 2011, 66 (01) : 94 - 116
  • [10] Gene duplication and evolutionary novelty in plants
    Flagel, Lex E.
    Wendel, Jonathan F.
    [J]. NEW PHYTOLOGIST, 2009, 183 (03) : 557 - 564