Multi-trait modeling and machine learning discover new markers associated with stem traits in alfalfa

被引:2
作者
Medina, Cesar A. [1 ]
Heuschele, Deborah J. [1 ,2 ]
Zhao, Dongyan [3 ]
Lin, Meng [3 ]
Beil, Craig T. [3 ]
Sheehan, Moira J. [3 ]
Xu, Zhanyou [1 ,2 ]
机构
[1] Univ Minnesota, Dept Agron & Plant Genet, St Paul, MN 55455 USA
[2] ARS, Plant Sci Res Unit, USDA, St Paul, MN 55108 USA
[3] Cornell Univ, Breeding Insight, Ithaca, NY USA
来源
FRONTIERS IN PLANT SCIENCE | 2024年 / 15卷
关键词
alfalfa; stem traits; GWAS; multivariate modeling; machine learning; BINDING CASSETTE TRANSPORTER; NICOTIANA-PLUMBAGINIFOLIA; POPULATION-STRUCTURE; DOWN-REGULATION; PLANT; DIGESTIBILITY; LIGNIN; FORAGE; IMPROVEMENT; DIVISION;
D O I
10.3389/fpls.2024.1429976
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Alfalfa biomass can be fractionated into leaf and stem components. Leaves comprise a protein-rich and highly digestible portion of biomass for ruminant animals, while stems constitute a high fiber and less digestible fraction, representing 50 to 70% of the biomass. However, little attention has focused on stem-related traits, which are a key aspect in improving the nutritional value and intake potential of alfalfa. This study aimed to identify molecular markers associated with four morphological traits in a panel of five populations of alfalfa generated over two cycles of divergent selection based on 16-h and 96-h in vitro neutral detergent fiber digestibility in stems. Phenotypic traits of stem color, presence of stem pith cells, winter standability, and winter injury were modeled using univariate and multivariate spatial mixed linear models (MLM), and the predicted values were used as response variables in genome-wide association studies (GWAS). The alfalfa panel was genotyped using a 3K DArTag SNP markers for the evaluation of the genetic structure and GWAS. Principal component and population structure analyses revealed differentiations between populations selected for high- and low-digestibility. Thirteen molecular markers were significantly associated with stem traits using either univariate or multivariate MLM. Additionally, support vector machine (SVM) and random forest (RF) algorithms were implemented to determine marker importance scores for stem traits and validate the GWAS results. The top-ranked markers from SVM and RF aligned with GWAS findings for solid stem pith, winter standability, and winter injury. Additionally, SVM identified additional markers with high variable importance for solid stem pith and winter injury. Most molecular markers were located in coding regions. These markers can facilitate marker-assisted selection to expedite breeding programs to increase winter hardiness or stem palatability.
引用
收藏
页数:14
相关论文
共 63 条
  • [1] Genetic structure of putative heterotic populations of alfalfa
    Annicchiarico, Paolo
    Wei, Yanling
    Brummer, Edward Charles
    [J]. PLANT BREEDING, 2017, 136 (05) : 671 - 678
  • [2] [Anonymous], 2022, Citrus Fruits 2022 Summary, P1
  • [3] UniProt: a worldwide hub of protein knowledge
    Bateman, Alex
    Martin, Maria-Jesus
    Orchard, Sandra
    Magrane, Michele
    Alpi, Emanuele
    Bely, Benoit
    Bingley, Mark
    Britto, Ramona
    Bursteinas, Borisas
    Busiello, Gianluca
    Bye-A-Jee, Hema
    Da Silva, Alan
    De Giorgi, Maurizio
    Dogan, Tunca
    Castro, Leyla Garcia
    Garmiri, Penelope
    Georghiou, George
    Gonzales, Daniel
    Gonzales, Leonardo
    Hatton-Ellis, Emma
    Ignatchenko, Alexandr
    Ishtiaq, Rizwan
    Jokinen, Petteri
    Joshi, Vishal
    Jyothi, Dushyanth
    Lopez, Rodrigo
    Luo, Jie
    Lussi, Yvonne
    MacDougall, Alistair
    Madeira, Fabio
    Mahmoudy, Mahdi
    Menchi, Manuela
    Nightingale, Andrew
    Onwubiko, Joseph
    Palka, Barbara
    Pichler, Klemens
    Pundir, Sangya
    Qi, Guoying
    Raj, Shriya
    Renaux, Alexandre
    Lopez, Milagros Rodriguez
    Saidi, Rabie
    Sawford, Tony
    Shypitsyna, Aleksandra
    Speretta, Elena
    Turner, Edward
    Tyagi, Nidhi
    Vasudev, Preethi
    Volynkin, Vladimir
    Wardell, Tony
    [J]. NUCLEIC ACIDS RESEARCH, 2019, 47 (D1) : D506 - D515
  • [4] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [5] Bultreys A, 2009, MOL PLANT PATHOL, V10, P651, DOI [10.1111/j.1364-3703.2009.00562.x, 10.1111/J.1364-3703.2009.00562.X]
  • [6] Butler D. G., 2023, ASReml-R Reference Manual Version 4.2
  • [7] LAS1L interacts with the mammalian Rix1 complex to regulate ribosome biogenesis
    Castle, Christopher D.
    Cassimere, Erica K.
    Denicourt, Catherine
    [J]. MOLECULAR BIOLOGY OF THE CELL, 2012, 23 (04) : 716 - 728
  • [8] Allele-aware chromosome-level genome assembly and efficient transgene-free genome editing for the autotetraploid cultivated alfalfa
    Chen, Haitao
    Zeng, Yan
    Yang, Yongzhi
    Huang, Lingli
    Tang, Bolin
    Zhang, He
    Hao, Fei
    Li, Wei
    Li, Youhan
    Liu, Yanbin
    Zhang, Xiaoshuang
    Zhang, Ru
    Zhang, Yesheng
    Li, Yongxin
    Wang, Kun
    He, Hua
    Wang, Zhongkai
    Fan, Guangyi
    Yang, Hui
    Bao, Aike
    Shang, Zhanhuan
    Chen, Jianghua
    Wang, Wen
    Qiu, Qiang
    [J]. NATURE COMMUNICATIONS, 2020, 11 (01)
  • [9] polyRAD: Genotype Calling with Uncertainty from Sequencing Data in Polyploids and Diploids
    Clark, Lindsay V.
    Lipka, Alexander E.
    Sacks, Erik J.
    [J]. G3-GENES GENOMES GENETICS, 2019, 9 (03): : 663 - 673
  • [10] GENERAL AND SPECIFIC COMBINING ABILITY ESTIMATES FOR PITH CELL-DEATH IN STALK INTERNODES OF MAIZE
    COLBERT, TR
    KANG, MS
    MYERS, O
    ZUBER, MS
    [J]. FIELD CROPS RESEARCH, 1987, 17 (02) : 155 - 161