Application of machine learning to explore the genomic prediction accuracy of fall dormancy in autotetraploid alfalfa

被引:21
作者
Zhang, Fan [1 ,2 ]
Kang, Junmei [1 ]
Long, Ruicai [1 ]
Li, Mingna [1 ]
Sun, Yan [3 ]
He, Fei [1 ]
Jiang, Xueqian [1 ]
Yang, Changfu [1 ]
Yang, Xijiang [1 ]
Kong, Jie [1 ]
Wang, Yiwen [4 ]
Wang, Zhen [1 ]
Zhang, Zhiwu [2 ]
Yang, Qingchuan [1 ]
机构
[1] Chinese Acad Agr Sci, Inst Anim Sci, Beijing 100193, Peoples R China
[2] Washington State Univ, Dept Crop & Soil Sci, Pullman, WA 99163 USA
[3] China Agr Univ, Coll Grassland Sci & Technol, Dept Turf Sci & Engn, Beijing 100193, Peoples R China
[4] Univ Melbourne, Sch Math & Stat, Melbourne Integrat Genom, Melbourne 3052, Australia
基金
中国国家自然科学基金;
关键词
GROWTH; SELECTION; FORMAT; YIELD;
D O I
10.1093/hr/uhac225
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Fall dormancy (FD) is an essential trait to overcome winter damage and for alfalfa (Medicago sativa) cultivar selection. The plant regrowth height after autumn clipping is an indirect way to evaluate FD. Transcriptomics, proteomics, and quantitative trait locus mapping have revealed crucial genes correlated with FD; however, these genes cannot predict alfalfa FD very well. Here, we conducted genomic prediction of FD using whole-genome SNP markers based on machine learning-related methods,including support vector machine (SVM) regression, and regularization-related methods, such as Lasso and ridge regression. The results showed that using SVM regression with linear kernel and the top 3000 genome-wide association study (GWAS)-associated markers achieved the highest prediction accuracy for FD of 64.1%. For plant regrowth height, the prediction accuracy was 59.0% using the 3000 GWAS-associated markers and the SVM linear model. This was better than the results using whole-genome markers (25.0%). Therefore, the method we explored for alfalfa FD prediction outperformed the other models, such as Lasso and ElasticNet. The study suggests the feasibility of using machine learning to predict FD with GWAS-associated markers, and the GWAS-associated markers combined with machine learning would benefit FD-related traits as well. Application of the methodology may provide potential targets for FD selection, which would accelerate genetic research and molecular breeding of alfalfa with optimized FD.
引用
收藏
页数:8
相关论文
共 48 条
[1]   Dissecting Key Adaptation Traits in the Polyploid Perennial Medicago sativa Using GBS-SNP Mapping [J].
Adhikari, Laxman ;
Lindstrom, Orville M. ;
Markham, Jonathan ;
Missaoui, Ali M. .
FRONTIERS IN PLANT SCIENCE, 2018, 9
[2]   Accuracy of genomic selection for alfalfa biomass yield in different reference populations [J].
Annicchiarico, Paolo ;
Nazzicari, Nelson ;
Li, Xuehui ;
Wei, Yanling ;
Pecetti, Luciano ;
Brummer, E. Charles .
BMC GENOMICS, 2015, 16
[3]   Assessment of yield and quality characteristics of alfalfa (Medicago sativa L.) cultivars with different fall dormancy rating [J].
Avci, Mustafa ;
Hatipoglu, Rustu ;
Cinar, Selahattin ;
Kilicalp, Numan .
LEGUME RESEARCH, 2018, 41 (03) :369-373
[4]  
Barnes D. K., 1979, Agricultural Reviews and Manuals, Science and Education Administration, United States Department of Agriculture, P34
[5]   Effect of photoperiod prior to cold acclimation on freezing tolerance and carbohydrate metabolism in alfalfa (Medicago sativa L.) [J].
Bertrand, Annick ;
Bipfubusa, Marie ;
Claessens, Annie ;
Rocher, Solen ;
Castonguay, Yves .
PLANT SCIENCE, 2017, 264 :122-128
[6]   Genome-Wide Association Mapping and Genomic Selection for Alfalfa (Medicago sativa) Forage Quality Traits [J].
Biazzi, Elisa ;
Nazzicari, Nelson ;
Pecetti, Luciano ;
Brummer, E. Charles ;
Palmonari, Alberto ;
Tava, Aldo ;
Annicchiarico, Paolo .
PLOS ONE, 2017, 12 (01)
[7]   Trimmomatic: a flexible trimmer for Illumina sequence data [J].
Bolger, Anthony M. ;
Lohse, Marc ;
Usadel, Bjoern .
BIOINFORMATICS, 2014, 30 (15) :2114-2120
[8]   Across-population genomic prediction in grapevine opens up promising prospects for breeding [J].
Brault, Charlotte ;
Segura, Vincent ;
This, Patrice ;
Le Cunff, Loic ;
Flutre, Timothee ;
Francois, Pierre ;
Pons, Thierry ;
Peros, Jean-Pierre ;
Doligez, Agnes .
HORTICULTURE RESEARCH, 2022, 9
[9]   A One-Penny Imputed Genome from Next-Generation Reference Panels [J].
Browning, Brian L. ;
Zhou, Ying ;
Browning, Sharon R. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2018, 103 (03) :338-348
[10]   Multiple-trait analyses improved the accuracy of genomic prediction and the power of genome-wide association of productivity and climate change-adaptive traits in lodgepole pine [J].
Cappa, Eduardo P. ;
Chen, Charles ;
Klutsch, Jennifer G. ;
Sebastian-Azcona, Jaime ;
Ratcliffe, Blaise ;
Wei, Xiaojing ;
Da Ros, Letitia ;
Ullah, Aziz ;
Liu, Yang ;
Benowicz, Andy ;
Sadoway, Shane ;
Mansfield, Shawn D. ;
Erbilgin, Nadir ;
Thomas, Barb R. ;
El-Kassaby, Yousry A. .
BMC GENOMICS, 2022, 23 (01)