Analyzing Medicago spp. seed morphology using GWAS and machine learning

被引:2
作者
Botkin, Jacob [1 ]
Medina, Cesar [2 ]
Park, Sunchung [3 ]
Poudel, Kabita [2 ]
Cha, Minhyeok [4 ]
Lee, Yoonjung [1 ]
Prom, Louis K. [5 ]
Curtin, Shaun J. [2 ,6 ,7 ,8 ]
Xu, Zhanyou [6 ]
Ahn, Ezekiel [3 ]
机构
[1] Univ Minnesota, Dept Plant Pathol, St Paul, MN 55108 USA
[2] Univ Minnesota, Dept Agron & Plant Genet, St Paul, MN 55108 USA
[3] USDA ARS, Beltsville Agr Res Ctr, Sustainable Perennial Crops Lab, Beltsville, MD 20705 USA
[4] Korea Univ, Dept Biotechnol, Seoul 02841, South Korea
[5] USDA ARS, Southern Plains Agr Res Ctr, 2765 F&B Rd, College Stn, TX 77845 USA
[6] united States Dept Agr, Agr Res Serv, Plant Sci Res Unit, St Paul, MN 55108 USA
[7] Univ Minnesota, Ctr Plant Precis Genom, St Paul, MN 55108 USA
[8] Univ Minnesota, Ctr Genome Engn, St Paul, MN 55108 USA
来源
SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期
关键词
Medicago sativa; Alfalfa; Seed morphology; Area size; Seed color; RGB; GWAS; Machine learning; COAT COLOR; ASSOCIATION; ACTIVATION;
D O I
10.1038/s41598-024-67790-4
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Alfalfa is widely recognized as an important forage crop. To understand the morphological characteristics and genetic basis of seed morphology in alfalfa, we screened 318 Medicago spp., including 244 Medicago sativa subsp. sativa (alfalfa) and 23 other Medicago spp., for seed area size, length, width, length-to-width ratio, perimeter, circularity, the distance between the intersection of length & width (IS) and center of gravity (CG), and seed darkness & red-green-blue (RGB) intensities. The results revealed phenotypic diversity and correlations among the tested accessions. Based on the phenotypic data of M. sativa subsp. sativa, a genome-wide association study (GWAS) was conducted using single nucleotide polymorphisms (SNPs) called against the Medicago truncatula genome. Genes in proximity to associated markers were detected, including CPR1, MON1, a PPR protein, and Wun1(threshold of 1E-04). Machine learning models were utilized to validate GWAS, and identify additional marker-trait associations for potentially complex traits. Marker S7_33375673, upstream of Wun1, was the most important predictor variable for red color intensity and highly important for brightness. Fifty-two markers were identified in coding regions. Along with strong correlations observed between seed morphology traits, these genes will facilitate the process of understanding the genetic basis of seed morphology in Medicago spp.
引用
收藏
页数:15
相关论文
共 50 条
[41]   Analyzing Data Granularity Levels for Insider Threat Detection Using Machine Learning [J].
Le, Duc C. ;
Zincir-Heywood, Nur ;
Heywood, Malcolm I. .
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2020, 17 (01) :30-44
[42]   Analyzing lower body movements using machine learning to classify autistic children [J].
Aljabiri, Saida Mohd Salah ;
Hamdan, Mohammad M. .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 94
[43]   Analyzing different household energy use patterns using clustering and machine learning [J].
Cui, Xue ;
Lee, Minhyun ;
Uddin, Mohammad Nyme ;
Zhang, Xuange ;
Zakka, Vincent Gbouna .
RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2025, 212
[44]   Analyzing Longitudinal Data Using Machine Learning with Mixed-Effects Models [J].
Yigit, Pakize ;
Ahmed, Syed Ejaz .
EIGHTEENTH INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND ENGINEERING MANAGEMENT, ICMSEM 2024, 2024, 215 :633-646
[45]   Importance of GWAS Risk Loci and Clinical Data in Predicting Asthma Using Machine-learning Approaches [J].
Qin, Zan-Mei ;
Liang, Si-Qiao ;
Long, Jian-Xiong ;
Deng, Jing-Min ;
Wei, Xuan ;
Yang, Mei-Ling ;
Tang, Shao-Jie ;
Li, Hai-Li .
COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2024, 27 (03) :400-407
[46]   End to end stroke triage using cerebrovascular morphology and machine learning [J].
Deshpande, Aditi ;
Elliott, Jordan ;
Jiang, Bin ;
Tahsili-Fahadan, Pouya ;
Kidwell, Chelsea ;
Wintermark, Max ;
Laksari, Kaveh .
FRONTIERS IN NEUROLOGY, 2023, 14
[47]   Seed classification of three species of amaranth (Amaranthus spp.) using artificial neural network and canonical discriminant analysis [J].
Bagheri, A. ;
Eghbali, L. ;
Haghighi, R. Sadrabadi .
JOURNAL OF AGRICULTURAL SCIENCE, 2019, 157 (04) :333-341
[48]   Molecular Mapping of Water-Stress Responsive Genomic Loci in Lettuce (Lactuca spp.) Using Kinetics Chlorophyll Fluorescence, Hyperspectral Imaging and Machine Learning [J].
Kumar, Pawan ;
Eriksen, Renee L. ;
Simko, Ivan ;
Mou, Beiquan .
FRONTIERS IN GENETICS, 2021, 12
[49]   Analyzing the difference between deep learning and machine learning features of EEG signal using clustering techniques [J].
Saha, Anushri ;
Rathore, Sachin Singh ;
Sharma, Shivam ;
Samanta, Debasis .
PROCEEDINGS OF 2019 IEEE REGION 10 SYMPOSIUM (TENSYMP), 2019, :660-664
[50]   Engineering, Technology Techniques Soybean Seed Classification Using NIR and Machine Learning [J].
Wendling, Graziele Feltrin Dias ;
Monteiro, Rita de Cassia Mota ;
Bernardy, Ruan ;
Pinto, Karine Von Ahn ;
Pinheiro, Romario de Mesquita ;
Gadotti, Gizele Ingrid .
BRAZILIAN ARCHIVES OF BIOLOGY AND TECHNOLOGY, 2024, 67