Metagenome-Wide Association Study and Machine Learning Prediction of Bulk Soil Microbiome and Crop Productivity

被引:100
|
作者
Chang, Hao-Xun [1 ,3 ]
Haudenshield, James S. [1 ,2 ]
Bowen, Charles R. [1 ,2 ]
Hartman, Glen L. [1 ,2 ]
机构
[1] Univ Illinois, Dept Crop Sci, Urbana, IL 61801 USA
[2] USDA ARS, Urbana, IL 61801 USA
[3] Michigan State Univ, Dept Plant Soil & Microbial Sci, E Lansing, MI 48824 USA
关键词
machine learning; metagenome-wide association study; microbiome; nitrogen fixation; productivity random forest; rhizobium; soybeans; SOYBEAN GLYCINE-MAX; RHIZOSPHERE MICROBIOME; NITROGEN-FIXATION; COMMUNITIES; BACTERIAL; DIVERSITY; STRAINS; GROWTH; YIELD;
D O I
10.3389/fmicb.2017.00519
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Areas within an agricultural field in the same season often differ in crop productivity despite having the same cropping history, crop genotype, and management practices. One hypothesis is that abiotic or biotic factors in the soils differ between areas resulting in these productivity differences. In this study, bulk soil samples collected from a high and a low productivity area from within six agronomic fields in Illinois were quantified for abiotic and biotic characteristics. Extracted DNA from these bulk soil samples were shotgun sequenced. While logistic regression analyses resulted in no significant association between crop productivity and the 26 soil characteristics, principal coordinate analysis and constrained correspondence analysis showed crop productivity explained a major proportion of the taxa variance in the bulk soil microbiome. Metagenome-wide association studies (MWAS) identified more Bradyrhizodium and Gammaproteobacteria in higher productivity areas and more actinobacteria, ascomycota, planctomycetales, and Streptophyta in lower productivity areas. Machine learning using a random forest method successfully predicted productivity based on the microbiome composition with the best accuracy of 0.79 at the order level. Our study showed that crop productivity differences were associated with bulk soil microbiome composition and highlighted several nitrogen utility-related taxa. We demonstrated the merit of MWAS and machine learning for the first time in a plant-microbiome study.
引用
收藏
页数:11
相关论文
共 34 条
  • [21] Accurate Prediction of Soil Heavy Metal Pollution Using an Improved Machine Learning Method: A Case Study in the Pearl River Delta, China
    Zhao, Wenhao
    Ma, Jin
    Liu, Qiyuan
    Qu, Yajing
    Dou, Lei
    Shi, Huading
    Sun, Yi
    Chen, Haiyan
    Tian, Yuxin
    Wu, Fengchang
    ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2023, 57 (46) : 17751 - 17761
  • [22] A genome-wide association study coupled with machine learning approaches to identify influential demographic and genomic factors underlying Parkinson's disease
    Rahman, Md Asad
    Liu, Jinling
    FRONTIERS IN GENETICS, 2023, 14
  • [23] Holistic understanding of trimethoprim resistance in Streptococcus pneumoniae using an integrative approach of genome-wide association study, resistance reconstruction, and machine learning
    Pham, Nguyen-Phuong
    Gingras, Helene
    Godin, Chantal
    Feng, Jie
    Groppi, Alexis
    Nikolski, Macha
    Leprohon, Philippe
    Ouellette, Marc
    MBIO, 2024, 15 (09):
  • [24] Could Artificial Intelligence/Machine Learning and Inclusion of Diet-Gut Microbiome Interactions Improve Disease Risk Prediction? Case Study: Coronary Artery Disease
    Vilne, Baiba
    Kibilds, Juris
    Siksna, Inese
    Lazda, Ilva
    Valcina, Olga
    Krumina, Angelika
    FRONTIERS IN MICROBIOLOGY, 2022, 13
  • [25] Identification of diagnostic cytosine-phosphate-guanine biomarkers in patients with gestational diabetes mellitus via epigenome-wide association study and machine learning
    Liu, Yan
    Wang, Zhenglu
    Zhao, Lin
    GYNECOLOGICAL ENDOCRINOLOGY, 2021, 37 (09) : 857 - 862
  • [26] Simple Optimal Sampling Algorithm to Strengthen Digital Soil Mapping Using the Spatial Distribution of Machine Learning Predictive Uncertainty: A Case Study for Field Capacity Prediction
    Yang, Hyunje
    Lim, Honggeun
    Moon, Haewon
    Li, Qiwen
    Nam, Sooyoun
    Kim, Jaehoon
    Choi, Hyung Tae
    LAND, 2022, 11 (11)
  • [27] Integrated Assays of Genome-Wide Association Study, Multi-Omics Co-Localization, and Machine Learning Associated Calcium Signaling Genes with Oilseed Rape Resistance to Sclerotinia sclerotiorum
    Wang, Xin-Yao
    Ren, Chun-Xiu
    Fan, Qing-Wen
    Xu, You-Ping
    Wang, Lu-Wen
    Mao, Zhou-Lu
    Cai, Xin-Zhong
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (13)
  • [28] Accurate prediction of spatial distribution of soil potentially toxic elements using machine learning and associated key influencing factors identification: A case study in mining and smelting area in southwestern China
    Li, Kai
    Guo, Guanghui
    Zhang, Degang
    Lei, Mei
    Wang, Yingying
    JOURNAL OF HAZARDOUS MATERIALS, 2024, 478
  • [29] Identification and prediction of association patterns between nutrient intake and anemia using machine learning techniques: results from a cross-sectional study with university female students from Palestine
    Qasrawi, Radwan
    Badrasawi, Manal
    Abu Al-Halawa, Diala
    Polo, Stephanny Vicuna
    Abu Khader, Rami
    Al-Taweel, Haneen
    Abu Alwafa, Reem
    Zahdeh, Rana
    Hahn, Andreas
    Schuchardt, Jan Philipp
    EUROPEAN JOURNAL OF NUTRITION, 2024, 63 (05) : 1635 - 1649
  • [30] Prediction of type 2 diabetes using genome-wide polygenic risk score and metabolic profiles: A machine learning analysis of population-based 10-year prospective cohort study
    Hahn, Seok-Ju
    Kim, Suhyeon
    Choi, Young Sik
    Lee, Junghye
    Kang, Jihun
    EBIOMEDICINE, 2022, 86