Systematic tracking of nitrogen sources in complex river catchments: Machine learning approach based on microbial metagenomics

被引:16
作者
Zhu, Ziqian [1 ,2 ]
Ding, Junjie [1 ,2 ]
Du, Ran [1 ,2 ]
Zhang, Zehua [3 ]
Guo, Jiayin [4 ]
Li, Xiaodong [1 ,2 ]
Jiang, Longbo [1 ,2 ]
Chen, Gaojie [5 ]
Bu, Qiurong [6 ]
Tang, Ning [1 ,2 ]
Lu, Lan [1 ,2 ]
Gao, Xiang [1 ,2 ]
Li, Weixiang [1 ,2 ]
Li, Shuai [1 ,2 ]
Zeng, Guangming [1 ,2 ]
Liang, Jie [1 ,2 ]
机构
[1] Hunan Univ, Coll Environm Sci & Engn, Changsha 410082, Peoples R China
[2] Hunan Univ, Minist Educ, Key Lab Environm Biol & Pollut Control, Changsha 410082, Peoples R China
[3] Hunan Univ, Ctr Econ Finance & Management Studies, Changsha 410082, Peoples R China
[4] Hunan Univ Technol & Business, Sch Resources & Environm, Changsha 410205, Peoples R China
[5] Hunan Univ, Sch Math, Changsha 410082, Peoples R China
[6] Natl Engn Res Ctr Adv Technol & Equipment Water En, Changsha 410205, Peoples R China
基金
中国国家自然科学基金;
关键词
River nitrogen pollution pattern; Pollution source tracking; Metagenomic data; Random forest; POLLUTION;
D O I
10.1016/j.watres.2024.121255
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Tracking nitrogen pollution sources is crucial for the effective management of water quality; however, it is a challenging task due to the complex contaminative scenarios in the freshwater systems. The contaminative pattern variations can induce quick responses of aquatic microorganisms, making them sensitive indicators of pollution origins. In this study, the soil and water assessment tool, accompanied by a detailed pollution source database, was used to detect the main nitrogen pollution sources in each sub-basin of the Liuyang River watershed. Thus, each sub-basin was assigned to a known class according to SWAT outputs, including point source pollution-dominated area, crop cultivation pollution-dominated area, and the septic tank pollutiondominated area. Based on these outputs, the random forest (RF) model was developed to predict the main pollution sources from different river ecosystems using a series of input variable groups (e.g., natural macroscopic characteristics, river physicochemical properties, 16S rRNA microbial taxonomic composition, microbial metagenomic data containing taxonomic and functional information, and their combination). The accuracy and the Kappa coefficient were used as the performance metrics for the RF model. Compared with the prediction performance among all the input variable groups, the prediction performance of the RF model was significantly improved using metagenomic indices as inputs. Among the metagenomic data-based models, the combination of the taxonomic information with functional information of all the species achieved the highest accuracy (0.84) and increased median Kappa coefficient (0.70). Feature importance analysis was used to identify key features that could serve as indicators for sudden pollution accidents and contribute to the overall function of the river system. The bacteria Rhabdochromatium marinum, Frankia, Actinomycetia, and Competibacteraceae were the most important species, whose mean decrease Gini indices were 0.0023, 0.0021, 0.0019, and 0.0018, respectively, although their relative abundances ranged only from 0.0004 to 0.1 %. Among the top 30 important variables, functional variables constituted more than half, demonstrating the remarkable variation in the microbial functions among sites with distinct pollution sources and the key role of functionality in predicting pollution sources. Many functional indicators related to the metabolism of Mycobacterium tuberculosis, such as K24693, K25621, K16048, and K14952, emerged as significant important factors in distinguishing nitrogen pollution origins. With the shortage of pollution source data in developing regions, this suggested approach offers an economical, quick, and accurate solution to locate the origins of water nitrogen pollution using the metagenomic data of microbial communities.
引用
收藏
页数:13
相关论文
共 55 条
[1]   Ecosystem-wide metagenomic binning enables prediction of ecological niches from genomes [J].
Alneberg, Johannes ;
Bennke, Christin ;
Beier, Sara ;
Bunse, Carina ;
Quince, Christopher ;
Ininbergs, Karolina ;
Riemann, Lasse ;
Ekman, Martin ;
Juergens, Klaus ;
Labrenz, Matthias ;
Pinhassi, Jarone ;
Andersson, Anders F. .
COMMUNICATIONS BIOLOGY, 2020, 3 (01)
[2]   BIOLOGY OF FRANKIA STRAINS, ACTINOMYCETE SYMBIONTS OF ACTINORHIZAL PLANTS [J].
BENSON, DR ;
SILVESTER, WB .
MICROBIOLOGICAL REVIEWS, 1993, 57 (02) :293-319
[3]   A new evolutionary scenario for the Mycobacterium tuberculosis complex [J].
Brosch, R ;
Gordon, SV ;
Marmiesse, M ;
Brodin, P ;
Buchrieser, C ;
Eiglmeier, K ;
Garnier, T ;
Gutierrez, C ;
Hewinson, G ;
Kremer, K ;
Parsons, LM ;
Pym, AS ;
Samper, S ;
van Soolingen, D ;
Cole, ST .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (06) :3684-3689
[4]   The synergy of Fe(III) and NO2- drives the anaerobic oxidation of methane [J].
Chen, Weiqi ;
Yu, Xiuling ;
Huang, Juan ;
Zhao, Wurong ;
Ju, Jinwei ;
Ye, Jinshao ;
Qin, Huaming ;
Long, Yan .
SCIENCE OF THE TOTAL ENVIRONMENT, 2022, 837
[5]   Shifts in the high-resolution spatial distribution of dissolved N2O and the underlying microbial communities and processes in the Pearl River Estuary [J].
Cheng, Xiang ;
Dong, Yue ;
Fan, Fuqiang ;
Xiao, Shangbin ;
Liu, Jia ;
Wang, Shengrui ;
Lin, Wei ;
Zhou, Chunyang .
WATER RESEARCH, 2023, 243
[6]   Sources and behavior of ammonium during riverbank filtration [J].
Covatti, Gustavo ;
Grischek, Thomas .
WATER RESEARCH, 2021, 191
[7]   Non-point source fecal contamination from aging wastewater infrastructure is a primary driver of antibiotic resistance in surface waters [J].
Damashek, Julian ;
Westrich, Jason R. ;
McDonald, Jacob M. Bateman ;
Teachey, Morgan E. ;
Jackson, Charlene R. ;
Frye, Jonathan G. ;
Lipp, Erin K. ;
Capps, Krista A. ;
Ottesen, Elizabeth A. .
WATER RESEARCH, 2022, 222
[8]  
DILLING W, 1995, ARCH MICROBIOL, V164, P125
[9]   Drought-induced changes in hydrological and phenological interactions modulate waterbird habitats dynamics [J].
Gao, Xiang ;
Liang, Jie ;
Zhu, Ziqian ;
Li, Weixiang ;
Lu, Lan ;
Li, Xin ;
Li, Shuai ;
Tang, Ning ;
Li, Xiaodong .
JOURNAL OF HYDROLOGY, 2023, 626
[10]   Mycobacterial species and their contribution to cholesterol degradation in wastewater treatment plants [J].
Guo, Feng ;
Zhang, Tong ;
Li, Bing ;
Wang, Zhiping ;
Ju, Feng ;
Liang, Yi-ting .
SCIENTIFIC REPORTS, 2019, 9 (1)