Structuring Nutrient Yields throughout Mississippi/Atchafalaya River Basin Using Machine Learning Approaches

被引:2
作者
Zhen, Yi [1 ]
Feng, Huan [2 ]
Yoo, Shinjae [3 ]
机构
[1] Southern Univ New Orleans, Dept Nat Sci, New Orleans, LA 70126 USA
[2] Montclair State Univ, Dept Earth & Environm Studies, Montclair, NJ 07043 USA
[3] Brookhaven Natl Lab, Computat Sci Initiat, Upton, NY 11973 USA
关键词
Mississippi/Atchafalaya River Basin; principal component analysis (PCA); t-distributed stochastic neighbor embedding (t-SNE); clustering analysis (CA); surface water quality; nutrient yields; WATER-QUALITY; TREND ANALYSIS; UNCERTAINTY; FLUXES; STATE;
D O I
10.3390/environments10090162
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
To minimize the eutrophication pressure along the Gulf of Mexico or reduce the size of the hypoxic zone in the Gulf of Mexico, it is important to understand the underlying temporal and spatial variations and correlations in excess nutrient loads, which are strongly associated with the formation of hypoxia. This study's objective was to reveal and visualize structures in high-dimensional datasets of nutrient yield distributions throughout the Mississippi/Atchafalaya River Basin (MARB). For this purpose, the annual mean nutrient concentrations were collected from thirty-three US Geological Survey (USGS) water stations scattered in the upper and lower MARB from 1996 to 2020. Eight surface water quality indicators were selected to make comparisons among water stations along the MARB over the past two decades. Principal component analysis (PCA) was used to comprehensively evaluate the nutrient yields across thirty-three USGS monitoring stations and identify the major contributing nutrient loads. The results showed that all samples could be analyzed using two main components, which accounted for 81.6% of the total variance. The PCA results showed that yields of orthophosphate (OP), silica (SI), nitrate-nitrites (NO3-NO2), and total suspended sediment (TSS) are major contributors to nutrient yields. It also showed that land-planted crops, density of population, domestic and industrial discharges, and precipitation are fundamental causes of excess nutrient loads in MARB. These factors are of great significance for the excess nutrient load management and pollution control of the Mississippi River. It was found that the average nutrient yields were stable within the sub-MARB area, but the large nitrogen yields in the upper MARB and the large phosphorus yields in the lower MARB were of great concern. t-distributed stochastic neighbor embedding (t-SNE) revealed interesting nonlinear and local structures in nutrient yield distributions. Clustering analysis (CA) showed the detailed development of similarities in the nutrient yield distribution. Moreover, PCA, t-SNE, and CA showed consistent clustering results. This study demonstrated that the integration of dimension reduction techniques, PCA, and t-SNE with CA techniques in machine learning are effective tools for the visualization of the structures of the correlations in high-dimensional datasets of nutrient yields and provide a comprehensive understanding of the correlations in the distributions of nutrient loads across the MARB.
引用
收藏
页数:14
相关论文
共 40 条
[1]   Trends in the nutrient enrichment of US rivers during the late 20th century and their relation to changes in probable stream trophic conditions [J].
Alexander, Richard B. ;
Smith, Richard A. .
LIMNOLOGY AND OCEANOGRAPHY, 2006, 51 (01) :639-654
[2]  
[Anonymous], Report EPA-822-B-00-002
[3]   Statistical and trend analysis of water quality and quantity data for the Strymon River in Greece [J].
Antonopoulos, VZ ;
Papamichail, DM ;
Mitsiou, KA .
HYDROLOGY AND EARTH SYSTEM SCIENCES, 2001, 5 (04) :679-691
[4]  
Arsham Hossein., 2011, INT ENCY STAT SCI, P87, DOI [10.1007/978-3-642-04898-2_132, DOI 10.1007/978-3-642-04898-2_132]
[5]   STUDY OF A MEASURE OF SAMPLING ADEQUACY FOR FACTOR-ANALYTIC CORRELATION MATRICES [J].
CERNY, BA ;
KAISER, HF .
MULTIVARIATE BEHAVIORAL RESEARCH, 1977, 12 (01) :43-47
[6]   Water quality assessment based on multivariate statistics and water quality index of a strategic river in the Brazilian Atlantic Forest [J].
Costa, David de Andrade ;
Soares de Azevedo, Jose Paulo ;
dos Santos, Marco Aurelio ;
Facchetti Vinhaes Assumpcao, Rafaela dos Santos .
SCIENTIFIC REPORTS, 2020, 10 (01)
[7]   Sources of Nitrate Yields in the Mississippi River Basin [J].
David, Mark B. ;
Drinkwater, Laurie E. ;
Mclsaac, Gregory F. .
JOURNAL OF ENVIRONMENTAL QUALITY, 2010, 39 (05) :1657-1667
[8]   Characterisation and assessment of spatiotemporal variations in nutrient concentrations and fluxes in an urban watershed: Passaic River Basin, New Jersey, USA [J].
Du, Jinglong ;
Feng, Huan ;
Nie, Jing ;
Li, Yuanyi ;
Witherell, Benjamin B. .
INTERNATIONAL JOURNAL OF ENVIRONMENT AND POLLUTION, 2018, 63 (03) :154-177
[9]   Use of water quality index and multivariate statistical techniques for the assessment of spatial variations in water quality of a small river [J].
Dutta, Smita ;
Dwivedi, Ajay ;
Kumar, M. Suresh .
ENVIRONMENTAL MONITORING AND ASSESSMENT, 2018, 190 (12)
[10]   Seasonal differences in trace element concentrations and distribution in Spartina alterniflora root tissue [J].
Feng, Huan ;
Qian, Yu ;
Cochran, J. Kirk ;
Zhu, Qingzhi ;
Heilbrun, Christina ;
Li, Li ;
Hu, Wen ;
Yan, Hanfei ;
Huang, Xiaojing ;
Ge, Mingyuan ;
Nazareski, Evgeny ;
Chu, Yong S. ;
Yoo, Shinjae ;
Zhang, Xuebin ;
Liu, Chang-Jun .
CHEMOSPHERE, 2018, 204 :359-370