Machine learning predictions of chlorophyll-a in the Han river basin, Korea

被引:49
作者
Kim, Kyung-Min [1 ]
Ahn, Johng-Hwa [1 ,2 ]
机构
[1] Kangwon Natl Univ, Dept Integrated Energy & Infra Syst, Chunchon 24341, Gangwon Do, South Korea
[2] Kangwon Natl Univ, Coll Engn, Dept Environm Engn, Chunchon 24341, Gangwon Do, South Korea
关键词
Artificial intelligence; Chlorophyll-a; Feature importance; Han river basin; Machine learning; Random forest; NEURAL-NETWORK MODEL; DISSOLVED-OXYGEN; NITRATE UPTAKE; FRESH-WATER; NITROGEN; LAKES; TEMPERATURE; PHOSPHORUS; MICROALGAE; VARIABLES;
D O I
10.1016/j.jenvman.2022.115636
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This study developed a model to predict concentrations of chlorophyll-a ([Chl-a]) as a proxy for algal population with data from multiple monitoring stations in the Han river basin, by using machine-learning predictive models, then analyzed the relationship between [Chl-a] and the input variables of the optimized model. Daily water quality and meteorological data from 2012 to 2020 were collected from the real-time water quality information system and the meteorological administration of Korea. To quantify model accuracy, the coefficient of determination, root mean square error, and mean absolute error were applied. Among random forest (RF), support vector machine, and artificial neural network, the RF with random dataset showed the highest accuracy. The RF was optimized when 78 trees were applied to the model. Input variables for the best RF model were total organic carbon (feature importance: 27%), total nitrogen (19%), pH (13%), water temperature (8%), total phosphorus (8%), electrical conductivity (7%), dissolved oxygen (6%), minimum air temperature (AT) (4%), mean AT (3%), and maximum AT (3%). The feature-importance analysis showed that total organic carbon was the most important variable to predict [Chl-a] in the Han river basin. Total nitrogen was a more important variable than total phosphorus.
引用
收藏
页数:8
相关论文
共 62 条
[1]   Measurement and mathematical modelling of nutrient level and water quality parameters [J].
Alasl, M. Kashefi ;
Khosravi, M. ;
Hosseini, M. ;
Pazuki, G. R. ;
Zadeh, R. Nezakati Esmail .
WATER SCIENCE AND TECHNOLOGY, 2012, 66 (09) :1962-1967
[2]  
[Anonymous], 2010, WDS 10 P CONTR PAP
[3]   Dynamic metabolic modeling of heterotrophic and mixotrophic microalgal growth on fermentative wastes [J].
Baroukh, Caroline ;
Turon, Violette ;
Bernard, Olivier .
PLOS COMPUTATIONAL BIOLOGY, 2017, 13 (06)
[4]   Short-term water quality variable prediction using a hybrid CNN-LSTM deep learning model [J].
Barzegar, Rahim ;
Aalami, Mohammad Taghi ;
Adamowski, Jan .
STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2020, 34 (02) :415-433
[5]   Model predictive control of pH in tubular photobioreactors [J].
Berenguel, M ;
Rodríguez, F ;
Acién, FG ;
García, JL .
JOURNAL OF PROCESS CONTROL, 2004, 14 (04) :377-387
[6]   Relevant factors in the eutrophication of the Uruguay River and the Rio Negro [J].
Beretta-Blanco, Andres ;
Carrasco-Letelier, Leonidas .
SCIENCE OF THE TOTAL ENVIRONMENT, 2021, 761
[7]   Modeling continuous cultures of microalgae colimited by nitrogen and phosphorus [J].
Bougaran, Gael ;
Bernard, Olivier ;
Sciandra, Antoine .
JOURNAL OF THEORETICAL BIOLOGY, 2010, 265 (03) :443-454
[8]   Phytoplankton bloom status: Chlorophyll a biomass as an indicator of water quality condition in the southern estuaries of Florida, USA [J].
Boyer, Joseph N. ;
Kelble, Christopher R. ;
Ortner, Peter B. ;
Rudnick, David T. .
ECOLOGICAL INDICATORS, 2009, 9 :S56-S67
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]   Prediction of contamination potential of groundwater arsenic in Cambodia, Laos, and Thailand using artificial neural network [J].
Cho, Kyung Hum ;
Sthiannopkao, Suthipong ;
Pachepsky, Yakou A. ;
Kim, Kyoung-Woong ;
Kim, Joon Ha .
WATER RESEARCH, 2011, 45 (17) :5535-5544