Determination of the optimal parameters in regression models for the prediction of chlorophyll-a: A case study of the Yeongsan Reservoir, Korea

被引:40
作者
Cho, Kyung Hwa [1 ]
Kang, Joo-Hyon [1 ]
Ki, Seo Jin [1 ]
Park, Yongeun [1 ]
Cha, Sung Min [1 ]
Kim, Joon Ha [1 ,2 ]
机构
[1] GIST, Dept Environm Sci & Engn, Kwangju 500712, South Korea
[2] GIST, Sustainable Water Resource Technol Ctr, Kwangju 500712, South Korea
关键词
Principal Component Regression (PCR); Multiple Linear Regression (MLR); F-overall statistics; Optimal parameters; Chlorophyll-a Prediction; Uncertainty; LAKE EUTROPHICATION; WATER-QUALITY; MANAGEMENT; RIVER; ALGAE;
D O I
10.1016/j.scitotenv.2009.01.017
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Statistical regression models involve linear equations, which often lead to significant prediction errors due to poor statistical stability and accuracy. This concern arises from multicollinearity in the models, which may drastically affect model performance in terms of a trade-off scenario for effective water resource management logistics. In this paper, we propose a new methodology for improving the statistical stability and accuracy of regression models, and then show how to cope with pitfalls in the models and determine optimal parameters with a decreased number of predictive variables. Here, a comparison of the predictive performance was made using four types of multiple linear regression (MLR) and principal component regression (PCR) models in the prediction of chlorophyll-a (chl-a) concentration in the Yeongsan (YS) Reservoir, Korea, an estuarine reservoir that historically suffers from high levels of nutrient input. During a 3-year water quality monitoring period, results showed that PCRs could be a compact solution for improving the accuracy of the models, as in each case MLR could not accurately produce reliable predictions due to a persistent collinearity problem. Furthermore, based on R 2 (goodness of fit) and F-overall number (confidence of regression), and the number of explanatory variables (R-F-N) curve, it was revealed that PCR-F(7) was the best model among the four regression models in predicting chl-a, having the fewest explanatory variables (seven) and the lowest uncertainty. Seven PCs were identified as significant variables, related to eight water quality parameters: pH, 5-day biochemical oxygen demand, total coliform, fecal indicator bacteria, chemical oxygen demand, ammonia-nitrogen, total nitrogen, and dissolved oxygen. Overall, the results not only demonstrated that the models employed successfully simulated chl-a in a reservoir in both the test and validation periods, but also suggested that the optimal parameters should cautiously be considered in the design of regression models. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:2536 / 2545
页数:10
相关论文
共 10 条
  • [1] Application of feature selection and regression models for chlorophyll-a prediction in a shallow lake
    Li, Xue
    Sha, Jian
    Wang, Zhong-Liang
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2018, 25 (20) : 19488 - 19498
  • [2] Use of principal component scores in multiple linear regression models for prediction of Chlorophyll-a in reservoirs
    Çamdevyren, H
    Demyr, N
    Kanik, A
    Keskyn, S
    ECOLOGICAL MODELLING, 2005, 181 (04) : 581 - 589
  • [3] Identification and Quantification of Physicochemical Parameters Influencing Chlorophyll-a Concentrations through Combined Principal Component Analysis and Factor Analysis: A Case Study of the Yuqiao Reservoir in China
    Chen, Ran
    Ju, Meiting
    Chu, Chunli
    Jing, Weiqiang
    Wang, Yuqiu
    SUSTAINABILITY, 2018, 10 (04)
  • [4] Monitoring Chlorophyll-a concentration in karst plateau lakes using Sentinel 2 imagery from a case study of pingzhai reservoir in Guizhou, China
    Li, Yongliu
    Zhou, Zhongfa
    Kong, Jie
    Wen, Chaocheng
    Li, Shaohui
    Zhang, Yongrong
    Xie, Jiangting
    Wang, Cui
    EUROPEAN JOURNAL OF REMOTE SENSING, 2022, 55 (01) : 1 - 19
  • [5] Comparing deep learning with several typical methods in prediction of assessing chlorophyll-a by remote sensing: a case study in Taihu Lake, China
    Zhao, Xiaolan
    Xu, Haoli
    Ding, Zhibin
    Wang, Daqing
    Deng, Zhengdong
    Wang, Yi
    Wu, Tingfong
    Li, Wei
    Lu, Zhao
    Wang, Guangyuan
    WATER SUPPLY, 2021, 21 (07) : 3710 - 3724
  • [6] Predicting chlorophyll-a using Landsat 8 OLI sensor data and the non-linear RANSAC method - a case study of Nakdong River, South Korea
    Kim, Hyeong Hun
    Ko, Byoung Chul
    Nam, Jae Yeal
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2016, 37 (14) : 3255 - 3271
  • [7] Prediction of wastewater quality parameters using adaptive and machine learning models: A South African case study
    Sheik, Abdul Gaffar
    Malla, Muneer Ahmad
    Srungavarapu, Chandra Sainadh
    Patan, Ameer Khan
    Kumari, Sheena
    Bux, Faizal
    JOURNAL OF WATER PROCESS ENGINEERING, 2024, 67
  • [8] Comparison of conventional and machine learning regression models for accurate prediction of selected optical active components - A case study: The Gulf of Izmit
    Sunar, Filiz
    Dervisoglu, A.
    Yagmur, N.
    Atabay, H.
    Donertas, A.
    MARINE POLLUTION BULLETIN, 2024, 208
  • [9] Designing key parameters of bubble plumes to restore water quality by using conceptual models: a case study in a sub-deep reservoir
    Lan, Chen
    Chen, Jingan
    Guo, Jianyang
    Wang, Jingfu
    WATER SUPPLY, 2020, 20 (07) : 2915 - 2927
  • [10] Prediction of Physico-Chemical Parameters of Surface Waters Using Autoregressive Moving Average Models: A Case Study of Kis-Balaton Water Protection System, Hungary
    Kovacs, Zsofia
    Tarcsay, Balint Levente
    Toth, Piroska
    Juhasz, Csenge Judit
    Nemeth, Sandor
    Shahrokhi, Amin
    WATER, 2024, 16 (16)