Identification of driving factors of algal growth in the South-to-North Water Diversion Project by Transformer-based deep learning

被引:8
作者
Qian, Jing [1 ]
Pu, Nan [2 ]
Qian, Li [3 ]
Xue, Xiaobai [4 ]
Bi, Yonghong [5 ]
Norra, Stefan [6 ]
机构
[1] Karlsruhe Inst Technol, Inst Appl Geosci, D-76131 Karlsruhe, Germany
[2] Leiden Univ, Inst Adv Comp Sci, NL-2333 CA Leiden, Netherlands
[3] Ludwig Maximilian Univ Munich, Inst Informat, D-80538 Munich, Germany
[4] Yingtou Informat Technol Shanghai Ltd, MioTech Res, Shanghai 200120, Peoples R China
[5] Chinese Acad Sci, State Key Lab Freshwater Ecol & Biotechnol, Inst Hydrobiol, Wuhan 430072, Peoples R China
[6] Potsdam Univ, Inst Environm Sci & Geog, Soil Sci & Geoecol, D-14476 Potsdam Golm, Germany
来源
WATER BIOLOGY AND SECURITY | 2023年 / 2卷 / 03期
关键词
Algal growth; Deep learning; Driving factor determination; Model interpretability; Transformer; CHLOROPHYLL-A; FRESH-WATER; PHYTOPLANKTON; RIVER;
D O I
10.1016/j.watbs.2023.100184
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Accurate and credible identification of the drivers of algal growth is essential for sustainable utilization and scientific management of freshwater. In this study, we developed a deep learning-based Transformer model, named Bloomformer-1, for end-to-end identification of the drivers of algal growth without the needing extensive a priori knowledge or prior experiments. The Middle Route of the South-to-North Water Diversion Project (MRP) was used as the study site to demonstrate that Bloomformer-1 exhibited more robust performance (with the highest R2, 0.80 to 0.94, and the lowest RMSE, 0.22-0.43 mu g/L) compared to four widely used traditional machine learning models, namely extra trees regression (ETR), gradient boosting regression tree (GBRT), support vector regression (SVR), and multiple linear regression (MLR). In addition, Bloomformer-1 had higher interpretability (including higher transferability and understandability) than the four traditional machine learning models, which meant that it was trustworthy and the results could be directly applied to real scenarios. Finally, it was determined that total phosphorus (TP) was the most important driver for the MRP, especially in Henan section of the canal, although total nitrogen (TN) had the highest effect on algal growth in the Hebei section. Based on these results, phosphorus loading controlling in the whole MRP was proposed as an algal control strategy.
引用
收藏
页数:10
相关论文
共 48 条
  • [1] Agarap A. F., 2018, Deep learning using rectified linear units (relu)
  • [2] ASTM, 1993, Standard Practices for Measurement of Chlorophyll Content of Algae in Surface Waters, P3731
  • [3] Awad M., 2015, EFFICIENT LEARNING M, P67, DOI [10.1007/978-1-4302-5990-9_4, DOI 10.1007/978-1-4302-5990-9_4]
  • [4] A review of methods for analysing spatial and temporal patterns in coastal water quality
    Bierman, Paul
    Lewis, Megan
    Ostendorf, Bertram
    Tanner, Jason
    [J]. ECOLOGICAL INDICATORS, 2011, 11 (01) : 103 - 114
  • [5] Lipton ZC, 2017, Arxiv, DOI [arXiv:1606.03490, 10.48550/arXiv.1606.03490]
  • [6] Chauhan NK, 2018, 2018 INTERNATIONAL CONFERENCE ON COMPUTING, POWER AND COMMUNICATION TECHNOLOGIES (GUCON), P340
  • [7] Machine learning based marine water quality prediction for coastal hydro-environment management
    Deng, Tianan
    Chau, Kwok-Wing
    Duan, Huan-Feng
    [J]. JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2021, 284
  • [8] Du XD, 2016, 2016 31ST YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), P159, DOI 10.1109/YAC.2016.7804882
  • [9] Fan ZH, 2021, 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), P1692
  • [10] Greedy function approximation: A gradient boosting machine
    Friedman, JH
    [J]. ANNALS OF STATISTICS, 2001, 29 (05) : 1189 - 1232