TLT: Recurrent fine-tuning transfer learning for water quality long-term prediction

被引:63
作者
Peng, Lin [1 ,2 ]
Wu, Huan [3 ,4 ]
Gao, Min [1 ,2 ]
Yi, Hualing [2 ]
Xiong, Qingyu [2 ]
Yang, Linda [5 ]
Cheng, Shuiping [3 ]
机构
[1] Chongqing Univ, Minist Educ, Key Lab Dependable Serv Comp Cyber Phys Soc, Chongqing 401331, Peoples R China
[2] Chongqing Univ, Sch Big Data & Software Engn, Chongqing 400044, Peoples R China
[3] Tongji Univ, Coll Environm Sci & Engn, Shanghai 200092, Peoples R China
[4] TY Lin Int Engn Consulting China Co Ltd, Chongqing 401121, Peoples R China
[5] Univ Portsmouth, Sch Comp, Portsmouth O1 3HE, Hants, England
关键词
Water quality prediction; Transformer; Transfer learning; Deep leaning; Stations with limited samples; NEURAL-NETWORKS; MODELS;
D O I
10.1016/j.watres.2022.119171
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The water quality long-term prediction is essential to water environment management decisions. In recent years, although water quality prediction methods based on deep learning have achieved excellent performance in shortterm prediction, these methods are unsuitable for long-term prediction because the accumulation use of shortterm prediction will easily introduce noise. Furthermore, The long-term prediction task requires a large amount of data to train the model to obtain accurate prediction results. For some monitoring stations with limited historical data, it is challenging to fully exploit the performance of deep learning models. To this end, we introduce a transfer learning framework into water quality prediction to improve the prediction performance in data-constrained scenarios. We propose a deep Transfer Learning based on Transformer (TLT) model to enable time dependency perception and facilitate long-term water quality prediction. In TLT, we innovatively introduce a recurrent fine-tuning transfer learning method, which can transfer the knowledge learned from source monitoring stations to the target station, while preventing the deep learning model from overfitting the source data during the pre-training phase. So, TLT can fully exert the performance of deep learning models with limited samples. We conduct experiments on data from 120 monitoring stations in major rivers and lakes in China to verify the effectiveness of TLT. The results show that TLT can effectively improve the long-term prediction accuracy of four water quality indicators (pH, DO, NH3-N, and CODMn) from monitoring stations with limited samples.
引用
收藏
页数:12
相关论文
共 43 条
[1]   Machine learning methods for better water quality prediction [J].
Ahmed, Ali Najah ;
Othman, Faridah Binti ;
Afan, Haitham Abdulmohsin ;
Ibrahim, Rusul Khaleel ;
Fai, Chow Ming ;
Hossain, Md Shabbir ;
Ehteram, Mohammad ;
Elshafie, Ahmed .
JOURNAL OF HYDROLOGY, 2019, 578
[2]   Accurate prediction of protein structures and interactions using a three-track neural network [J].
Baek, Minkyung ;
DiMaio, Frank ;
Anishchenko, Ivan ;
Dauparas, Justas ;
Ovchinnikov, Sergey ;
Lee, Gyu Rie ;
Wang, Jue ;
Cong, Qian ;
Kinch, Lisa N. ;
Schaeffer, R. Dustin ;
Millan, Claudia ;
Park, Hahnbeom ;
Adams, Carson ;
Glassman, Caleb R. ;
DeGiovanni, Andy ;
Pereira, Jose H. ;
Rodrigues, Andria V. ;
van Dijk, Alberdina A. ;
Ebrecht, Ana C. ;
Opperman, Diederik J. ;
Sagmeister, Theo ;
Buhlheller, Christoph ;
Pavkov-Keller, Tea ;
Rathinaswamy, Manoj K. ;
Dalwadi, Udit ;
Yip, Calvin K. ;
Burke, John E. ;
Garcia, K. Christopher ;
Grishin, Nick V. ;
Adams, Paul D. ;
Read, Randy J. ;
Baker, David .
SCIENCE, 2021, 373 (6557) :871-+
[3]  
Banejad H., 2011, The Journal of American Science, V7, P60
[4]   Machine learning models and bankruptcy prediction [J].
Barboza, Flavio ;
Kimura, Herbert ;
Altman, Edward .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 83 :405-417
[5]   Short-term water quality variable prediction using a hybrid CNN-LSTM deep learning model [J].
Barzegar, Rahim ;
Aalami, Mohammad Taghi ;
Adamowski, Jan .
STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2020, 34 (02) :415-433
[6]  
Brockwell P.J., 2002, Introduction to time series and forecasting
[7]   Artificial neural networks for estimating regional arsenic concentrations in a blackfoot disease area in Taiwan [J].
Chang, Fi-John ;
Kao, Li-shan ;
Kuo, Yi-Ming ;
Liu, Chen-Wuing .
JOURNAL OF HYDROLOGY, 2010, 388 (1-2) :65-76
[8]   Comparative analysis of surface water quality prediction performance and identification of key water parameters using different machine learning models based on big data [J].
Chen, Kangyang ;
Chen, Hexia ;
Zhou, Chuanlong ;
Huang, Yichao ;
Qi, Xiangyang ;
Shen, Ruqin ;
Liu, Fengrui ;
Zuo, Min ;
Zou, Xinyi ;
Wang, Jinfeng ;
Zhang, Yan ;
Chen, Da ;
Chen, Xingguo ;
Deng, Yongfeng ;
Ren, Hongqiang .
WATER RESEARCH, 2020, 171
[9]   Probabilistic machine learning and artificial intelligence [J].
Ghahramani, Zoubin .
NATURE, 2015, 521 (7553) :452-459
[10]   A survey of deep learning techniques for autonomous driving [J].
Grigorescu, Sorin ;
Trasnea, Bogdan ;
Cocias, Tiberiu ;
Macesanu, Gigel .
JOURNAL OF FIELD ROBOTICS, 2020, 37 (03) :362-386