Reinforcement learning applied to wastewater treatment process control optimization: Approaches, challenges, and path forward

被引:27
作者
Croll, Henry C. [1 ]
Ikuma, Kaoru [1 ]
Ong, Say Kee [1 ]
Sarkar, Soumik [2 ]
机构
[1] Iowa State Univ, Dept Civil Construct & Environm Engn, Ames, IA 50011 USA
[2] Iowa State Univ, Dept Mech Engn, Ames, IA USA
关键词
Artificial intelligence; control optimization; machine learning; reinforcement learning; wastewater treatment; Hyunjung Kim and Scott Bradford; DISSOLVED-OXYGEN; TREATMENT PLANTS; AERATION CONTROL; ALGORITHM; PERFORMANCE; KNOWLEDGE; DESIGN; MOBILE; ENERGY; COSTS;
D O I
10.1080/10643389.2023.2183699
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Wastewater treatment process control optimization is a complex task in a highly nonlinear environment. Reinforcement learning (RL) is a machine learning technique that stands out for its ability to perform better than human operators for certain high-dimensional, complex decision-making problems, making it an ideal candidate for wastewater treatment process control optimization. However, while RL control optimization strategies have shown potential to provide operational cost savings and effluent quality improvements, RL has proven slow to be adopted among environmental engineers. This review provides an overview of existing RL applications for wastewater treatment control optimization found in literature and evaluates five key challenges that must be addressed prior to widespread adoption: practical RL implementation, managing data, integrating existing process models, building trust in empirical control strategies, and bridging gaps in professional training. Finally, this review discusses potential paths forward to addressing each key challenge, including leveraging soft sensing to improve online data collection, working with process engineers to integrate RL programming with existing industry software, utilizing supervised training to build expert knowledge into the RL agent, and focusing research efforts on known scenarios such as the Benchmark Simulation Model No. 1 to build a robust database of RL agent control optimization results.
引用
收藏
页码:1775 / 1794
页数:20
相关论文
共 91 条
[61]   Reinforcement Learning Control with Deep Deterministic Policy Gradient Algorithm for Multivariable pH Process [J].
Panjapornpon, Chanin ;
Chinchalongporn, Patcharapol ;
Bardeeniz, Santi ;
Makkayatorn, Ratthanita ;
Wongpunnawat, Witchaya .
PROCESSES, 2022, 10 (12)
[62]   Deep reinforcement learning in production systems: a systematic literature review [J].
Panzer, Marcel ;
Bender, Benedict .
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2022, 60 (13) :4316-4341
[63]  
Petersen B., 2002, J HYDROINFOMATICS, V4, P15, DOI [10.2166/hydro.2002.0003,arXiv:https://iwaponline.com/jh/article-pdf/4/1/15/392373, DOI 10.2166/HYDRO.2002.0003,ARXIV:HTTPS://IWAPONLINE.COM/JH/ARTICLE-PDF/4/1/15/392373, 10.2166/hydro.2002.0003, DOI 10.2166/HYDRO.2002.0003]
[64]  
Plappert Matthias., 2016, keras-rl
[65]   Resource Recovery from Wastewater by Biological Technologies: Opportunities, Challenges, and Prospects [J].
Puyol, Daniel ;
Batstone, Damien J. ;
Hulsen, Tim ;
Astals, Sergi ;
Peces, Miriam ;
Kromer, Jens O. .
FRONTIERS IN MICROBIOLOGY, 2017, 7
[66]  
Raffin A, 2021, J MACH LEARN RES, V22, P1
[67]  
Rao Ashwin., 2022, Foundations of reinforcement learning with applications in finance
[68]   Aeration of large-scale municipal wastewater treatment plants: state of the art [J].
Rosso, Diego ;
Larson, Lory E. ;
Stenstrom, Michael K. .
WATER SCIENCE AND TECHNOLOGY, 2008, 57 (07) :973-978
[69]   Ammonia-based aeration control with optimal SRT control: improved performance and lower energy consumption [J].
Schraa, Oliver ;
Rieger, Leiv ;
Alex, Jens ;
Miletic, Ivan .
WATER SCIENCE AND TECHNOLOGY, 2019, 79 (01) :63-72
[70]  
Schrittwieser J, 2021, ADV NEUR IN, V34