Domain adaptation through active learning strategies for anomaly classification in wastewater treatment plants

被引:0
|
作者
Bellamoli, Francesca [1 ,2 ]
Vian, Marco [2 ]
Di Iorio, Mattia [3 ]
Melgani, Farid [1 ]
机构
[1] Univ Trento, Dept Informat Engn & Comp Sci, Via Sommar 9, I-38123 Trento, Italy
[2] ETC Sustainable Solut Srl, Via Palustei 16, I-38121 Trento, Italy
[3] D-3 Srl, Via Palustei 16, I-38121 Trento, Italy
关键词
active learning; domain adaptation; gradient boosting; intermittent aeration; multiclass classification; wastewater treatment plants;
D O I
10.2166/wst.2024.387
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The increasing use of intermittent aeration controllers in wastewater treatment plants (WWTPs) aims to reduce aeration costs via continuous ammonia and oxygen measurements but faces challenges in detecting sensor and process anomalies. Applying machine learning to this unbalanced, multivariate, multiclass classification challenge requires much data, difficult to obtain from a new plant. This study develops a machine learning algorithm to identify anomalies in intermittent aeration WWTPs, adaptable to new plants with limited data. Utilizing active learning, the method iteratively selects samples from the target domain to fine-tune a gradient-boosting model initially trained on data from 17 plants. Three sampling strategies were tested, with low probability and high entropy sampling proving effective in early adaptation, achieving an F2-score close to the optimal with minimal sample use. The objective is to deploy these models as decision support systems for WWTP management, providing a strategy for efficient model adaptation to new plants, and optimizing labeling efforts.
引用
收藏
页码:3123 / 3138
页数:16
相关论文
共 50 条
  • [21] BOOSTING FOR DOMAIN ADAPTATION EXTREME LEARNING MACHINES FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Xia, Junshi
    Yokoya, Naoto
    Iwasaki, Akira
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 3615 - 3618
  • [22] Applying domain knowledge to the discovery of operating situations in wastewater treatment plants
    Bejar, J
    Cortes, U
    Sanchez, M
    Gimeno, JM
    Poch, M
    INTELLIGENT INFORMATION SYSTEMS, (IIS'97) PROCEEDINGS, 1997, : 360 - 364
  • [23] Active Learning for Domain Classification in a Commercial Spoken Personal Assistant
    Chen, Xi C.
    Sagar, Adithya
    Kao, Justine T.
    Li, Tony Y.
    Klein, Christopher
    Pulman, Stephen
    Garg, Ashish
    Williams, Jason D.
    INTERSPEECH 2019, 2019, : 1478 - 1482
  • [24] Multi-domain Active Learning for Semi-supervised Anomaly Detection
    Vercruyssen, Vincent
    Perini, Lorenzo
    Meert, Wannes
    Davis, Jesse
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT IV, 2023, 13716 : 485 - 501
  • [25] Domain-adaptation-based active ensemble learning for improving chemical sensor array performance
    Yan, Jia
    Sun, Ruihong
    Liu, Tao
    Duan, Shukai
    SENSORS AND ACTUATORS A-PHYSICAL, 2023, 357
  • [26] Active Learning Query Strategies for Classification, Regression, and Clustering: A Survey
    Punit Kumar
    Atul Gupta
    Journal of Computer Science and Technology, 2020, 35 : 913 - 945
  • [27] Active Learning Query Strategies for Classification, Regression, and Clustering: A Survey
    Kumar, Punit
    Gupta, Atul
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2020, 35 (04) : 913 - 945
  • [28] Active Learning Strategies and Convolutional Neural Networks for Mammogram Classification
    Tozato, Joao Marcelo
    Bugatti, Pedro Henrique
    Maeda Saito, Priscila Tiemi
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING (ICAISC 2021), PT II, 2021, 12855 : 126 - 134
  • [29] Sustainable wastewater treatment plants design through multiobjective optimization
    Padron-Paez, Juan I.
    De-Leon Almaraz, Sofia
    Roman-Martinez, Alicia
    COMPUTERS & CHEMICAL ENGINEERING, 2020, 140
  • [30] Multi-source domain adaptation with joint learning for cross-domain sentiment classification
    Zhao, Chuanjun
    Wang, Suge
    Li, Deyu
    KNOWLEDGE-BASED SYSTEMS, 2020, 191