Machine learning for anomaly detection in cyanobacterial fluorescence signals

被引:29
作者
Almuhtaram, Husein [1 ]
Zamyadi, Arash [2 ,3 ,4 ]
Hofmann, Ron [1 ]
机构
[1] Univ Toronto, Dept Civil & Mineral Engn, Toronto, ON M5S 1A4, Canada
[2] Water RA Melbourne, 990 La Trobe St, Docklands, Vic 3008, Australia
[3] Univ New South Wales UNSW, BGA Innovat Hub, Sch Civil & Environm Engn, Sydney, NSW 2052, Australia
[4] Univ New South Wales UNSW, Water Res Ctr, Sch Civil & Environm Engn, Sydney, NSW 2052, Australia
基金
加拿大自然科学与工程研究理事会;
关键词
Phycocyanin; Drinking water treatment; Cyanobacteria; Monitoring; Artificial intelligence; Chlorophyll a; CCchHlo C; WESTERN LAKE-ERIE; WATER-TREATMENT; ESTABLISHMENT; MANAGEMENT; PROBES; RISK;
D O I
10.1016/j.watres.2021.117073
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Many drinking water utilities drawing from waters susceptible to harmful algal blooms (HABs) are imple-menting monitoring tools that can alert them to the onset of blooms. Some have invested in fluorescence-based online monitoring probes to measure phycocyanin, a pigment found in cyanobacteria, but it is not clear how to best use the data generated. Previous studies have focused on correlating phycocyanin flu-orescence and cyanobacteria cell counts. However, not all utilities collect cell count data, making this method impossible to apply in some cases. Instead, this paper proposes a novel approach to determine when a utility needs to respond to a HAB based on machine learning by identifying anomalies in phyco-cyanin fluorescence data without the need for corresponding cell counts or biovolume. Four widespread and open source algorithms are evaluated on data collected at four buoys in Lake Erie from 2014 to 2019: local outlier factor (LOF), One-Class Support Vector Machine (SVM), elliptic envelope, and Isolation Forest (iForest). When trained on standardized historical data from 2014 to 2018 and tested on labelled 2019 data collected at each buoy, the One-Class SVM and elliptic envelope models both achieve a maximum average F1 score of 0.86 among the four datasets. Therefore, One-Class SVM and elliptic envelope are promising algorithms for detecting potential HABs using fluorescence data only. (c) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:10
相关论文
共 68 条
[51]   Machine learning approaches for anomaly detection of water quality on a real-world data set* [J].
Muharemi, Fitore ;
Logofatu, Doina ;
Leon, Florin .
JOURNAL OF INFORMATION AND TELECOMMUNICATION, 2019, 3 (03) :294-307
[52]   Review-Deep Learning Methods for Sensor Based Predictive Maintenance and Future Perspectives for Electrochemical Sensors [J].
Namuduri, Srikanth ;
Narayanan, Barath Narayanan ;
Davuluru, Venkata Salini Priyamvada ;
Burton, Lamar ;
Bhansali, Shekhar .
JOURNAL OF THE ELECTROCHEMICAL SOCIETY, 2020, 167 (03)
[53]   Is qPCR a Reliable Indicator of Cyanotoxin Risk in Freshwater? [J].
Pacheco, Ana Beatriz F. ;
Guedes, Iame A. ;
Azevedo, Sandra M. F. O. .
TOXINS, 2016, 8 (06)
[54]  
Pedregosa F, 2011, J MACH LEARN RES, V12, P2825
[55]   A fast algorithm for the minimum covariance determinant estimator [J].
Rousseeuw, PJ ;
Van Driessen, K .
TECHNOMETRICS, 1999, 41 (03) :212-223
[57]   Automated active fault detection in fouled dissolved oxygen sensors [J].
Samuelsson, Oscar ;
Zambrano, Jesus ;
Bjork, Anders ;
Carlsson, Bengt .
WATER RESEARCH, 2019, 166
[58]   Estimating the support of a high-dimensional distribution [J].
Schölkopf, B ;
Platt, JC ;
Shawe-Taylor, J ;
Smola, AJ ;
Williamson, RC .
NEURAL COMPUTATION, 2001, 13 (07) :1443-1471
[59]   Applying high-frequency surrogate measurements and a wavelet-ANN model to provide early warnings of rapid surface water quality anomalies [J].
Shi, Bin ;
Wang, Peng ;
Jiang, Jiping ;
Liu, Rentao .
SCIENCE OF THE TOTAL ENVIRONMENT, 2018, 610 :1390-1399
[60]   Monitoring Approaches for a Toxic Cyanobacterial Bloom [J].
Srivastava, Ankita ;
Singh, Shweta ;
Ahn, Chi-Yong ;
Oh, Hee-Mock ;
Asthana, Ravi Kumar .
ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2013, 47 (16) :8999-9013