Machine Learning-Assisted, Process-Based Quality Control for Detecting Compromised Environmental Sensors

被引:7
作者
Schmidt, Jacquelyn Q. Q. [1 ]
Kerkez, Branko [1 ]
机构
[1] Univ Michigan, Dept Civil & Environm Engn, Ann Arbor, MI 48109 USA
基金
美国国家科学基金会;
关键词
data quality control and assurance; machinelearning; environmental sensors; automated datavalidation; wireless sensor networks; TIME-SERIES; WATER-QUALITY; VALIDATION; FRAMEWORK; STREAM;
D O I
10.1021/acs.est.3c00360
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This study presents a machine learning-assisted data qualitycontrol methodology for environmental sensor data. This process-constrainedmethodology is shown to be more robust than existing state-of-the-artsin detecting faulty data. Machine learning (ML) techniquespromise to revolutionize environmentalresearch and management, but collecting the necessary volumes of high-qualitydata remains challenging. Environmental sensors are often deployedunder harsh conditions, requiring labor-intensive quality assuranceand control (QAQC) processes. The need for manual QAQC is a majorimpediment to the scalability of these sensor networks. Existing techniquesfor automated QAQC make strong assumptions about noise profiles inthe data they filter that do not necessarily hold for broadly deployedenvironmental sensors, however. Toward the goal of increasing thevolume of high-quality environmental data, we introduce an ML-assistedQAQC methodology that is robust to low signal-to-noise ratio data.Our approach embeds sensor measurements into a dynamical feature spaceand trains a binary classification algorithm (Support Vector Machine)to detect deviation from expected process dynamics, indicating whethera sensor has become compromised and requires maintenance. This strategyenables the automated detection of a wide variety of nonphysical signals.We apply the methodology to three novel data sets produced by 136low-cost environmental sensors (stream level, drinking water pH, anddrinking water electroconductivity), deployed by our group across250,000 km(2) in Michigan, USA. The proposed methodologyachieved accuracy scores of up to 0.97 and consistently outperformedstate-of-the-art anomaly detection techniques.
引用
收藏
页码:18058 / 18066
页数:9
相关论文
共 44 条
[1]   Advanced monitoring of water systems using in situ measurement stations: data validation and fault detection [J].
Alferes, Janelcy ;
Tik, Sovanna ;
Copp, John ;
Vanrolleghem, Peter A. .
WATER SCIENCE AND TECHNOLOGY, 2013, 68 (05) :1022-1030
[2]   Outlier Detection and Smoothing Process for Water Level Data Measured by Ultrasonic Sensor in Stream Flows [J].
Bae, Inhyeok ;
Ji, Un .
WATER, 2019, 11 (05)
[3]   Open storm: a complete framework for sensing and control of urban watersheds [J].
Bartos, Matthew ;
Wong, Brandon ;
Kerkez, Branko .
ENVIRONMENTAL SCIENCE-WATER RESEARCH & TECHNOLOGY, 2018, 4 (03) :346-358
[4]   Incorporating Low-Cost Sensor Measurements into High-Resolution PM2.5 Modeling at a Large Spatial Scale [J].
Bi, Jianzhao ;
Wildani, Avani ;
Chang, Howard H. ;
Liu, Yang .
ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2020, 54 (04) :2152-2162
[6]  
Brezonik P. L., 2018, CHEM KINETICS PROCES
[7]   Quantity is Nothing without Quality: Automated QA/QC for Streaming Environmental Sensor Data [J].
Campbell, John L. ;
Rustad, Lindsey E. ;
Porter, John H. ;
Taylor, Jeffrey R. ;
Dereszynski, Ethan W. ;
Shanley, James B. ;
Gries, Corinna ;
Henshaw, Donald L. ;
Martin, Mary E. ;
Sheldon, Wade M. ;
Boose, Emery R. .
BIOSCIENCE, 2013, 63 (07) :574-585
[8]   Anomaly Detection: A Survey [J].
Chandola, Varun ;
Banerjee, Arindam ;
Kumar, Vipin .
ACM COMPUTING SURVEYS, 2009, 41 (03)
[9]   A Critical Review for Real-Time Continuous Soil Monitoring: Advantages, Challenges, and Perspectives [J].
Fan, Yingzheng ;
Wang, Xingyu ;
Funk, Thomas ;
Rashid, Ishrat ;
Herman, Brianna ;
Bompoti, Nefeli ;
Mahmud, Shaad ;
Chrysochoou, Maria ;
Yang, Meijian ;
Vadas, Timothy M. ;
Lei, Yu ;
Li, Baikun .
ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2022, 56 (19) :13546-13564
[10]   Fault detection in level and flow rate sensors for safe and performant remote-control in a water supply system [J].
Fellini, Sofia ;
Vesipa, Riccardo ;
Boano, Fulvio ;
Ridolfi, Luca .
JOURNAL OF HYDROINFORMATICS, 2020, 22 (01) :132-147