A novel water quality data analysis framework based on time-series data mining

被引:38
作者
Deng, Weihui [1 ,2 ]
Wang, Guoyin [1 ]
机构
[1] Chinese Acad Sci, Chongqing Key Lab Big Data & Intelligent Comp, Chongqing Inst Green & Intelligent Technol, Chongqing 400714, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Time-series data mining; Cloud model; Water quality analysis; Similarity measure; Anomaly detection; Pattern discovery; MODEL; CLASSIFICATION; REPRESENTATION;
D O I
10.1016/j.jenvman.2017.03.024
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The rapid development of time-series data mining provides an emerging method for water resource management research. In this paper, based on the time-series data mining methodology, we propose a novel and general analysis framework for water quality time-series data. It consists of two parts: implementation components and common tasks of time-series data mining in water quality data. In the first part, we propose to granulate the time series into several two-dimensional normal clouds and calculate the similarities in the granulated level. On the basis of the similarity matrix, the similarity search, anomaly detection, and pattern discovery tasks in the water quality time-series instance dataset can be easily implemented in the second part. We present a case study of this analysis framework on weekly Dissolve Oxygen time-series data collected from five monitoring stations on the upper reaches of Yangtze River, China. It discovered the relationship of water quality in the mainstream and tributary as well as the main changing patterns of DO. The experimental results show that the proposed analysis framework is a feasible and efficient method to mine the hidden and valuable knowledge from water quality historical time-series data. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:365 / 375
页数:11
相关论文
共 35 条
[1]  
[Anonymous], 2013, 2013 INT C COMPUTER
[2]  
[Anonymous], 1977, J MARKETING RES
[3]  
Asadi A., 2016, J ENV MANAG
[4]   Clustering flood events from water quality time series using Latent Dirichlet Allocation model [J].
Aubert, A. H. ;
Tavenard, R. ;
Emonet, R. ;
de Lavenne, A. ;
Malinowski, S. ;
Guyet, T. ;
Quiniou, R. ;
Odobez, J-M ;
Merot, P. ;
Gascuel-Odoux, C. .
WATER RESOURCES RESEARCH, 2013, 49 (12) :8187-8199
[5]   Time-Series Classification with COTE: The Collective of Transformation-Based Ensembles [J].
Bagnall, Anthony ;
Lines, Jason ;
Hills, Jon ;
Bostrom, Aaron .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (09) :2522-2535
[6]   A tree-based statistical classification algorithm (CHAID) for identifying variables responsible for the occurrence of faecal indicator bacteria during waterworks operations [J].
Bichler, Andrea ;
Neumaier, Arnold ;
Hofmann, Thilo .
JOURNAL OF HYDROLOGY, 2014, 519 :909-917
[7]   Regional analysis of groundwater droughts using hydrograph classification [J].
Bloomfield, J. P. ;
Marchant, B. P. ;
Bricker, S. H. ;
Morgan, R. B. .
HYDROLOGY AND EARTH SYSTEM SCIENCES, 2015, 19 (10) :4327-4344
[8]   Real-time deployment of artificial neural network forecasting models: Understanding the range of applicability [J].
Bowden, Gavin J. ;
Maier, Holger R. ;
Dandy, Graeme C. .
WATER RESOURCES RESEARCH, 2012, 48
[9]   A hybrid evolutionary data driven model for river water quality early warning [J].
Burchard-Levine, Alejandra ;
Liu, Shuming ;
Vince, Francois ;
Li, Mingming ;
Ostfeld, Avi .
JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2014, 143 :8-16
[10]   Modeling water quality in an urban river using hydrological factors - Data driven approaches [J].
Chang, Fi-John ;
Tsai, Yu-Hsuan ;
Chen, Pin-An ;
Coynel, Alexandra ;
Vachaud, Georges .
JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2015, 151 :87-96