Coupling self-organizing maps with a Naive Bayesian classifier: Stream classification studies using multiple assessment data

Cited by: 17
Authors
Fytilis, Nikolaos [1]
Rizzo, Donna M. [1]
Affiliations
[1] Univ Vermont, Dept Civil & Environm Engn, Coll Engn & Math Sci, Burlington, VT 05405 USA
Funding
U.S. National Science Foundation
Keywords
self-organizing maps; Naive Bayesian classifier; classification; stream habitat health; data assimilation; ARTIFICIAL NEURAL-NETWORKS; WATER-RESOURCES; MYXOBOLUS-CEREBRALIS; WHIRLING DISEASE; FLOW VARIABILITY; UNCERTAINTY; PREDICTION; FRAMEWORK; RIVER; MANAGEMENT;
DOI
10.1002/2012WR013422
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Organizing or clustering data into natural groups is one of the most fundamental aspects of understanding and mining information. The recent explosion in sensor networks and data storage associated with hydrological monitoring has created a huge potential for automating data analysis and classification of large, high-dimensional data sets. In this work, we develop a new classification tool that couples a Naive Bayesian classifier with a neural network clustering algorithm, the Kohonen Self-Organizing Map (SOM). The combined Bayesian-SOM algorithm reduces classification error by combining the Naive Bayesian classifier's ability to accommodate parameter uncertainty with the SOM's ability to reduce high-dimensional data to lower dimensions. The resulting algorithm is data-driven and nonparametric and, owing to its parallel architecture, is as computationally efficient as a Naive Bayesian classifier. We apply, evaluate and test the Bayesian-SOM network using two real-world hydrological data sets. The first uses genetic data to classify the state of disease in native fish populations in the upper Madison River, MT, USA. The second uses stream geomorphic and water quality data measured at approximately 2500 Vermont stream reaches to predict habitat conditions. The new classification tool has substantial benefits over traditional classification methods because it can dynamically update prior information, assess the uncertainty/confidence of the posterior probability values, and visualize both the input data and the resulting probabilistic clusters on two-dimensional maps to better assess nonlinear mappings between the two.
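The abstract does not spell out how the two components are wired together, so the following is only a minimal sketch of one plausible coupling, not the authors' implementation. It assumes a small SOM trained with NumPy, treats each sample's best-matching unit (BMU) as a single discrete feature, and applies Bayes' rule with Laplace smoothing to obtain a class posterior for every map node. The function names (`train_som`, `bmu_index`, `fit_node_posteriors`), the grid size, the learning schedule, and the synthetic two-class data are all illustrative assumptions.

```python
import numpy as np

def train_som(X, rows=4, cols=4, n_iter=2000, lr0=0.5, sigma0=2.0, seed=0):
    """Train a small rectangular SOM; returns codebook weights (rows*cols, n_features)."""
    rng = np.random.default_rng(seed)
    # Grid coordinates of every node, used for the neighborhood function.
    grid = np.array([(r, c) for r in range(rows) for c in range(cols)], dtype=float)
    # Initialize the codebook from randomly chosen samples (an assumption).
    W = X[rng.integers(0, len(X), size=rows * cols)].astype(float).copy()
    for t in range(n_iter):
        frac = t / n_iter
        lr = lr0 * (1.0 - frac)                      # learning rate decays to 0
        sigma = sigma0 * (1.0 - frac) + 0.5 * frac   # radius shrinks from sigma0 to 0.5
        x = X[rng.integers(0, len(X))]
        bmu = np.argmin(((W - x) ** 2).sum(axis=1))  # best-matching unit
        d2 = ((grid - grid[bmu]) ** 2).sum(axis=1)   # squared grid distance to BMU
        h = np.exp(-d2 / (2.0 * sigma ** 2))         # Gaussian neighborhood
        W += lr * h[:, None] * (x - W)
    return W

def bmu_index(W, X):
    """Index of the closest codebook vector for each row of X."""
    d2 = ((X[:, None, :] - W[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)

def fit_node_posteriors(nodes, y, n_nodes, n_classes, alpha=1.0):
    """P(class | node) from Bayes' rule with Laplace smoothing (alpha)."""
    counts = np.zeros((n_classes, n_nodes))
    np.add.at(counts, (y, nodes), 1.0)               # class-by-node co-occurrence counts
    prior = (counts.sum(axis=1) + alpha) / (counts.sum() + alpha * n_classes)
    likelihood = (counts + alpha) / (counts.sum(axis=1, keepdims=True) + alpha * n_nodes)
    joint = prior[:, None] * likelihood
    return joint / joint.sum(axis=0, keepdims=True)  # shape (n_classes, n_nodes)

# Synthetic stand-in data: two well-separated classes in three dimensions.
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0.0, 0.5, (100, 3)), rng.normal(3.0, 0.5, (100, 3))])
y = np.array([0] * 100 + [1] * 100)

W = train_som(X)
nodes = bmu_index(W, X)
post = fit_node_posteriors(nodes, y, n_nodes=len(W), n_classes=2)
pred = post[:, nodes].argmax(axis=0)   # most probable class for each sample's node
accuracy = (pred == y).mean()
```

Reshaping a row of `post` to the map's `(rows, cols)` grid gives a per-class probability map, which is one simple way to view probabilistic clusters on a two-dimensional lattice as the abstract describes. Updating the priors would only require recomputing `prior` from new counts and renormalizing.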
Pages: 7747-7762
Number of pages: 16