Coupling self-organizing maps with a Naive Bayesian classifier : Stream classification studies using multiple assessment data

Cited by: 17
Authors
Fytilis, Nikolaos [1]
Rizzo, Donna M. [1]
Affiliations
[1] Univ Vermont, Dept Civil & Environm Engn, Coll Engn & Math Sci, Burlington, VT 05405 USA
Funding
National Science Foundation (USA);
Keywords
self-organizing maps; Naive Bayesian classifier; classification; stream habitat health; data assimilation; ARTIFICIAL NEURAL-NETWORKS; WATER-RESOURCES; MYXOBOLUS-CEREBRALIS; WHIRLING DISEASE; FLOW VARIABILITY; UNCERTAINTY; PREDICTION; FRAMEWORK; RIVER; MANAGEMENT;
DOI
10.1002/2012WR013422
Chinese Library Classification number
X [Environmental science, safety science];
Discipline classification code
08 ; 0830 ;
Abstract
Organizing or clustering data into natural groups is one of the most fundamental aspects of understanding and mining information. The recent explosion in sensor networks and data storage associated with hydrological monitoring has created a huge potential for automating data analysis and classification of large, high-dimensional data sets. In this work, we develop a new classification tool that couples a Naive Bayesian classifier with a neural network clustering algorithm (i.e., the Kohonen Self-Organizing Map (SOM)). The combined Bayesian-SOM algorithm reduces classification error by combining the Bayesian classifier's ability to accommodate parameter uncertainty with the SOM's ability to reduce high-dimensional data to lower dimensions. The resulting algorithm is data-driven, nonparametric, and as computationally efficient as a Naive Bayesian classifier due to its parallel architecture. We apply, evaluate and test the Bayesian-SOM network using two real-world hydrological data sets. The first uses genetic data to classify the state of disease in native fish populations in the upper Madison River, MT, USA. The second uses stream geomorphic and water quality data measured at approximately 2500 Vermont stream reaches to predict habitat conditions. The new classification tool has substantial benefits over traditional classification methods due to its ability to dynamically update prior information, assess the uncertainty/confidence of the posterior probability values, and visualize both the input data and resulting probabilistic clusters on two-dimensional maps to better assess nonlinear mappings between the two.
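The coupling described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not specify how the two components are connected, so the code assumes one plausible arrangement in which a Naive Bayesian classifier turns each sample into a vector of class posterior probabilities and a small Kohonen SOM then arranges those vectors on a two-dimensional grid. The Gaussian likelihoods, the synthetic two-class data, the 3 x 3 grid, the learning-rate schedule, and all function names (`fit_gaussian_nb`, `posterior`, `train_som`, `best_matching_unit`) are assumptions made for illustration.

```python
import math
import random


def fit_gaussian_nb(X, y):
    """Estimate a prior, per-feature means and variances for each class."""
    model = {}
    for c in sorted(set(y)):
        rows = [x for x, label in zip(X, y) if label == c]
        dim = len(rows[0])
        means = [sum(r[j] for r in rows) / len(rows) for j in range(dim)]
        # Small constant keeps variances strictly positive (assumption).
        variances = [
            sum((r[j] - means[j]) ** 2 for r in rows) / len(rows) + 1e-6
            for j in range(dim)
        ]
        model[c] = (len(rows) / len(X), means, variances)
    return model


def posterior(model, x):
    """Return P(class | x), ordered by class label, assuming independent features."""
    log_scores = {}
    for c, (prior, means, variances) in model.items():
        s = math.log(prior)
        for xj, m, v in zip(x, means, variances):
            s += -0.5 * math.log(2 * math.pi * v) - (xj - m) ** 2 / (2 * v)
        log_scores[c] = s
    top = max(log_scores.values())  # subtract the max for numerical stability
    scores = {c: math.exp(s - top) for c, s in log_scores.items()}
    total = sum(scores.values())
    return [scores[c] / total for c in sorted(scores)]


def best_matching_unit(weights, v):
    """Grid node whose weight vector is closest (squared Euclidean) to v."""
    return min(
        weights,
        key=lambda node: sum((a - b) ** 2 for a, b in zip(weights[node], v)),
    )


def train_som(vectors, rows=3, cols=3, epochs=50, seed=0):
    """Train a small rectangular SOM with a Gaussian neighbourhood function."""
    rng = random.Random(seed)
    dim = len(vectors[0])
    weights = {
        (r, c): [rng.random() for _ in range(dim)]
        for r in range(rows)
        for c in range(cols)
    }
    for epoch in range(epochs):
        frac = 1 - epoch / epochs                  # decays from 1 towards 0
        lr = 0.5 * frac                            # learning rate
        radius = max(rows, cols) / 2 * frac + 0.5  # neighbourhood radius
        for v in vectors:
            bmu = best_matching_unit(weights, v)
            for node, w in weights.items():
                d2 = (node[0] - bmu[0]) ** 2 + (node[1] - bmu[1]) ** 2
                h = math.exp(-d2 / (2 * radius ** 2))
                for j in range(dim):
                    w[j] += lr * h * (v[j] - w[j])
    return weights


# Synthetic two-class data (purely illustrative, not from the paper).
data_rng = random.Random(1)
X = [[data_rng.gauss(0, 0.5), data_rng.gauss(0, 0.5)] for _ in range(20)]
X += [[data_rng.gauss(3, 0.5), data_rng.gauss(3, 0.5)] for _ in range(20)]
y = [0] * 20 + [1] * 20

model = fit_gaussian_nb(X, y)
posts = [posterior(model, x) for x in X]  # Bayesian step: samples -> posteriors
som = train_som(posts)                    # SOM step: posteriors -> 2-D grid

for x in ([0.0, 0.0], [3.0, 3.0]):
    p = posterior(model, x)
    print(x, [round(v, 3) for v in p], "-> grid node", best_matching_unit(som, p))
```

In this arrangement the grid position of a sample reflects how confidently the Bayesian step assigns it to each class, which is one way to read the abstract's remark about visualizing probabilistic clusters on two-dimensional maps.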
Pages: 7747 - 7762
Page count: 16
Related papers
50 in total
  • [1] Quality assessment of data discrimination using self-organizing maps
    Mekler, Alexey
    Schwarz, Dmitri
    JOURNAL OF BIOMEDICAL INFORMATICS, 2014, 51 : 210 - 218
  • [2] Self-organizing maps in chemotaxonomic studies of Asteraceae: a classification of tribes using flavonoid data
    Emerenciano, Vicente R.
    Barbosa, Karina O.
    Scotti, Marcus T.
    Ferreira, Marcelo J. R.
    JOURNAL OF THE BRAZILIAN CHEMICAL SOCIETY, 2007, 18 (05) : 891 - 899
  • [3] Spatiotemporal classification of environmental monitoring data in the Yeongsan River basin, Korea, using self-organizing maps
    Jin, Y. -H.
    Kawamura, A.
    Park, S. -C.
    Nakagawa, N.
    Amaguchi, H.
    Olsson, J.
    JOURNAL OF ENVIRONMENTAL MONITORING, 2011, 13 (10) : 2886 - 2894
  • [4] BAYESIAN SELF-ORGANIZING MAP FOR DATA CLASSIFICATION AND CLUSTERING
    Guo, Xiaolian
    Wang, Haiying
    Glass, David H.
    INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2013, 11 (05)
  • [5] Unsupervised Classification of Audio Signals by Self-Organizing Maps and Bayesian Labeling
    Cruz, Ricardo
    Ortiz, Andres
    Barbancho, Ana M.
    Barbancho, Isabel
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, PT I, 2012, 7208 : 61 - 70
  • [6] Application of multiple self-organizing maps for classification of soil samples in Thailand according to their geographic origins
    Krongchai, Chanida
    Funsueb, Sujitra
    Jakmunee, Jaroon
    Kittiwachana, Sila
    JOURNAL OF CHEMOMETRICS, 2017, 31 (02)
  • [7] Multiple outlier detection in multivariate data using self-organizing maps
    Nag, AK
    Mitra, A
    Mitra, S
    COMPUTATIONAL STATISTICS, 2005, 20 (02) : 245 - 264
  • [8] Multiple outlier detection in multivariate data using self-organizing maps
    Ashok K. Nag
    Amit Mitra
    Sharmishtha Mitra
    Computational Statistics, 2005, 20 : 245 - 264
  • [9] Surface water quality assessment using self-organizing maps and Hasse diagram technique
    Voyslavov, Tsvetomil
    Tsakovski, Stefan
    Simeonov, Vasil
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2012, 118 : 280 - 286
  • [10] Classification of urban soils for forensic purposes using supervised self-organizing maps
    Idrizi, Hirijete
    Najdoski, Metodija
    Kuzmanovski, Igor
    JOURNAL OF CHEMOMETRICS, 2021, 35 (04)