Coupling self-organizing maps with a Naive Bayesian classifier : Stream classification studies using multiple assessment data

Cited by: 17
Authors
Fytilis, Nikolaos [1]
Rizzo, Donna M. [1]
Affiliations
[1] Univ Vermont, Dept Civil & Environm Engn, Coll Engn & Math Sci, Burlington, VT 05405 USA
Funding
National Science Foundation (USA)
Keywords
self-organizing maps; Naive Bayesian classifier; classification; stream habitat health; data assimilation; ARTIFICIAL NEURAL-NETWORKS; WATER-RESOURCES; MYXOBOLUS-CEREBRALIS; WHIRLING DISEASE; FLOW VARIABILITY; UNCERTAINTY; PREDICTION; FRAMEWORK; RIVER; MANAGEMENT;
DOI
10.1002/2012WR013422
Chinese Library Classification
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Organizing or clustering data into natural groups is one of the most fundamental aspects of understanding and mining information. The recent explosion in sensor networks and data storage associated with hydrological monitoring has created a huge potential for automating data analysis and classification of large, high-dimensional data sets. In this work, we develop a new classification tool that couples a Naive Bayesian classifier with a neural network clustering algorithm (the Kohonen Self-Organizing Map (SOM)). The combined Bayesian-SOM algorithm reduces classification error by combining the Bayesian classifier's ability to accommodate parameter uncertainty with the SOM's ability to reduce high-dimensional data to lower dimensions. The resulting algorithm is data-driven and nonparametric, and is as computationally efficient as a Naive Bayesian classifier due to its parallel architecture. We apply, evaluate and test the Bayesian-SOM network using two real-world hydrological data sets. The first uses genetic data to classify the state of disease in native fish populations in the upper Madison River, MT, USA. The second uses stream geomorphic and water quality data measured at approximately 2500 Vermont stream reaches to predict habitat conditions. The new classification tool has substantial benefits over traditional classification methods due to its ability to dynamically update prior information, assess the uncertainty/confidence of the posterior probability values, and visualize both the input data and the resulting probabilistic clusters on two-dimensional maps to better assess nonlinear mappings between the two.
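The abstract does not spell out how the two components are wired together, so the following is only a rough illustration of one plausible coupling, not the authors' algorithm. It assumes a small NumPy SOM that reduces each sample to a best-matching map node, followed by a Bayes-rule step that turns node membership into class posteriors. All function names (train_som, best_units, node_posteriors), the 3x3 grid, the decay schedules, and the synthetic two-class data are hypothetical choices made for this sketch.

```python
import numpy as np

def train_som(data, rows=3, cols=3, epochs=20, lr0=0.5, sigma0=1.5, seed=0):
    """Train a small rectangular SOM (illustrative settings, not from the paper)."""
    rng = np.random.default_rng(seed)
    n, d = data.shape
    weights = rng.random((rows * cols, d))  # one weight vector per map node
    grid = np.array([(r, c) for r in range(rows) for c in range(cols)], dtype=float)
    total, step = epochs * n, 0
    for _ in range(epochs):
        for i in rng.permutation(n):
            frac = step / total
            lr = lr0 * (1.0 - frac)                 # linearly decaying learning rate
            sigma = sigma0 * (1.0 - frac) + 1e-3    # linearly shrinking neighborhood
            bmu = np.argmin(((weights - data[i]) ** 2).sum(axis=1))
            dist2 = ((grid - grid[bmu]) ** 2).sum(axis=1)
            h = np.exp(-dist2 / (2.0 * sigma ** 2))  # Gaussian neighborhood on the grid
            weights += lr * h[:, None] * (data[i] - weights)
            step += 1
    return weights

def best_units(weights, data):
    """Index of the best-matching node for every sample."""
    d2 = ((data[:, None, :] - weights[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)

def node_posteriors(units, labels, n_units, n_classes, prior=None, alpha=1.0):
    """P(class | node) via Bayes' rule, with Laplace-smoothed P(node | class)."""
    counts = np.zeros((n_classes, n_units))
    np.add.at(counts, (labels, units), 1.0)
    likelihood = (counts + alpha) / (counts.sum(axis=1, keepdims=True) + alpha * n_units)
    if prior is None:  # default to empirical class frequencies
        prior = np.bincount(labels, minlength=n_classes) / len(labels)
    joint = likelihood * np.asarray(prior)[:, None]
    return joint / joint.sum(axis=0, keepdims=True)  # shape (n_classes, n_units)

# Synthetic, well-separated two-class data in 4 dimensions (made up for the demo).
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0.2, 0.05, size=(50, 4)),
               rng.normal(0.8, 0.05, size=(50, 4))])
y = np.array([0] * 50 + [1] * 50)

weights = train_som(X)
units = best_units(weights, X)
post = node_posteriors(units, y, n_units=len(weights), n_classes=2)
pred = post[:, units].argmax(axis=0)   # most probable class of each sample's node
accuracy = float((pred == y).mean())
```

Passing a different `prior` to node_posteriors recomputes the posteriors without retraining the map, which is one simple way to picture the prior updating mentioned in the abstract. Reshaping a row of `post` to the 3x3 grid gives a per-class probability map of the kind the abstract describes for visualization.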
Pages: 7747-7762
Page count: 16
Related papers
50 in total
  • [31] Seismic facies analysis from pre-stack data using self-organizing maps
    Kourki, Meysam
    Riahi, Mohammad Ali
    JOURNAL OF GEOPHYSICS AND ENGINEERING, 2014, 11 (06)
  • [32] Using self-organizing maps as unsupervised learning models for meteorological data mining
    Mihai, Andrei
    2020 IEEE 14TH INTERNATIONAL SYMPOSIUM ON APPLIED COMPUTATIONAL INTELLIGENCE AND INFORMATICS (SACI 2020), 2020, : 23 - 28
  • [33] Assessment of the water quality of Klodnica River catchment using self-organizing maps
    Olkowska, Ewa
    Kudlak, Blazej
    Tsakovski, Stefan
    Ruman, Marek
    Simeonov, Vasil
    Polkowska, Zaneta
    SCIENCE OF THE TOTAL ENVIRONMENT, 2014, 476 : 477 - 484
  • [34] Style classification and visualization of art painting's genre using self-organizing maps
    Lee, Sang-Geol
    Cha, Eui-Young
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2016, 6
  • [35] Redefining floristic zones in the Korean Peninsula using high-resolution georeferenced specimen data and self-organizing maps
    Jung, Songhie
    Cho, Yong-chan
    ECOLOGY AND EVOLUTION, 2020, 10 (20) : 11549 - 11564
  • [36] Reducing topological defects in self-organizing maps using multiple scale neighborhood functions
    Murakoshi, Kazushi
    Sato, Yuichi
    BIOSYSTEMS, 2007, 90 (01) : 101 - 104
  • [37] Classification of lung sounds in patients with asthma, emphysema, fibrosing alveolitis and healthy lungs by using self-organizing maps
    Malmberg, LP
    Kallio, K
    Haltsonen, S
    Katila, T
    Sovijarvi, ARA
    CLINICAL PHYSIOLOGY, 1996, 16 (02) : 115 - 129
  • [38] Evaluating Spatial Variability in Sediment and Phosphorus Concentration-Discharge Relationships Using Bayesian Inference and Self-Organizing Maps
    Underwood, Kristen L.
    Rizzo, Donna M.
    Schroth, Andrew W.
    Dewoolkar, Mandar M.
    WATER RESOURCES RESEARCH, 2017, 53 (12) : 10293 - 10316
  • [39] Sustainable Development with Smart Meter Data Analytics Using NoSQL and Self-Organizing Maps
    Oprea, Simona-Vasilica
    Bara, Adela
    Tudorica, Bogdan George
    Dobrita, Gabriela
    SUSTAINABILITY, 2020, 12 (08)
  • [40] Clustering and Analyzing Embedded Software Development Projects Data Using Self-Organizing Maps
    Iwata, Kazunori
    Nakashima, Toyoshiro
    Anan, Yoshiyuki
    Ishii, Naohiro
    SOFTWARE ENGINEERING RESEARCH, MANAGEMENT AND APPLICATIONS 2011, 2012, 377 : 47 - +