Gaussian Clusters and Noise: An Approach Based on the Minimum Description Length Principle

Cited by: 0
Authors
Luosto, Panu [1]
Kivinen, Jyrki [1]
Mannila, Heikki [2]
Affiliations
[1] Univ Helsinki, Dept Comp Sci, FIN-00014 Helsinki, Finland
[2] Aalto Univ, Dept Informat & Comp Sci, Helsinki, Finland
Source
DISCOVERY SCIENCE, DS 2010 | 2010 / Vol. 6332
Keywords
STOCHASTIC COMPLEXITY; INFORMATION;
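DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract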
We introduce a well-grounded minimum description length (MDL) based quality measure for a clustering consisting of either spherical or axis-aligned normally distributed clusters and a cluster with a uniform distribution in an axis-aligned rectangular box. The uniform component extends the practical usability of the model, e.g., in the presence of noise, and using the MDL principle for the model selection makes comparing the quality of clusterings with a different number of clusters possible. We also introduce a novel search heuristic for finding the best clustering with an unknown number of clusters. The heuristic is based on the idea of moving points from the Gaussian clusters to the uniform one and using MDL for determining the optimal amount of noise. Tests with synthetic data having a clear cluster structure imply that the search method is effective in finding the intuitively correct clustering.
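The idea in the abstract of moving points from Gaussian clusters to a uniform noise component and scoring each split by code length can be illustrated with a rough sketch. The code below is not the paper's quality measure: it assumes a crude BIC-style two-part code (negative log-likelihood plus half a log n per parameter), spherical clusters only, a noise box fixed to the bounding box of the whole data set, and cluster means that are not updated as points leave. The names `code_length`, `best_noise_split`, `MIN_VAR`, and `MIN_SIDE` are hypothetical and chosen only for illustration.

```python
import math

MIN_VAR = 1e-6   # assumed data precision; keeps log(variance) finite
MIN_SIDE = 1e-6  # assumed minimum side length of the noise box


def code_length(points, labels, n_gauss):
    """Crude two-part code length (in nats) of a clustering.

    labels[i] is a cluster index in range(n_gauss), or -1 for the uniform
    (noise) component. BIC-style stand-in, NOT the measure from the paper.
    """
    n, d = len(points), len(points[0])
    # Assumption: the uniform component covers the bounding box of all data.
    log_volume = sum(
        math.log(max(max(p[j] for p in points) - min(p[j] for p in points), MIN_SIDE))
        for j in range(d)
    )
    total = 0.0
    for c in [-1] + list(range(n_gauss)):
        members = [p for p, l in zip(points, labels) if l == c]
        m = len(members)
        if m == 0:
            continue
        total += -m * math.log(m / n)  # cost of encoding which component each point is in
        if c == -1:
            total += m * log_volume  # each noise point costs log(box volume)
        else:
            mean = [sum(p[j] for p in members) / m for j in range(d)]
            sq = sum((p[j] - mean[j]) ** 2 for p in members for j in range(d))
            var = max(sq / (m * d), MIN_VAR)
            # Negative log-likelihood of a spherical Gaussian with ML mean
            # and (floored) ML variance.
            total += 0.5 * m * d * math.log(2 * math.pi * var) + sq / (2 * var)
    n_params = n_gauss * (d + 1) + 2 * d  # means, variances, box corners
    return total + 0.5 * n_params * math.log(n)


def best_noise_split(points, labels, n_gauss):
    """Move points to noise, farthest from their initial cluster mean first,
    and return the labelling with the shortest code length seen."""
    d = len(points[0])
    means = {}
    for c in range(n_gauss):
        members = [p for p, l in zip(points, labels) if l == c]
        if members:
            means[c] = [sum(p[j] for p in members) / len(members) for j in range(d)]

    def dist2(i):
        return sum((points[i][j] - means[labels[i]][j]) ** 2 for j in range(d))

    order = sorted((i for i, l in enumerate(labels) if l != -1), key=dist2, reverse=True)
    current = list(labels)
    best_labels, best_len = list(current), code_length(points, current, n_gauss)
    for i in order:
        current[i] = -1  # reassign this point to the uniform component
        length = code_length(points, current, n_gauss)
        if length < best_len:
            best_labels, best_len = list(current), length
    return best_labels, best_len


# Toy data: a tight 5 x 4 grid of points plus two distant outliers.
points = [(i * 0.1, j * 0.1) for i in range(5) for j in range(4)]
points += [(10.0, 10.0), (-10.0, 8.0)]
labels, length = best_noise_split(points, [0] * len(points), n_gauss=1)
print(labels.count(-1), "points assigned to noise")
```

On this toy data the shortest code length is reached when only the two outliers are treated as noise: coding an outlier inside the Gaussian inflates the variance for every point, while coding a grid point as uniform noise costs more than coding it with the tight Gaussian. The variance floor matters because a continuous-density code length can otherwise decrease without bound as a cluster shrinks.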
Pages: 251-265
Number of pages: 15