A Nonparametric Clustering Algorithm with a Quantile-Based Likelihood Estimator

Cited by: 4
Authors
Hino, Hideitsu [1]
Murata, Noboru [2]
Affiliations
[1] Univ Tsukuba, Grad Sch Syst & Informat Engn, Tsukuba, Ibaraki 3058573, Japan
[2] Waseda Univ, Sch Sci & Engn, Shinjuku Ku, Tokyo 1698555, Japan
DOI
10.1162/NECO_a_00628
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Clustering is a representative unsupervised learning task and an important approach in exploratory data analysis. By its very nature, clustering that does not rely on strong assumptions about the data distribution is desirable. Information-theoretic clustering is a class of clustering methods that optimize information-theoretic quantities such as entropy and mutual information. These quantities can be estimated nonparametrically, and information-theoretic clustering algorithms can capture a variety of intrinsic data structures. Information-theoretic quantities can also be estimated from a data set in which each datum carries a sampling weight. By assuming that the data set is sampled from a certain cluster and assigning different sampling weights depending on the cluster, cluster-conditional information-theoretic quantities are estimated. In this letter, a simple iterative clustering algorithm is proposed based on a nonparametric estimator of the log likelihood for weighted data sets. The clustering algorithm is also derived from the principle of conditional entropy minimization with maximum entropy regularization. The proposed algorithm does not contain a tuning parameter. Experiments show that the algorithm performs comparably to or better than conventional nonparametric clustering methods.
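As a rough illustration of what estimating an information-theoretic quantity nonparametrically can look like, the sketch below implements the classical Kozachenko-Leonenko k-nearest-neighbor estimator of differential entropy. It is not the quantile-based likelihood estimator proposed in the letter, whose details the abstract does not give. The function name `knn_entropy` and the parameter `k` are assumptions made for this example only.

```python
import math

import numpy as np


def knn_entropy(x, k=1):
    """Kozachenko-Leonenko k-NN estimate of differential entropy, in nats.

    Illustrative only: a standard nonparametric estimator, not the
    quantile-based estimator of the letter. Assumes no duplicate points
    (a zero neighbor distance would give log(0)).
    """
    x = np.asarray(x, dtype=float)
    if x.ndim == 1:
        x = x[:, None]  # treat a flat array as n one-dimensional samples
    n, d = x.shape

    # All pairwise Euclidean distances (O(n^2) memory, fine for small n).
    diff = x[:, None, :] - x[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    np.fill_diagonal(dist, np.inf)  # exclude each point from its own neighbors

    # Distance from each point to its k-th nearest neighbor.
    rho = np.partition(dist, k - 1, axis=1)[:, k - 1]

    # Log volume of the d-dimensional unit ball: pi^(d/2) / Gamma(d/2 + 1).
    log_unit_ball = (d / 2) * math.log(math.pi) - math.lgamma(d / 2 + 1)

    # For integers, digamma(n) - digamma(k) equals sum_{j=k}^{n-1} 1/j.
    digamma_gap = sum(1.0 / j for j in range(k, n))

    return digamma_gap + log_unit_ball + d * float(np.mean(np.log(rho)))
```

For samples drawn from a standard normal distribution, the estimate should land near the analytic value 0.5 * log(2 * pi * e). No distributional form is assumed by the estimator itself, which is the property the abstract refers to.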
Pages: 2074-2101
Number of pages: 28