A Nonparametric Clustering Algorithm with a Quantile-Based Likelihood Estimator

被引:4
|
作者
Hino, Hideitsu [1 ]
Murata, Noboru [2 ]
机构
[1] Univ Tsukuba, Grad Sch Syst & Informat Engn, Tsukuba, Ibaraki 3058573, Japan
[2] Waseda Univ, Sch Sci & Engn, Shinjuku Ku, Tokyo 1698555, Japan
关键词
D O I
10.1162/NECO_a_00628
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering is a representative of unsupervised learning and one of the important approaches in exploratory data analysis. By its very nature, clustering without strong assumption on data distribution is desirable. Information-theoretic clustering is a class of clustering methods that optimize information-theoretic quantities such as entropy and mutual information. These quantities can be estimated in a nonparametric manner, and information-theoretic clustering algorithms are capable of capturing various intrinsic data structures. It is also possible to estimate information-theoretic quantities using a data set with sampling weight for each datum. Assuming the data set is sampled from a certain cluster and assigning different sampling weights depending on the clusters, the cluster-conditional information-theoretic quantities are estimated. In this letter, a simple iterative clustering algorithm is proposed based on a nonparametric estimator of the log likelihood for weighted data sets. The clustering algorithm is also derived from the principle of conditional entropy minimization with maximum entropy regularization. The proposed algorithm does not contain a tuning parameter. The algorithm is experimentally shown to be comparable to or outperform conventional nonparametric clustering methods.
引用
收藏
页码:2074 / 2101
页数:28
相关论文
共 50 条
  • [1] Quantile-based clustering
    Hennig, Christian
    Viroli, Cinzia
    Anderlucci, Laura
    ELECTRONIC JOURNAL OF STATISTICS, 2019, 13 (02): : 4849 - 4883
  • [2] Nonparametric estimation of quantile-based entropy function
    Subhash, Silpa
    Sunoj, S. M.
    Sankaran, P. G.
    Rajesh, G.
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (05) : 1805 - 1821
  • [3] The quantile-based empirical likelihood for the difference of quantiles
    Dai, Lichun
    Liu, Pengfei
    Liu, Yiming
    Yang, Guangren
    STATISTICS & PROBABILITY LETTERS, 2025, 216
  • [4] Nonparametric frontier estimation: A conditional quantile-based approach
    Aragon, Y
    Daouia, A
    Thomas-Agnan, C
    ECONOMETRIC THEORY, 2005, 21 (02) : 358 - 389
  • [5] Quantile-Based Maximum Likelihood Training for Outlier Detection
    Taghikhah, Masoud
    Kumar, Nishant
    Segvic, Sinisa
    Eslami, Abouzar
    Gumhold, Stefan
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024, : 21610 - 21618
  • [6] Quantile-based nonparametric inference for first-price auctions
    Marmer, Vadim
    Shneyerov, Artyom
    JOURNAL OF ECONOMETRICS, 2012, 167 (02) : 345 - 357
  • [7] Nonparametric "anti-Bayesian" quantile-based pattern classification
    Mahmoudi, Fatemeh
    Razmkhah, Mostafa
    Oommen, B. John
    PATTERN ANALYSIS AND APPLICATIONS, 2021, 24 (01) : 75 - 87
  • [8] Practical Nonparametric Sampling Strategies for Quantile-Based Ordinal Optimization
    Shin, Dongwook
    Broadie, Mark
    Zeevi, Assaf
    INFORMS JOURNAL ON COMPUTING, 2022, 34 (02) : 752 - 768
  • [9] Nonparametric “anti-Bayesian” quantile-based pattern classification
    Fatemeh Mahmoudi
    Mostafa Razmkhah
    B. John Oommen
    Pattern Analysis and Applications, 2021, 24 : 75 - 87
  • [10] On weak convergence of quantile-based empirical likelihood process for ROC curves
    Jiang, Hu
    Liu, Yiming
    Wang, Zhou
    STATISTICS AND COMPUTING, 2024, 34 (05)