An automatic clustering algorithm for probability density functions

Cited by: 33
Authors
Chen, Jen-Hao [1 ]
Hung, Wen-Liang [1 ]
Affiliations
[1] National Hsinchu University of Education, Department of Applied Mathematics, Hsinchu, Taiwan
Keywords
clustering algorithms; COREL image database; kernel density method; probability density function; EM algorithm; likelihood; values
DOI
10.1080/00949655.2014.949715
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
We propose an intuitive and computationally simple algorithm for clustering probability density functions (pdfs). A data-driven learning mechanism is incorporated into the algorithm to determine suitable cluster widths. The clustering results show that the proposed algorithm automatically groups the pdfs and provides the optimal number of clusters without any a priori information. A performance study also shows that the proposed algorithm is more efficient than existing ones. In addition, clustering can serve as an intermediate compression tool in content-based multimedia retrieval; we therefore apply the proposed algorithm to categorize a subset of the COREL image database. The results indicate that the proposed algorithm performs well in colour image categorization.
Pages: 3047-3063
Page count: 17
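
The abstract gives no algorithmic detail, so the following is only a minimal sketch of the general setting it describes: estimating pdfs with a kernel density method and grouping them without fixing the number of clusters beforehand. It is not the authors' algorithm. The function names, the L1 distance, the leader-style grouping rule, the synthetic data, and the fixed `width` value are all assumptions made for illustration; in particular, the paper determines cluster widths with a data-driven mechanism, whereas here the width is simply chosen by hand.

```python
import math
import random

def kde_on_grid(sample, grid, bandwidth):
    """Gaussian kernel density estimate of `sample`, evaluated at each grid point."""
    norm = 1.0 / (len(sample) * bandwidth * math.sqrt(2.0 * math.pi))
    return [
        norm * sum(math.exp(-0.5 * ((x - s) / bandwidth) ** 2) for s in sample)
        for x in grid
    ]

def l1_distance(f, g, dx):
    """Riemann-sum approximation of the L1 distance between two gridded pdfs."""
    return dx * sum(abs(a - b) for a, b in zip(f, g))

def leader_cluster(pdfs, dx, width):
    """Assign each pdf to the first cluster whose leader lies within `width`;
    otherwise start a new cluster. The cluster count is not given in advance.
    (Hypothetical stand-in rule, not the method from the paper.)"""
    leaders, labels = [], []
    for f in pdfs:
        for k, leader in enumerate(leaders):
            if l1_distance(f, leader, dx) < width:
                labels.append(k)
                break
        else:
            # No existing leader is close enough: this pdf becomes a new leader.
            leaders.append(f)
            labels.append(len(leaders) - 1)
    return labels

random.seed(0)
dx = 0.1
grid = [-5.0 + dx * i for i in range(151)]  # grid covering [-5, 10]
# Synthetic data: three samples centred near 0 and three centred near 5.
samples = [[random.gauss(mu, 1.0) for _ in range(200)] for mu in (0, 0, 0, 5, 5, 5)]
pdfs = [kde_on_grid(s, grid, bandwidth=0.4) for s in samples]
labels = leader_cluster(pdfs, dx, width=1.0)  # width is a hand-picked assumption
print(labels)
```

In this sketch the number of groups follows from the width alone, which is why the choice of width matters so much and why a data-driven way of setting it, as the abstract describes, is attractive.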