A Bayesian mixture model for clustering circular data

被引:8
作者
Rodriguez, Carlos E. [1 ]
Nunez-Antonio, Gabriel [2 ]
Escarela, Gabriel [2 ]
机构
[1] IIMAS UNAM, Dept Probabil & Stat, Mexico City, DF, Mexico
[2] Univ Autonoma Metropolitana Iztapalapa, Dept Math, Mexico City, DF, Mexico
关键词
Classification; Label switching; Projected normal; Slice sampler; Reversible jump; REVERSIBLE JUMP; UNKNOWN NUMBER; DENSITY-ESTIMATION; COMPONENTS; INFERENCE;
D O I
10.1016/j.csda.2019.106842
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Clustering complex circular phenomena is a common problem in different scientific disciplines. Examples include the clustering of directions of animal movement in the wild to identify migration patterns, and the classification of angular positions of meteorological events to investigate seasonality fluctuations. The main goal is to develop a novel methodology for clustering and classification of circular data, under a Bayesian mixture modeling framework. The mixture model is defined assuming that the number of components is finite, but unknown, and that each component follows a projected normal distribution. Model selection is performed by jointly making inferences about the parameters of the mixture model and the number of components, choosing the model with the highest posterior probability. A deterministic relabeling strategy is used to recover identifiability for the components in the chosen model. Estimates of both the posterior classification probabilities and the scaled densities are approximated via the relabeled MCMC output. The proposed methods are illustrated using both simulated and real datasets, and performance comparisons with existing strategies are also given. The results suggest that the new approach is an appealing alternative for the clustering and classification of circular data. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页数:16
相关论文
共 47 条
[1]   A note on circular nonparametrical classification [J].
Ackermann, H .
BIOMETRICAL JOURNAL, 1997, 39 (05) :577-587
[2]  
[Anonymous], 1997, THESIS
[3]  
[Anonymous], 2019, Advanced R, V2nd
[4]  
Burkard R. E., 2009, Assignment problems, DOI DOI 10.1137/1.9780898717754
[5]   Reversible jump, birth-and-death and more general continuous time Markov chain Monte Carlo samplers [J].
Cappé, O ;
Robert, CP ;
Rydén, T .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2003, 65 :679-700
[6]  
Chang F, 2010, J STAT SOFTW, V33, P1
[7]   On mean shift-based clustering for circular data [J].
Chang-Chien, Shou-Jen ;
Hung, Wen-Liang ;
Yang, Miin-Shen .
SOFT COMPUTING, 2012, 16 (06) :1043-1060
[8]   SOME PROPERTIES OF SCAN STATISTIC ON CIRCLE AND LINE [J].
CRESSIE, N .
JOURNAL OF APPLIED PROBABILITY, 1977, 14 (02) :272-283
[9]  
D Peng R., 2002, An Introduction to the .C Interface to R
[10]  
DIEBOLT J, 1994, J ROY STAT SOC B MET, V56, P363