Learning-based EM clustering for data on the unit hypersphere with application to exoplanet data

被引:1
作者
Yang, Miin-Shen [1 ]
Chang-Chien, Shou-Jen [1 ]
Hung, Wen-Liang [2 ]
机构
[1] Chung Yuan Christian Univ, Dept Appl Math, Chungli 32023, Taiwan
[2] Natl Tsing Hua Univ, Ctr Teacher Educ, Hsinchu, Taiwan
关键词
Clustering; EM algorithm; Model-based clustering; Directional data; Spherical data; Data on the unit hypersphere; von Mises-Fisher distributions; MEAN SHIFT; ALGORITHM; MIXTURES; NUMBER; SET;
D O I
10.1016/j.asoc.2017.06.037
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study focuses on clustering algorithms for data on the unit hypersphere. This type of directional data lain on the surface of a unit hypersphere is used in geology, biology, meteorology, medicine and oceanography. The EM algorithm with mixtures of von Mises-Fisher distributions is often used for model-based clustering for data on the unit hypersphere. However, the EM algorithm is sensitive to initial values and outliers and a number of clusters must be assigned a priori. In this paper, we propose an effective approach, called a learning-based EM algorithm with von Mises-Fisher distributions, to cluster this type of hyper-spherical data. The proposed clustering method is robust to outliers, without the need for initialization, and automatically determines the number of clusters. Thus, it becomes a fully-unsupervised model-based clustering method for data on the unit hypersphere. Some numerical and real examples with comparisons are given to demonstrate the effectiveness and superiority of the proposed method. We also apply the proposed learning-based EM algorithm to cluster exoplanet data in extrasolar planets. The clustering results have several important implications for exoplanet data and allow an interpretation of exoplanet migration. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:101 / 114
页数:14
相关论文
共 54 条