Gaussian Mixture Models Based on Principal Components and Applications

Cited by: 5
Authors
Alqahtani, Nada A. [1]
Kalantan, Zakiah I. [1]
Affiliations
[1] King Abdulaziz Univ, Dept Stat, Jeddah, Saudi Arabia
Keywords
Gaussian distribution;
DOI
10.1155/2020/1202307
Chinese Library Classification (CLC) number
T [Industrial Technology];
Subject classification code
08;
Abstract
Data scientists use a variety of machine learning algorithms to discover patterns in large data sets that can lead to actionable insights. High-dimensional data are commonly reduced to a set of principal components so as to highlight similarities and differences. In this work, we model the reduced data with a bivariate Gaussian mixture model. We discuss a heuristic for detecting important components by choosing the initial values of the location parameters using two different techniques: cluster means obtained from k-means and hierarchical clustering, and the default values in the "mixtools" R package. The model parameters are estimated via an expectation-maximization (EM) algorithm. Criteria from the Bayesian point of view are evaluated for both techniques, demonstrating that both are efficient in terms of computation. The effectiveness of the discussed techniques is demonstrated through a simulation study and on real data sets from different fields.
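The pipeline summarized in the abstract (reduce to two principal components, initialize the component means from cluster means, then fit a bivariate Gaussian mixture by EM) can be sketched roughly as follows. This is a minimal illustration in Python with NumPy, not the authors' code: the paper works with the "mixtools" R package, and the function names `pca_2d`, `kmeans_means`, and `em_gmm`, the simulated data, the regularization term `reg`, and the BIC sign convention are all assumptions made here. Only the k-means initialization is shown; the hierarchical-clustering variant is omitted.

```python
import numpy as np

def pca_2d(X):
    """Project the rows of X onto the first two principal components."""
    Xc = X - X.mean(axis=0)
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:2].T

def kmeans_means(Z, k, n_iter=50, seed=0):
    """Plain Lloyd k-means; returns the k cluster means as initial locations."""
    rng = np.random.default_rng(seed)
    centers = Z[rng.choice(len(Z), size=k, replace=False)]
    for _ in range(n_iter):
        d2 = ((Z[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = Z[labels == j].mean(axis=0)
    return centers

def log_gauss(Z, mu, cov):
    """Log density of N(mu, cov) evaluated at each row of Z."""
    d = Z.shape[1]
    diff = Z - mu
    maha = (diff * np.linalg.solve(cov, diff.T).T).sum(axis=1)
    _, logdet = np.linalg.slogdet(cov)
    return -0.5 * (d * np.log(2 * np.pi) + logdet + maha)

def e_step(Z, w, mu, cov):
    """Responsibilities and log-likelihood under the current parameters."""
    logp = np.column_stack(
        [np.log(w[j]) + log_gauss(Z, mu[j], cov[j]) for j in range(len(w))]
    )
    m = logp.max(axis=1, keepdims=True)  # log-sum-exp for numerical stability
    log_norm = m + np.log(np.exp(logp - m).sum(axis=1, keepdims=True))
    return np.exp(logp - log_norm), log_norm.sum()

def em_gmm(Z, mu0, n_iter=200, tol=1e-8, reg=1e-6):
    """EM for a Gaussian mixture with full covariances, started at means mu0."""
    n, d = Z.shape
    k = len(mu0)
    mu = np.array(mu0, dtype=float)
    cov = np.array([np.cov(Z.T) + reg * np.eye(d) for _ in range(k)])
    w = np.full(k, 1.0 / k)
    resp, loglik = e_step(Z, w, mu, cov)
    for _ in range(n_iter):
        # M-step: update weights, means, and covariances from responsibilities.
        nk = resp.sum(axis=0)
        w = nk / n
        mu = (resp.T @ Z) / nk[:, None]
        for j in range(k):
            diff = Z - mu[j]
            cov[j] = (resp[:, j, None] * diff).T @ diff / nk[j] + reg * np.eye(d)
        # E-step with the updated parameters.
        resp, new_loglik = e_step(Z, w, mu, cov)
        converged = new_loglik - loglik < tol
        loglik = new_loglik
        if converged:
            break
    # BIC in the Schwarz form (lower is better); some packages flip the sign.
    n_params = (k - 1) + k * d + k * d * (d + 1) // 2
    bic = -2.0 * loglik + n_params * np.log(n)
    return w, mu, cov, loglik, bic

# Simulated 5-dimensional data with two groups (illustrative only).
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0.0, 1.0, size=(150, 5)),
               rng.normal(4.0, 1.0, size=(150, 5))])
Z = pca_2d(X)                      # reduced, bivariate data
mu0 = kmeans_means(Z, k=2)         # cluster means as initial locations
w, mu, cov, loglik, bic = em_gmm(Z, mu0)
```

Fitting the same data for several values of `k` and comparing the resulting `bic` values is one way to compare initialization strategies or choose the number of components, under the sign convention assumed above.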
Pages: 13