Gaussian Mixture Models Based on Principal Components and Applications

被引:6
作者
Alqahtani, Nada A. [1 ]
Kalantan, Zakiah I. [1 ]
机构
[1] King Abdulaziz Univ, Dept Stat, Jeddah, Saudi Arabia
关键词
Learning algorithms - Object recognition - Image segmentation - Maximum principle - Learning systems - K-means clustering - Machine learning;
D O I
10.1155/2020/1202307
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Data scientists use various machine learning algorithms to discover patterns in large data that can lead to actionable insights. In general, high-dimensional data are reduced by obtaining a set of principal components so as to highlight similarities and differences. In this work, we deal with the reduced data using a bivariate mixture model and learning with a bivariate Gaussian mixture model. We discuss a heuristic for detecting important components by choosing the initial values of location parameters using two different techniques: cluster means,k-means and hierarchical clustering, and default values in the "mixtools" R package. The parameters of the model are obtained via an expectation maximization algorithm. The criteria from Bayesian point are evaluated for both techniques, demonstrating that both techniques are efficient with respect to computation capacity. The effectiveness of the discussed techniques is demonstrated through a simulation study and using real data sets from different fields.
引用
收藏
页数:13
相关论文
共 18 条
[11]   On lines and planes of closest fit to systems of points in space. [J].
Pearson, Karl .
PHILOSOPHICAL MAGAZINE, 1901, 2 (7-12) :559-572
[12]  
R Core Team, 2022, R: A language and environment for statistical computing
[13]   Big Data Reduction Methods: A Survey [J].
Rehman, Muhammad Habib ur ;
Liew, Chee Sun ;
Abbas, Assad ;
Jayaraman, Prem Prakash ;
Wah, Teh Ying ;
Khan, Samee U. .
DATA SCIENCE AND ENGINEERING, 2016, 1 (04) :265-284
[14]   ESTIMATING DIMENSION OF A MODEL [J].
SCHWARZ, G .
ANNALS OF STATISTICS, 1978, 6 (02) :461-464
[15]  
Scrucca Luca, 2023, CRAN
[16]  
Verbeek J, 2004, THESIS
[17]  
World data atlas, WORLD REG STAT NAT D
[18]   Multimode process monitoring with PCA mixture model [J].
Xu, Xianzhen ;
Xie, Lei ;
Wang, Shuqing .
COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (07) :2101-2112