Statistical Modeling of RPCA-FCM in Spatiotemporal Rainfall Patterns Recognition

被引:3
作者
Che Mat Nor, Siti Mariana [1 ]
Shaharudin, Shazlyn Milleana [1 ]
Ismail, Shuhaida [2 ]
Mohd Najib, Sumayyah Aimi [3 ]
Tan, Mou Leong [4 ]
Ahmad, Norhaiza [5 ]
机构
[1] Univ Pendidikan Sultan Idris, Dept Math, Fac Sci & Math, Tanjong Malim 35900, Perak, Malaysia
[2] Univ Tun Hussein Onn Malaysia, Fac Appl Sci & Technol, Dept Math & Stat, Panchor 84600, Johor, Malaysia
[3] Univ Pendidikan Sultan Idris, Fac Human Sci, Dept Geog & Environm, Tanjong Malim 35900, Perak, Malaysia
[4] Univ Teknol Malaysia, Sch Humanities, Geog Sect, Geoinformat Unit, Gelugor 11800, Pulau Pinang, Malaysia
[5] Univ Teknol Malaysia, Fac Sci, Dept Math Sci, Skudai 81310, Johor, Malaysia
关键词
principal component analysis; robust principal component analysis; rainfall patterns; Tukey's biweight correlation; spatiotemporal; PRINCIPAL COMPONENT ANALYSIS; FUZZY C-MEANS; MEDITERRANEAN AREA; K-MEANS; PRECIPITATION; PCA;
D O I
10.3390/atmos13010145
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This study was conducted to identify the spatiotemporal torrential rainfall patterns of the East Coast of Peninsular Malaysia, as it is the region most affected by the torrential rainfall of the Northeast Monsoon season. Dimension reduction, such as the classical Principal Components Analysis (PCA) coupled with the clustering approach, is often applied to reduce the dimension of the data while simultaneously performing cluster partitions. However, the classical PCA is highly insensitive to outliers, as it assigns equal weights to each set of observations. Hence, applying the classical PCA could affect the cluster partitions of the rainfall patterns. Furthermore, traditional clustering algorithms only allow each element to exclusively belong to one cluster, thus observations within overlapping clusters of the torrential rainfall datasets might not be captured effectively. In this study, a statistical model of torrential rainfall pattern recognition was proposed to alleviate these issues. Here, a Robust PCA (RPCA) based on Tukey's biweight correlation was introduced and the optimum breakdown point to extract the number of components was identified. A breakdown point of 0.4 at 85% cumulative variance percentage efficiently extracted the number of components to avoid low-frequency variations or insignificant clusters on a spatial scale. Based on the extracted components, the rainfall patterns were further characterized based on cluster solutions attained using Fuzzy C-means clustering (FCM) to allow data elements to belong to more than one cluster, as the rainfall data structure permits this. Lastly, data generated using a Monte Carlo simulation were used to evaluate the performance of the proposed statistical modeling. It was found that the proposed RPCA-FCM performed better using RPCA-FCM compared to the classical PCA coupled with FCM in identifying the torrential rainfall patterns of Peninsular Malaysia's East Coast.
引用
收藏
页数:21
相关论文
共 59 条
[1]   A comparative analysis of clustering algorithms to identify the homogeneous rainfall gauge stations of Bangladesh [J].
Alam, Mohammad Samsul ;
Paul, Sangita .
JOURNAL OF APPLIED STATISTICS, 2020, 47 (08) :1460-1481
[2]   Probability Distribution and Characterization of Daily Precipitation Related to Tropical Cyclones over the Korean Peninsula [J].
Alcantara, Angelika L. ;
Ahn, Kuk-Hyun .
WATER, 2020, 12 (04)
[3]  
Alias NE, 2016, J TEKNOL, V78, P83
[4]  
Ansari Z., 2011, World of Computer Science and Information Technology Journal (WCSIT), V1, P217
[5]   A high-order multi-variable Fuzzy Time Series forecasting algorithm based on fuzzy clustering [J].
Askari, S. ;
Montazerin, N. .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (04) :2121-2135
[6]   Fuzzy C-Means clustering algorithm for data with unequal cluster sizes and contaminated with noise and outliers: Review and development [J].
Askari, Salar .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 165
[7]   KC-Means: A Fast Fuzzy Clustering [J].
Atiyah, Israa Abdzaid ;
Mohammadpour, Adel ;
Taheri, S. Mahmoud .
ADVANCES IN FUZZY SYSTEMS, 2018, 2018
[8]   Characterizing Compactness of Geometrical Clusters Using Fuzzy Measures [J].
Beliakov, Gleb ;
Li, Gang ;
Huy Quan Vu ;
Wilkin, Tim .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2015, 23 (04) :1030-1043
[9]  
Bezdek J. C., 1973, Journal of Cybernetics, V3, P58, DOI 10.1080/01969727308546047
[10]  
Bolon-Canedo V., 2020, FEATURE SELECTION HI