Robust fuzzy clustering based on quantile autocovariances

被引:0
作者
B. Lafuente-Rego
P. D’Urso
J. A. Vilar
机构
[1] University of A Coruña,Research Group on Modeling, Optimization and Statistical Inference (MODES), Department of Mathematics, Computer Science Faculty
[2] Sapienza University of Rome,Dipartimento di Scienze Sociali ed Economiche
来源
Statistical Papers | 2020年 / 61卷
关键词
Time series data; Robust fuzzy ; -medoids clustering; Quantile autocovariances; Exponential distance; Noise cluster; Trimming;
D O I
暂无
中图分类号
学科分类号
摘要
Robustness to the presence of outliers in time series clustering is addressed. Assuming that the clustering principle is to group realizations of series generated from similar dependence structures, three robust versions of a fuzzy C-medoids model based on comparing sample quantile autocovariances are proposed by considering, respectively, the so-called metric, noise, and trimmed approaches. Each method achieves its robustness against outliers in different manner. The metric approach considers a suitable transformation of the distance aimed at smoothing the effect of the outliers, the noise approach brings together the outliers into a separated artificial cluster, and the trimmed approach removes a fraction of the time series. All the proposed approaches take advantage of the high capability of the quantile autocovariances to discriminate between independent realizations from a broad range of stationary processes, including linear, non-linear and conditional heteroskedastic models. An extensive simulation study involving scenarios with different generating models and contaminated with outliers is performed. Robustness against (i) outliers generated from different generating patterns, and (ii) outliers characterized by isolated, temporary or persistent level changes is evaluated. The influence of the input parameters required by the different algorithms is analyzed. Regardless of the considered models, the results show that the proposed robust procedures are able to neutralize the effect of the anomalous series preserving the true clustering structure, and fairly outperform other robust algorithms based on alternative metrics. Two applications to financial data sets permit to illustrate the usefulness of the proposed models.
引用
收藏
页码:2393 / 2448
页数:55
相关论文
共 195 条
  • [1] Aielli GP(2013)Fast clustering of GARCH processes via gaussian mixture models Math Comput Simul 94 205-222
  • [2] Caporin M(2006)Comparison of time series using subsampling Comput Stat Data Anal 50 2589-2599
  • [3] Alonso AM(2006)Time series clustering based on forecast densities Comput Stat Data Anal 51 762-776
  • [4] Maharaj EA(1996)The geometrical ergodicity of nonlinear autoregressive models Stat Sin 6 943-956
  • [5] Alonso AM(1981)Overlapping clustering: a new method for product positioning J Mark Res 18 310-317
  • [6] Berrendero JR(2014)Clustering financial time series with variance ratio statistics Quant Financ 14 2121-2133
  • [7] Hernández A(2010)Identifying common dynamic features in stock returns Quant Financ 10 797-807
  • [8] Justel A(2006)A periodogram-based metric for time series classification Comput Stat Data Anal 50 2668-2684
  • [9] An HZ(2009)Comparison of times series with unequal length in the frequency domain Commun Stat Simul Comput 38 527-540
  • [10] Huang FC(2006)A fuzzy extension of the sihouette width criterion for cluster analysis Fuzzy Sets Syst 157 2858-2875