Quantile-based fuzzy clustering of multivariate time series in the frequency domain

被引:0
作者
Lopez-Oriona, Angel [1 ]
Vilar, Jose A. [1 ,2 ]
D'Urso, Pierpaolo [3 ]
机构
[1] Univ A Coruna, Res Ctr Informat & Commun Technol CITIC, Res Grp MODES, La Coruna 15071, Spain
[2] Technol Inst Ind Math ITMATI, La Coruna, Spain
[3] Sapienza Univ Rome, Dept Social Sci & Econ, Ple Aldo Moro 5, Rome, Italy
关键词
Multivariate time series; Clustering; Quantile cross-spectral density; Fuzzy C-means; Fuzzy C-medoids; Principal component analysis;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
A novel procedure to perform fuzzy clustering of multivariate time series generated from different dependence models is proposed. Different amounts of dissimilarity between the generating models or changes on the dynamic behaviours over time are some arguments justifying a fuzzy approach, where each series is associated to all the clusters with specific membership levels. Our procedure considers quantile-based cross-spectral features and consists of three stages: (i) each element is characterized by a vector of proper estimates of the quantile cross-spectral densities, (ii) principal component analysis is carried out to capture the main differences reducing the effects of the noise, and (iii) the squared Euclidean distance between the first retained principal components is used to perform clustering through the standard fuzzy C-means and fuzzy C-medoids algorithms. The performance of the proposed approach is evaluated in a broad simulation study where several types of generating processes are considered, including linear, nonlinear and dynamic conditional correlation models. Assessment is done in two different ways: by directly measuring the quality of the resulting fuzzy partition and by taking into account the ability of the technique to determine the overlapping nature of series located equidistant from well-defined clusters. The procedure is compared with the few alternatives suggested in the literature, substantially outperforming all of them whatever the underlying process and the evaluation scheme. Two specific applications involving air quality and financial databases illustrate the usefulness of our approach. (c) 2022 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页码:115 / 154
页数:40
相关论文
共 95 条
  • [1] Time-series clustering - A decade review
    Aghabozorgi, Saeed
    Shirkhorshidi, Ali Seyed
    Teh Ying Wah
    [J]. INFORMATION SYSTEMS, 2015, 53 : 16 - 38
  • [2] Andersson M., 2008, APPL FINANC EC, V18, P139, DOI [10.1080/09603100601057854, DOI 10.1080/09603100601057854]
  • [3] OVERLAPPING CLUSTERING - A NEW METHOD FOR PRODUCT POSITIONING
    ARABIE, P
    CARROLL, JD
    DESARBO, W
    WIND, J
    [J]. JOURNAL OF MARKETING RESEARCH, 1981, 18 (03) : 310 - 317
  • [4] Awwad A., 2008, IECON 2008 - 34th Annual Conference of IEEE Industrial Electronics Society, P1287, DOI 10.1109/IECON.2008.4758140
  • [5] Bagnall A., 2018, ARXIV, DOI DOI 10.48550/ARXIV.1811.00075
  • [6] Quantile coherency: A general measure for dependence between cyclical economic variables
    Barunik, Jozef
    Kley, Tobias
    [J]. ECONOMETRICS JOURNAL, 2019, 22 (02) : 131 - +
  • [7] Clustering financial time series with variance ratio statistics
    Bastos, Joao A.
    Caiado, Jorge
    [J]. QUANTITATIVE FINANCE, 2014, 14 (12) : 2121 - 2133
  • [8] Multivariate GARCH models: A survey
    Bauwens, L
    Laurent, S
    Rombouts, JVK
    [J]. JOURNAL OF APPLIED ECONOMETRICS, 2006, 21 (01) : 79 - 109
  • [9] Ben-Hur Asa, 2003, Methods Mol Biol, V224, P159
  • [10] Validity-guided (re)clustering with applications to image segmentation
    Bensaid, AM
    Hall, LO
    Bezdek, JC
    Clarke, LP
    Silbiger, ML
    Arrington, JA
    Murtagh, RF
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 1996, 4 (02) : 112 - 123