Clustering Spatially Correlated Functional Data With Multiple Scalar Covariates

被引:4
|
作者
Wu, Hui [1 ]
Li, Yan-Fu [1 ]
机构
[1] Tsinghua Univ, Dept Ind Engn, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Clustering; L-1-penalized estimator; mixture model; scalar covariates; spatially correlated functional data; MAXIMUM WEIGHTED LIKELIHOOD; FEATURE-SELECTION; MODEL; EM; MIXTURES; REGRESSION; CONVERGENCE;
D O I
10.1109/TNNLS.2021.3137795
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a probabilistic model for clustering spatially correlated functional data with multiple scalar covariates. The motivating application is to partition the 29 provinces of the Chinese mainland into a few groups characterized by the epidemic severity of COVID-19, while the spatial dependence and effects of risk factors are considered. It can be regarded as an extension of mixture models, which allows different subsets of covariates to influence the component weights and the component densities by modeling the parameters of the mixture as functions of the covariates. In this way, provinces with similar spatial factors are a priori more likely to be clustered together. Posterior predictive inference in this model formalizes the desired prediction. Further, the identifiability of the proposed model is analyzed, and sufficient conditions to guarantee "generic'' identifiability are provided. An L-1-penalized estimator is developed to assist variable selection and robust estimation when the number of explanatory covariates is large. An efficient expectation-minimization algorithm is presented for parameter estimation. Simulation studies and real-data examples are presented to investigate the empirical performance of the proposed method. Finally, it is worth noting that the proposed model has a wide range of practical applications, e.g., health management, environmental science, ecological studies, and so on.
引用
收藏
页码:7074 / 7088
页数:15
相关论文
共 50 条
  • [21] Bayesian correlated clustering to integrate multiple datasets
    Kirk, Paul
    Griffin, Jim E.
    Savage, Richard S.
    Ghahramani, Zoubin
    Wild, David L.
    BIOINFORMATICS, 2012, 28 (24) : 3290 - 3297
  • [22] Clustering Sequence Data with Mixture Markov Chains with Covariates Using Multiple Simplex Constrained Optimization Routine (MSiCOR)
    Das, Priyam
    Sen, Deborshee
    De, Debsurya
    Hou, Jue
    Abad, Zahra S. H.
    Kim, Nicole
    Xia, Zongqi
    Cai, Tianxi
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2024, 33 (02) : 379 - 392
  • [23] Addressing class imbalance in functional data clustering
    Higgins, Catherine
    Carey, Michelle
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2024,
  • [24] THE DIRICHLET LABELING PROCESS FOR CLUSTERING FUNCTIONAL DATA
    XuanLong Nguyen
    Gelfand, Alan E.
    STATISTICA SINICA, 2011, 21 (03) : 1249 - 1289
  • [25] Multivariate Receptor Models for Spatially Correlated Multipollutant Data
    Jun, Mikyoung
    Park, Eun Sug
    TECHNOMETRICS, 2013, 55 (03) : 309 - 320
  • [26] Data augmentation and parameter expansion for independent or spatially correlated ordinal data
    Schliep, Erin M.
    Hoeting, Jennifer A.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2015, 90 : 1 - 14
  • [27] Clustering of High-Dimensional and Correlated Data
    McLachlan, Geoffrey J.
    Ng, Shu-Kay
    Wang, K.
    DATA ANALYSIS AND CLASSIFICATION, 2010, : 3 - 11
  • [28] Clustering of extreme events created by multiple correlated maxima
    Azevedo, Davide
    Moreira Freitas, Ana Cristina
    Freitas, Jorge Milhazes
    Rodrigues, Fagner B.
    PHYSICA D-NONLINEAR PHENOMENA, 2016, 315 : 33 - 48
  • [29] Consistency of the mean and the principal components of spatially distributed functional data
    Hormann, Siegfried
    Kokoszka, Piotr
    BERNOULLI, 2013, 19 (5A) : 1535 - 1558
  • [30] Nonparametric Clustering of Functional Data
    Wang, Haiyan
    Neill, James
    Miller, Forrest
    STATISTICS AND ITS INTERFACE, 2008, 1 (01) : 47 - 62