Distance-based Clustering of Functional Data with Derivative Principal Component Analysis

被引：0

作者：

Yu, Ping ^{[1
]}

Shi, Gongming ^{[2
]}

Wang, Chunjie ^{[3
]}

Song, Xinyuan ^{[4
]}

机构：

[1] Shanxi Normal Univ, Sch Math & Comp Sci, Taiyuan, Peoples R China

[2] Capital Univ Econ & Business, Sch Stat, Beijing, Peoples R China

[3] Changchun Univ Technol, Sch Math & Stat, Changchun, Peoples R China

[4] Chinese Univ Hong Kong, Dept Stat, Shatin, Hong Kong, Peoples R China

来源：

JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS | 2025年 / 34卷 / 01期

关键词：

Clustering; Curve derivatives; Functional principal component analysis; Identifiability; Projection;

D O I：

10.1080/10618600.2024.2366499

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

Functional data analysis (FDA) is an important modern paradigm for handling infinite-dimensional data. An important task in FDA is clustering, which identifies subgroups based on the shapes of measured curves. Considering that derivatives can provide additional useful information about the shapes of functionals, we propose a novel L2 distance between two random functions by incorporating the functions and their derivative information to determine the dissimilarity of curves under a unified scheme for dense observations. The Karhunen-Lo & egrave;ve expansion is used to approximate the curves and their derivatives. Cluster membership prediction for each curve intends to minimize the new distances between the observed and predicted curves through subspace projection among all possible clusters. We provide consistent estimators for the curves, curve derivatives, and the proposed distance. Identifiability issues of the clustering procedure are also discussed. The utility of the proposed method is illustrated via simulation studies and applications to two real datasets. The proposed method can considerably improve cluster performance compared with existing functional clustering methods. Supplementary materials for the article are available online.

引用

页码：47 / 58

页数：12

共 32 条

[1] Carroll C., 2021, FDAPACE FUNCTIONAL D
[2] Sparse and smooth functional data clustering
Centofanti, Fabio
Lepore, Antonio
Palumbo, Biagio
[J]. STATISTICAL PAPERS, 2024, 65 (02) : 795 - 825
[3] Functional clustering and identifying substructures of longitudinal data
Chiou, Jeng-Min
Li, Pai-Ling
[J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2007, 69 : 679 - 699
[4] Correlation-Based Functional Clustering via Subspace Projection
Chiou, Jeng-Min
Li, Pai-Ling
[J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2008, 103 (484) : 1684 - 1692
[5] DERIVATIVE PRINCIPAL COMPONENTS FOR REPRESENTING THE TIME DYNAMICS OF LONGITUDINAL AND FUNCTIONAL DATA
Dai, Xiongtao
Muller, Hans-Georg
Tao, Wenwen
[J]. STATISTICA SINICA, 2022, 32 (01) : 179 - 180
[6] DERIVATIVE PRINCIPAL COMPONENT ANALYSIS FOR REPRESENTING THE TIME DYNAMICS OF LONGITUDINAL AND FUNCTIONAL DATA
Dai, Xiongtao
Muller, Hans-Georg
Tao, Wenwen
[J]. STATISTICA SINICA, 2018, 28 (03) : 1583 - 1609
[7] Componentwise classification and clustering of functional data
Delaigle, A.
Hall, P.
Bathia, N.
[J]. BIOMETRIKA, 2012, 99 (02) : 299 - 313
[8] Clustering functional data into groups by using projections
Delaigle, Aurore
Hall, Peter
Tung Pham
[J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2019, 81 (02) : 271 - 304
[9] Functional nonparametric statistics in action
Ferraty, Frederic
Vieu, Philippe
[J]. ART OF SEMIPARAMETRICS, 2006, : 112 - +
[10] A proposal for robust curve clustering
García-Escudero, LA
Gordaliza, A
[J]. JOURNAL OF CLASSIFICATION, 2005, 22 (02) : 185 - 201

← 1 2 3 4 →