Multiscale Clustering for Functional Data

被引:3
|
作者
Lim, Yaeji [1 ]
Oh, Hee-Seok [2 ]
Cheung, Ying Kuen [3 ]
机构
[1] Chung Ang Univ, Dept Appl Stat, Seoul 48513, South Korea
[2] Seoul Natl Univ, Dept Stat, 1 Gwanak Ro, Seoul 08826, South Korea
[3] Columbia Univ, Dept Biostat, New York, NY 10032 USA
基金
新加坡国家研究基金会; 美国国家卫生研究院;
关键词
Empirical mode decomposition; Functional data; High-dimensional data; Multiresolution analysis; Wavelet transform; REGRESSION;
D O I
10.1007/s00357-019-09313-9
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
In an era of massive and complex data, clustering is one of the most important procedures for understanding and analyzing unstructured multivariate data. Classical methods such as K-means and hierarchical clustering, however, are not efficient in grouping data that are high dimensional and have inherent multiscale structures. This paper presents new clustering procedures that can adapt to multiscale characteristics and high dimensionality of data. The proposed methods are based on a novel combination of multiresolution analysis and functional data analysis. As the core of the methodology, a clustering approach using the concept of multiresolution analysis may reflect both the global trend and local activities of data, and functional data analysis handles the high-dimensional data efficiently. Practical algorithms to implement the proposed methods are further discussed. The empirical performance of the proposed methods is evaluated through numerical studies including a simulation study and real data analysis, which demonstrates promising results of the proposed clustering.
引用
收藏
页码:368 / 391
页数:24
相关论文
共 50 条