Componentwise classification and clustering of functional data

被引:53
作者
Delaigle, A. [1 ]
Hall, P. [1 ]
Bathia, N. [2 ]
机构
[1] Univ Melbourne, Dept Math & Stat, Parkville, Vic 3010, Australia
[2] Jump Trading, Chicago, IL 60654 USA
基金
美国国家科学基金会; 澳大利亚研究理事会;
关键词
Bandwidth; Classification error rate; Kernel method; Statistical smoothing; Tightness of clusters; DISCRIMINANT-ANALYSIS;
D O I
10.1093/biomet/ass003
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The infinite dimension of functional data can challenge conventional methods for classification and clustering. A variety of techniques have been introduced to address this problem, particularly in the case of prediction, but the structural models that they involve can be too inaccurate, or too abstract, or too difficult to interpret, for practitioners. In this paper, we develop approaches to adaptively choose components, enabling classification and clustering to be reduced to finite-dimensional problems. We explore and discuss properties of these methodologies. Our techniques involve methods for estimating classifier error rate and cluster tightness, and for choosing both the number of components, and their locations, to optimize these quantities. A major attraction of this approach is that it allows identification of parts of the function domain that convey important information for classification and clustering. It also permits us to determine regions that are relevant to one of these analyses but not the other.
引用
收藏
页码:299 / 313
页数:15
相关论文
共 33 条
[1]  
[Anonymous], 1994, Kernel smoothing
[2]  
[Anonymous], 2012, R LANG ENV STAT COMP
[3]   Functional Logistic Discrimination Via Regularized Basis Expansions [J].
Araki, Yuko ;
Konishi, Sadanori ;
Kawano, Shuichi ;
Matsui, Hidetoshi .
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2009, 38 (16-17) :2944-2957
[4]  
BERLINET A., 2008, ANN ISUP, V52, P61
[5]   Functional classification in Hilbert spaces [J].
Biau, G ;
Bunea, F ;
Wegkamp, MH .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2005, 51 (06) :2163-2172
[6]   A hidden process regression model for functional data description. Application to curve discrimination [J].
Chamroukhi, Faicel ;
Same, Allou ;
Govaert, Gerard ;
Aknin, Patrice .
NEUROCOMPUTING, 2010, 73 (7-9) :1210-1221
[7]   Functional clustering and identifying substructures of longitudinal data [J].
Chiou, Jeng-Min ;
Li, Pai-Ling .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2007, 69 :679-699
[8]   Robust estimation and classification for functional data via projection-based depth notions [J].
Cuevas, Antonio ;
Febrero, Manuel ;
Fraiman, Ricardo .
COMPUTATIONAL STATISTICS, 2007, 22 (03) :481-496
[9]   Achieving near perfect classification for functional data [J].
Delaigle, Aurore ;
Hall, Peter .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2012, 74 :267-286
[10]   DEFINING PROBABILITY DENSITY FOR A DISTRIBUTION OF RANDOM FUNCTIONS [J].
Delaigle, Aurore ;
Hall, Peter .
ANNALS OF STATISTICS, 2010, 38 (02) :1171-1193