Optimally Weighted L2 Distance for Functional Data

被引:24
作者
Chen, Huaihou [1 ]
Reiss, Philip T. [1 ,2 ,3 ]
Tarpey, Thaddeus [4 ]
机构
[1] NYU, Sch Med, Dept Child & Adolescent Psychiat, New York, NY 10012 USA
[2] NYU, Sch Med, Dept Populat Hlth, New York, NY USA
[3] Nathan S Kline Inst Psychiat Res, Orangeburg, NY 10962 USA
[4] Wright State Univ, Dept Math & Stat, Dayton, OH 45435 USA
基金
美国国家卫生研究院;
关键词
Coefficient of variation; Functional classification; Functional clustering; Penalized splines; Weighted L-2 distance; LIKELIHOOD; REGRESSION; SELECTION; TESTS;
D O I
10.1111/biom.12161
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Many techniques of functional data analysis require choosing a measure of distance between functions, with the most common choice being L2 distance. In this article we show that using a weighted L2 distance, with a judiciously chosen weight function, can improve the performance of various statistical methods for functional data, including k-medoids clustering, nonparametric classification, and permutation testing. Assuming a quadratically penalized (e.g., spline) basis representation for the functional data, we consider three nontrivial weight functions: design density weights, inverse-variance weights, and a new weight function that minimizes the coefficient of variation of the resulting squared distance by means of an efficient iterative procedure. The benefits of weighting, in particular with the proposed weight function, are demonstrated both in simulation studies and in applications to the Berkeley growth data and a functional magnetic resonance imaging data set.
引用
收藏
页码:516 / 525
页数:10
相关论文
共 26 条
[1]  
[Anonymous], 2003, Semiparametric Regression
[2]   Likelihood ratio tests in linear mixed models with one variance component [J].
Crainiceanu, CM ;
Ruppert, D .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2004, 66 :165-185
[3]   Functional k-sample problem when data are density functions [J].
Delicado, Pedro .
COMPUTATIONAL STATISTICS, 2007, 22 (03) :391-410
[4]   Curves discrimination: a nonparametric functional approach [J].
Ferraty, F ;
Vieu, P .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2003, 44 (1-2) :161-173
[5]  
Ferraty F., 2006, SPR S STAT
[6]   The effect of pre-smoothing functional data on cluster analysis [J].
Hitchcock, David B. ;
Booth, James G. ;
Casella, George .
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2007, 77 (11-12) :1089-1101
[7]   Improved estimation of dissimilarities by presmoothing functional data [J].
Hitchcock, DB ;
Casella, G ;
Booth, JG .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (473) :211-222
[8]  
Kaufman L., 1990, FINDING GROUPS DATA, DOI DOI 10.1002/9780470316801
[9]  
Lai M-J, 2007, Spline functions on triangulations no. 110
[10]   MULTI-RESPONSE PERMUTATION PROCEDURES FOR A PRIORI CLASSIFICATIONS [J].
MIELKE, PW ;
BERRY, KJ ;
JOHNSON, ES .
COMMUNICATIONS IN STATISTICS PART A-THEORY AND METHODS, 1976, 5 (14) :1409-1424