NONPARAMETRIC CLUSTERING FOR LONGITUDINAL FUNCTIONAL DATA WITH THE APPLICATION TO H-NMR SPECTRA OF KIDNEY TRANSPLANT PATIENTS. LONGITUDINAL FUNCTIONAL DATA CLUSTERING

被引:2
作者
Xie, Minzhen [1 ]
Liu, Haiyan [1 ]
Houwing-Duistermaat, Jeanine [1 ]
机构
[1] Univ Leeds, Dept Stat, Leeds, W Yorkshire, England
关键词
Longitudinal Functional Data; Nonparametric Clustering; Kidney Transplant Patients; PREDICTION;
D O I
10.19272/202111401003
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Longitudinal functional data are increasingly common in the health domain. The motivated dataset for this paper comprises H-NMR spectra of kidney transplant patients [8]. Our aim is to cluster patients into different clinical outcome subgoups to reveal the success of the transplantation. The NMR spectra of each patient at each time point are functional data and the data are longitudinally collected at up to nine different time points. Existing methods are available for functional data collected at one time point, but not for longitudinal functional data collected at a grid of time points subject to missingness. We therefore first apply a method to extract the same number of functional feactures for each subject. Next we propose a novel nonparametric clustering method for mulitivariate functional data. We applied our proposed clustering method to the kidney transplant dataset both to a subset of the raw data with only two time points and the extacted functional features. It appeared that the proposed method achieves better clustering performance on the extracted functional features than on the subset of raw data. A data simulation study was performed to further evaluate the method. The design mimiced the kidney transplant dataset but with a larger sample size. Scenarios which have different levels of noise were considered. The simulation study showed the accuarcy of our proposed method.
引用
收藏
页码:15 / 28
页数:14
相关论文
共 17 条
  • [1] [Anonymous], 2002, Applied functional data analysis: methods and case studies
  • [2] KERNEL DENSITY ESTIMATION VIA DIFFUSION
    Botev, Z. I.
    Grotowski, J. F.
    Kroese, D. P.
    [J]. ANNALS OF STATISTICS, 2010, 38 (05) : 2916 - 2957
  • [3] Modeling Repeated Functional Observations
    Chen, Kehui
    Mueller, Hans-Georg
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2012, 107 (500) : 1599 - 1609
  • [4] FERRATY F, 2006, SPR S STAT, P1
  • [5] Generalized multilevel function-on-scalar regression and principal component analysis
    Goldsmith, Jeff
    Zipunnikov, Vadim
    Schrack, Jennifer
    [J]. BIOMETRICS, 2015, 71 (02) : 344 - 353
  • [6] Greven S, 2011, CONTRIB STAT, P149
  • [7] Modeling and prediction of children's growth data via functional principal component analysis
    Hu Yu
    He XuMing
    Tao Jian
    Shi NingZhong
    [J]. SCIENCE IN CHINA SERIES A-MATHEMATICS, 2009, 52 (06): : 1342 - 1350
  • [8] An efficient k-means clustering algorithm:: Analysis and implementation
    Kanungo, T
    Mount, DM
    Netanyahu, NS
    Piatko, CD
    Silverman, R
    Wu, AY
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (07) : 881 - 892
  • [9] Optimal Reaction Coordinate as a Biomarker for the Dynamics of Recovery from Kidney Transplant
    Krivov, Sergei V.
    Fenton, Hayley
    Goldsmith, Paul J.
    Prasad, Rajendra K.
    Fisher, Julie
    Paci, Emanuele
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2014, 10 (06)
  • [10] Functional Principal Component Analysis of Spatiotemporal Point Processes With Applications in Disease Surveillance
    Li, Yehua
    Guan, Yongtao
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2014, 109 (507) : 1205 - 1215