Semiparametric mixture regression for asynchronous longitudinal data using multivariate functional principal component analysis

被引:0
作者
Lu, Ruihan [1 ]
Li, Yehua [2 ]
Yao, Weixin [2 ]
机构
[1] Off Biostat Food &Drug Adm, New Hampshire Ave, Sliver Spring, MD 10903 USA
[2] Univ Calif, Dept Stat, 900 Univ Anenue, Riverside, CA 92521 USA
基金
美国国家卫生研究院;
关键词
EM algorithm; functional data; functional principal component analysis; mixture regression; splines; subgroup analysis; DEHYDROEPIANDROSTERONE-SULFATE; MODELS; REPLACEMENT; DHEA; WOMEN; MOOD; RISK;
D O I
10.1093/biostatistics/kxaf008
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The transitional phase of menopause induces significant hormonal fluctuations, exerting a profound influence on the long-term well-being of women. In an extensive longitudinal investigation of women's health during mid-life and beyond, known as the Study of Women's Health Across the Nation (SWAN), hormonal biomarkers are repeatedly assessed, following an asynchronous schedule compared to other error-prone covariates, such as physical and cardiovascular measurements. We conduct a subgroup analysis of the SWAN data employing a semiparametric mixture regression model, which allows us to explore how the relationship between hormonal responses and other time-varying or time-invariant covariates varies across subgroups. To address the challenges posed by asynchronous scheduling and measurement errors, we model the time-varying covariate trajectories as functional data with reduced-rank Karhunen-Lo & eacute;ve expansions, where splines are employed to capture the mean and eigenfunctions. Treating the latent subgroup membership and the functional principal component (FPC) scores as missing data, we propose an Expectation-Maximization algorithm to effectively fit the joint model, combining the mixture regression for the hormonal response and the FPC model for the asynchronous, time-varying covariates. In addition, we explore data-driven methods to determine the optimal number of subgroups within the population. Through our comprehensive analysis of the SWAN data, we unveil a crucial subgroup structure within the aging female population, shedding light on important distinctions and patterns among women undergoing menopause.
引用
收藏
页数:18
相关论文
共 47 条
[1]   An Approach for Determining the Number of Clusters in a Model-Based Cluster Analysis [J].
Akogul, Serkan ;
Erisoglu, Murat .
ENTROPY, 2017, 19 (09)
[2]  
Al-Harithy RN, 2003, SAUDI MED J, V24, P837
[3]  
Carroll R.J., 2006, Measurement Error in Nonlinear Models
[4]   High serum dehydroepiandrosterone sulfate is associated with phenotypic acne and a reduced risk of abdominal obesity in women with polycystic ovary syndrome [J].
Chen, Mei-Jou ;
Chen, Chin-Der ;
Yang, Jehn-Hsiahn ;
Chen, Chi-Ling ;
Ho, Hong-Nerng ;
Yang, Wei-Shiung ;
Yang, Yu-Shih .
HUMAN REPRODUCTION, 2011, 26 (01) :227-234
[5]   MULTIVARIATE FUNCTIONAL PRINCIPAL COMPONENT ANALYSIS: A NORMALIZATION APPROACH [J].
Chiou, Jeng-Min ;
Chen, Yu-Ting ;
Yang, Ya-Fang .
STATISTICA SINICA, 2014, 24 (04) :1571-1596
[6]   Penalized Functional Regression [J].
Goldsmith, Jeff ;
Bobb, Jennifer ;
Crainiceanu, Ciprian M. ;
Caffo, Brian ;
Reich, Daniel .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2011, 20 (04) :830-851
[7]   A general framework for functional regression modelling [J].
Greven, Sonja ;
Scheipl, Fabian .
STATISTICAL MODELLING, 2017, 17 (1-2) :1-35
[8]   Restricted Likelihood Ratio Testing for Zero Variance Components in Linear Mixed Models [J].
Greven, Sonja ;
Crainiceanu, Ciprian M. ;
Kuechenhoff, Helmut ;
Peters, Annette .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2008, 17 (04) :870-891
[9]   Long-term DHEA replacement in primary adrenal insufficiency: A randomized, controlled trial [J].
Gurnell, Eleanor M. ;
Hunt, Penelope J. ;
Curran, Suzanne E. ;
Conway, Catherine L. ;
Pullenayegum, Eleanor M. ;
Huppert, Felicia A. ;
Compston, Juliet E. ;
Herbert, Joseph ;
Chatterjee, V. Krishna K. .
JOURNAL OF CLINICAL ENDOCRINOLOGY & METABOLISM, 2008, 93 (02) :400-409
[10]   Multivariate Functional Principal Component Analysis for Data Observed on Different (Dimensional) Domains [J].
Happ, Clara ;
Greven, Sonja .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2018, 113 (522) :649-659