共 50 条
Theory-Guided Exploration With Structural Equation Model Forests
被引:56
作者:
Brandmaier, Andreas M.
[1
,2
]
Prindle, John J.
[1
,5
]
McArdle, John J.
[3
]
Lindenberger, Ulman
[1
,4
]
机构:
[1] Max Planck Inst Human Dev, Ctr Lifespan Psychol, Lentzeallee 94, D-14195 Berlin, Germany
[2] Max Planck UCL Ctr Computat Psychiat & Ageing Res, Berlin, Germany
[3] Univ Southern Calif, Dept Psychol, Los Angeles, CA USA
[4] European Univ Inst, Fiesole, Italy
[5] Univ Southern Calif, Sch Social Work, Los Angeles, CA USA
关键词:
SEM forest;
model-based tree;
recursive partitioning;
variable importance;
case proximity;
VARIABLE IMPORTANCE;
EPISODIC MEMORY;
CLASSIFICATION;
VALIDATION;
SELECTION;
AGE;
STRESS;
FUTURE;
SAMPLE;
ADULTS;
D O I:
10.1037/met0000090
中图分类号:
B84 [心理学];
学科分类号:
04 ;
0402 ;
摘要:
Structural equation model (SEM) trees, a combination of SEMs and decision trees, have been proposed as a data-analytic tool for theory-guided exploration of empirical data. With respect to a hypothesized model of multivariate outcomes, such trees recursively find subgroups with similar patterns of observed data. SEM trees allow for the automatic selection of variables that predict differences across individuals in specific theoretical models, for instance, differences in latent factor profiles or developmental trajectories. However, SEM trees are unstable when small variations in the data can result in different trees. As a remedy, SEM forests, which are ensembles of SEM trees based on resamplings of the original dataset, provide increased stability. Because large forests are less suitable for visual inspection and interpretation, aggregate measures provide researchers with hints on how to improve their models: (a) variable importance is based on random permutations of the out-of-bag (OOB) samples of the individual trees and quantifies, for each variable, the average reduction of uncertainty about the model-predicted distribution; and (b) case proximity enables researchers to perform clustering and outlier detection. We provide an overview of SEM forests and illustrate their utility in the context of cross-sectional factor models of intelligence and episodic memory. We discuss benefits and limitations, and provide advice on how and when to use SEM trees and forests in future research.
引用
收藏
页码:566 / 582
页数:17
相关论文
共 50 条