Theory-Guided Exploration With Structural Equation Model Forests

Times cited: 56
Authors
Brandmaier, Andreas M. [1 ,2 ]
Prindle, John J. [1 ,5 ]
McArdle, John J. [3 ]
Lindenberger, Ulman [1 ,4 ]
Affiliations
[1] Max Planck Inst Human Dev, Ctr Lifespan Psychol, Lentzeallee 94, D-14195 Berlin, Germany
[2] Max Planck UCL Ctr Computat Psychiat & Ageing Res, Berlin, Germany
[3] Univ Southern Calif, Dept Psychol, Los Angeles, CA USA
[4] European Univ Inst, Fiesole, Italy
[5] Univ Southern Calif, Sch Social Work, Los Angeles, CA USA
Keywords
SEM forest; model-based tree; recursive partitioning; variable importance; case proximity; VARIABLE IMPORTANCE; EPISODIC MEMORY; CLASSIFICATION; VALIDATION; SELECTION; AGE; STRESS; FUTURE; SAMPLE; ADULTS;
DOI
10.1037/met0000090
Chinese Library Classification (CLC) number
B84 [Psychology];
Discipline classification code
04 ; 0402 ;
Abstract
Structural equation model (SEM) trees, a combination of SEMs and decision trees, have been proposed as a data-analytic tool for theory-guided exploration of empirical data. With respect to a hypothesized model of multivariate outcomes, such trees recursively find subgroups with similar patterns of observed data. SEM trees allow for the automatic selection of variables that predict differences across individuals in specific theoretical models, for instance, differences in latent factor profiles or developmental trajectories. However, SEM trees can be unstable, because small variations in the data can result in different trees. As a remedy, SEM forests, which are ensembles of SEM trees based on resamplings of the original dataset, provide increased stability. Because large forests are less suitable for visual inspection and interpretation, aggregate measures provide researchers with hints on how to improve their models: (a) variable importance is based on random permutations of the out-of-bag (OOB) samples of the individual trees and quantifies, for each variable, the average reduction of uncertainty about the model-predicted distribution; and (b) case proximity enables researchers to perform clustering and outlier detection. We provide an overview of SEM forests and illustrate their utility in the context of cross-sectional factor models of intelligence and episodic memory. We discuss benefits and limitations, and provide advice on how and when to use SEM trees and forests in future research.
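The permutation-based variable importance described in the abstract can be illustrated with a small sketch. This is not the authors' implementation and not the API of any SEM software. It rests on several simplifying assumptions: the "model" in each leaf is a univariate normal distribution rather than an SEM, each tree is a single split (a stump) on a binary covariate, and importance is taken as the average drop in out-of-bag log-likelihood after permuting one covariate. All function names and the simulated data are hypothetical.

```python
import math
import random

def gaussian_loglik(ys, mean, var):
    """Log-likelihood of ys under a normal distribution N(mean, var)."""
    return sum(-0.5 * (math.log(2 * math.pi * var) + (y - mean) ** 2 / var)
               for y in ys)

def fit_gaussian(ys):
    """Maximum-likelihood mean and variance (variance floored for stability)."""
    mean = sum(ys) / len(ys)
    var = sum((y - mean) ** 2 for y in ys) / len(ys)
    return mean, max(var, 1e-6)

def fit_stump(rows, n_covariates, min_leaf=5):
    """Depth-one model-based tree: choose the binary covariate whose split
    most improves the fitted log-likelihood (stand-in for an SEM tree)."""
    ys = [y for _, y in rows]
    root = fit_gaussian(ys)
    best = {"var": None, "models": {0: root, 1: root}}
    best_ll = gaussian_loglik(ys, *root)
    for j in range(n_covariates):
        groups = {0: [], 1: []}
        for x, y in rows:
            groups[x[j]].append(y)
        if min(len(g) for g in groups.values()) < min_leaf:
            continue
        models = {k: fit_gaussian(g) for k, g in groups.items()}
        ll = sum(gaussian_loglik(groups[k], *models[k]) for k in groups)
        if ll > best_ll:
            best_ll, best = ll, {"var": j, "models": models}
    return best

def tree_loglik(tree, rows):
    """Log-likelihood of rows under the leaf models they are routed to."""
    total = 0.0
    for x, y in rows:
        leaf = 0 if tree["var"] is None else x[tree["var"]]
        total += gaussian_loglik([y], *tree["models"][leaf])
    return total

def permutation_importance(data, n_covariates, n_trees=50, seed=0):
    """Average OOB log-likelihood loss when one covariate is permuted."""
    rng = random.Random(seed)
    n = len(data)
    importance = [0.0] * n_covariates
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]      # bootstrap resample
        in_bag = set(idx)
        oob = [data[i] for i in range(n) if i not in in_bag]
        tree = fit_stump([data[i] for i in idx], n_covariates)
        base = tree_loglik(tree, oob)
        for j in range(n_covariates):
            shuffled = [x[j] for x, _ in oob]
            rng.shuffle(shuffled)                        # break the x_j link
            permuted = [(x[:j] + (v,) + x[j + 1:], y)
                        for (x, y), v in zip(oob, shuffled)]
            importance[j] += (base - tree_loglik(tree, permuted)) / n_trees
    return importance

def simulate(n=400, seed=1):
    """Toy data: only covariate 0 shifts the outcome mean."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        x = (rng.randint(0, 1), rng.randint(0, 1))
        data.append((x, rng.gauss(2.0 * x[0], 1.0)))
    return data

data = simulate()
importance = permutation_importance(data, n_covariates=2)
print(importance)
```

In this toy setting, the covariate that actually shifts the outcome distribution should receive a clearly larger importance than the noise covariate. A real SEM forest would replace the normal leaf model with a fitted SEM and grow deeper trees.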
Pages: 566-582
Number of pages: 17