VARIABLE SELECTION FOR LATENT CLASS ANALYSIS WITH APPLICATION TO LOW BACK PAIN DIAGNOSIS

被引:34
作者
Fop, Michael [1 ,2 ]
Smart, Keith M. [3 ]
Murphy, Thomas Brendan [1 ,2 ]
机构
[1] Univ Coll Dublin, Sch Math & Stat, Dublin 4, Ireland
[2] Univ Coll Dublin, Insight Res Ctr, Dublin 4, Ireland
[3] St Vincents Univ Hosp, Dublin 4, Ireland
基金
爱尔兰科学基金会;
关键词
Clinical criteria selection; clustering; latent class analysis; low back pain; mixture models; model-based clustering; variable selection; DISCRIMINANT-ANALYSIS; CLASSIFICATION; MIXTURE; SEPARATION; MECHANISMS; CRITERIA;
D O I
10.1214/17-AOAS1061
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The identification of most relevant clinical criteria related to low back pain disorders may aid the evaluation of the nature of pain suffered in a way that usefully informs patient assessment and treatment. Data concerning low back pain can be of categorical nature, in the form of a check-list in which each item denotes presence or absence of a clinical condition. Latent class analysis is a model-based clustering method for multivariate categorical responses, which can be applied to such data for a preliminary diagnosis of the type of pain. In this work, we propose a variable selection method for latent class analysis applied to the selection of the most useful variables in detecting the group structure in the data. The method is based on the comparison of two different models and allows the discarding of those variables with no group information and those variables carrying the same information as the already selected ones. We consider a swap-stepwise algorithm where at each step the models are compared through an approximation to their Bayes factor. The method is applied to the selection of the clinical criteria most useful for the clustering of patients in different classes. It is shown to perform a parsimonious variable selection and to give a clustering performance comparable to the expert-based classification of patients into three classes of pain.
引用
收藏
页码:2080 / 2110
页数:31
相关论文
共 69 条
  • [51] Scrucca L., 2016, Unsupervised Learning Algorithms, P55
  • [52] Feature selection for clustering categorical data with an embedded modelling approach
    Silvestre, Claudia
    Cardoso, Margarida G. M. S.
    Figueiredo, Mario
    [J]. EXPERT SYSTEMS, 2015, 32 (03) : 444 - 453
  • [53] Towards a mechanisms-based classification of pain in musculoskeletal physiotherapy?
    Smart, Keith
    O'Connell, Neil
    Doody, Catherine
    [J]. PHYSICAL THERAPY REVIEWS, 2008, 13 (01) : 1 - 10
  • [54] The Discriminative Validity of "Nociceptive," " Peripheral Neuropathic," and "Central Sensitization" as Mechanisms-based Classifications of Musculoskeletal Pain
    Smart, Keith M.
    Blake, Catherine
    Staines, Anthony
    Doody, Catherine
    [J]. CLINICAL JOURNAL OF PAIN, 2011, 27 (08) : 655 - 663
  • [55] Clinical indicators of 'nociceptive', 'peripheral neuropathic' and 'central' mechanisms of musculoskeletal pain. A Delphi survey of expert clinicians
    Smart, Keith M.
    Blake, Catherine
    Staines, Anthony
    Doody, Catherine
    [J]. MANUAL THERAPY, 2010, 15 (01) : 80 - 87
  • [56] Classification of patients with low back-related leg pain: a systematic review
    Stynes, Siobhan
    Konstantinou, Kika
    Dunn, Kate M.
    [J]. BMC MUSCULOSKELETAL DISORDERS, 2016, 17
  • [57] Bayesian variable selection in clustering high-dimensional data
    Tadesse, MG
    Sha, N
    Vannucci, M
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (470) : 602 - 617
  • [58] The prevalence of low back pain: A systematic review of the literature from 1966 to 1998
    Walker, BF
    [J]. JOURNAL OF SPINAL DISORDERS, 2000, 13 (03): : 205 - 217
  • [59] Variable selection for model-based high-dimensional clustering and its application to microarray data
    Wang, Sijian
    Zhu, Ji
    [J]. BIOMETRICS, 2008, 64 (02) : 440 - 448
  • [60] Bayesian variable selection for latent class analysis using a collapsed Gibbs sampler
    White, Arthur
    Wyse, Jason
    Murphy, Thomas Brendan
    [J]. STATISTICS AND COMPUTING, 2016, 26 (1-2) : 511 - 527