Selecting pre-screening items for early intervention trials of dementia - a case study

被引:45
作者
Li, L
Huang, J
Sun, S
Shen, JZ
Unverzagt, FW
Gao, SJ
Hendrie, HH
Hall, K
Hui, SL
机构
[1] Indiana Univ, Sch Med, Div Biostat, Indianapolis, IN 46202 USA
[2] Indiana Univ Purdue Univ, Dept Comp & Informat Sci, Indianapolis, IN 46202 USA
[3] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[4] Indiana Univ, Sch Med, Dept Psychiat, Indianapolis, IN 46202 USA
[5] Regenstrief Inst Hlth Care, Indianapolis, IN 46202 USA
关键词
discrimination; classification; LASSO; logistic regression; neural network; decision tree;
D O I
10.1002/sim.1715
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Our goal was to review and extend statistical methods for discriminating between normal subjects and those with dementia or cognitive impairment. We compared six different methods to one constructed by expert opinion, in their brevity and predictive power. The methods include logistic regression and neural networks, with standard and least absolute shrinkage and selection operator (LASSO) variable selection, as well as decision trees with and without boosting. These methods were applied to the baseline data of a subgroup of subjects in a dementia study, using their screening interview items to predict their clinical diagnosis of normal or non-normal (cognitively impaired or demented). The derived models were then validated on a different subgroup of subjects in the same study who had the screening and clinical diagnosis two to five years later. Performance of different models was compared based on their sensitivity and specificity in the validation sample. Generally, the six statistical methods performed slightly to moderately better than the expert-opinion model. Neural networks generally performed better than the logistic and decision tree models. LASSO improved the performance of logistic and neural network models, but it eliminated few input variables in the neural network. The single decision tree performed at least as well as the standard logistic model, and with fewer items, making it an attractive pre-screening tool. Using the boosting option for decision trees did not substantially improve the performance. We recommend that for each situation, different methods of classification should be attempted to obtain optimal results for a given purpose. Copyright (C) 2004 John Wiley Sons, Ltd.
引用
收藏
页码:271 / 283
页数:13
相关论文
共 18 条
  • [11] Incidence of dementia and Alzheimer disease in 2 communities - Yoruba residing in Ibadan, Nigeria, and African Americans residing in Indianapolis, Indiana
    Hendrie, HC
    Ogunniyi, A
    Hall, KS
    Baiyewu, O
    Unverzagt, FW
    Gureje, O
    Gao, SJ
    Evans, RM
    Ogunseyinde, AO
    Adeyinka, AO
    Musick, B
    Hui, SL
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2001, 285 (06): : 739 - 747
  • [12] Monotone shrinkage of trees
    LeBlanc, M
    Tibshirani, R
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 1998, 7 (04) : 417 - 433
  • [13] Ripley B.D., 1996, PATTERN RECOGN
  • [14] SUN X, 2000, THESIS U TORONTO
  • [15] Tibshirani R, 1997, STAT MED, V16, P385, DOI 10.1002/(SICI)1097-0258(19970228)16:4<385::AID-SIM380>3.0.CO
  • [16] 2-3
  • [18] Prevalence of cognitive impairment - Data from the Indianapolis study of health and aging
    Unverzagt, FW
    Gao, S
    Baiyewu, O
    Ogunniyi, AO
    Gureje, O
    Perkins, A
    Emsley, CL
    Dickens, J
    Evans, R
    Musick, B
    Hall, KS
    Hui, SL
    Hendrie, HC
    [J]. NEUROLOGY, 2001, 57 (09) : 1655 - 1662