A Bayesian decision theory approach to variable selection for discrimination

被引:0
作者
T. Fearn
P. J. Brown
P. Besbeas
机构
来源
Statistics and Computing | 2002年 / 12卷
关键词
Bayes; decision theory; discriminant analysis; near infrared spectroscopy; simulated annealing; variable selection;
D O I
暂无
中图分类号
学科分类号
摘要
Motivated by examples in spectroscopy, we study variable selection for discrimination in problems with very many predictor variables. Assuming multivariate normal distributions with common variance for the predictor variables within groups, we develop a Bayesian decision theory approach that balances costs for variables against a loss due to classification errors. The approach is computationally intensive, requiring a simulation to approximate the intractable expected loss and a search, using simulated annealing, over a large space of possible subsets of variables. It is illustrated by application to a spectroscopic example with 3 groups, 100 variables, and 71 training cases, where the approach finds subsets of between 5 and 14 variables whose discriminatory power is comparable with that of linear discriminant analysis using principal components derived from the full 100 variables. We study both the evaluation of expected loss and the tuning of the simulated annealing for the example, and conclude that computational effort should be concentrated on the search.
引用
收藏
页码:253 / 260
页数:7
相关论文
共 19 条
[1]  
Brown P.J.(1999)Discrimination with many variables Journal of the American Statistical Association 94 1320-1329
[2]  
Fearn T.(1999)The choice of variables in multivariate regression: Anon-conjugate Bayesian decision theory approach Biometrika 86 635-648
[3]  
Haque M.S.(1981)Some matrix-variate distribution theory: Notational considerations and a Bayesian application Biometrika 68 265-274
[4]  
Brown P.J.(1995)Penalized discriminant analysis Annals of Statistics 23 73-102
[5]  
Fearn T.(1995)Discriminant analysis with singular covariance matrices: Methods and applications to spectroscopic data Applied Statistics 44 105-115
[6]  
Vannucci M.(1986)Convergence of an annealing algorithm Mathematical Programming 34 111-124
[7]  
Dawid A.P.(1971)Elicitation of personal probabilities and expectations Journal of the American Statistical Association 66 783-801
[8]  
Hastie T.(1994)Statistical thinking and technique for QSAR and related studies. Part II. Specific methods Journal of Chemometrics 8 1-20
[9]  
Buja A.(undefined)undefined undefined undefined undefined-undefined
[10]  
Tibshirani R.(undefined)undefined undefined undefined undefined-undefined