Conditional permutation importance revisited

被引:87
作者
Debeer, Dries [1 ,2 ,3 ]
Strobl, Carolin [1 ]
机构
[1] Univ Zurich, Psychol Methods Evaluat & Stat, Binzmuehlestr 14,Box 27, CH-8050 Zurich, Switzerland
[2] Katholieke Univ Leuven, Fac Psychol & Educ Sci, Etienne Sabbelaan 51 Box 7654, B-8500 Kortrijk, Belgium
[3] Katholieke Univ Leuven, IMEC, Res Grp, ITEC, Etienne Sabbelaan 51 Box 7654, B-8500 Leuven, Belgium
关键词
Conditional permutation importance; Random forest; R; VARIABLE IMPORTANCE; PARAMORPHIC REPRESENTATION; MULTIPLE-REGRESSION; RELATIVE IMPORTANCE; PREDICTORS; SELECTION; FORESTS; TREES;
D O I
10.1186/s12859-020-03622-2
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundRandom forest based variable importance measures have become popular tools for assessing the contributions of the predictor variables in a fitted random forest. In this article we reconsider a frequently used variable importance measure, the Conditional Permutation Importance (CPI). We argue and illustrate that the CPI corresponds to a more partial quantification of variable importance and suggest several improvements in its methodology and implementation that enhance its practical value. In addition, we introduce the threshold value in the CPI algorithm as a parameter that can make the CPI more partial or more marginal.ResultsBy means of extensive simulations, where the original version of the CPI is used as the reference, we examine the impact of the proposed methodological improvements. The simulation results show how the improved CPI methodology increases the interpretability and stability of the computations. In addition, the newly proposed implementation decreases the computation times drastically and is more widely applicable. The improved CPI algorithm is made freely available as an add-on package to the open-source software R.ConclusionThe proposed methodology and implementation of the CPI is computationally faster and leads to more stable results. It has a beneficial impact on practical research by making random forest analyses more interpretable.
引用
收藏
页数:30
相关论文
共 50 条