A farewell to the sum of Akaike weights: The benefits of alternative metrics for variable importance estimations in model selection

被引:77
|
作者
Galipaud, Matthias [1 ]
Gillingham, Mark A. F. [2 ]
Dechaume-Moncharmont, Francois-Xavier [3 ]
机构
[1] Bielefeld Univ, Dept Evolutionary Biol, Bielefeld, Germany
[2] Univ Ulm, Inst Evolutionary Ecol & Conservat Genom, Ulm, Germany
[3] Univ Bourgogne Franche Comte, Ecol Evolut Team, UMR CNRS Biogeosci 6282, Dijon, France
来源
METHODS IN ECOLOGY AND EVOLUTION | 2017年 / 8卷 / 12期
关键词
Akaike information criterion; effect size; evidence ratio; model-averaging; multi-model inferences; standardised parameter estimates; variable criticality; variable ranking; MULTIMODEL INFERENCE; BEHAVIORAL ECOLOGY; PREDICTOR VARIABLES; RELATIVE IMPORTANCE; INFORMATION-THEORY; REGRESSION; GUIDE; CALL;
D O I
10.1111/2041-210X.12835
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
1. In a previous article, we advocated against using the sum of Akaike weights (SW) as a metric to distinguish between genuine and spurious variables in Information Theoretic (IT) statistical analyses. A recent article (Giam & Olden, Methods in Ecology and Evolution, 2016, 7, 388) criticises our finding and instead argues in favour of SW. It points out that (1) we performed a biased data-generation procedure and (2) we erroneously evaluated SW on its capacity to estimate the proportion of variance in the data explained by a variable. We here respond to these points. 2. Giam and Olden's first concern is unfounded. When using the data-generating code they proposed, SW remains very imprecise. To respond to their second concern, we first list the meanings taken by a variable's importance in the context of IT. Although, SW is presented as an estimate of variable relative importance in methodological textbooks (i.e. a variable's rank in importance or its relative contribution to the variance in the data), it is also used as a metric of variable absolute importance (i.e. a variable's absolute effect size or its statistical significance). We then compare SW to alternative metrics on its ability to estimate variable absolute or relative importance. 3. SW values have low repeatability across analyses. As a result, based on SW, it is hard to distinguish between variables with weak and large effects. For estimations of variable absolute importance, experimenters should prefer model-averaged parameter estimates and/or compare nested models based on evidence ratios. Sum of Akaike weights is also a poor metric of variable relative importance. We showed that correct variable ranking in importance was generally more frequent when using model-averaged standardised parameter estimates, than when using SW. 4. To avoid recurrent errors in ecology and evolution, we therefore warn against the use of SW for estimations of variable absolute and relative importance and we propose that experimenters should instead use model-averaged standardised parameter estimates for statistical inferences.
引用
收藏
页码:1668 / 1678
页数:11
相关论文
共 16 条
  • [1] AIC model selection using Akaike weights
    Wagenmakers, EJ
    Farrell, S
    PSYCHONOMIC BULLETIN & REVIEW, 2004, 11 (01) : 192 - 196
  • [2] AIC model selection using Akaike weights
    Eric-Jan Wagenmakers
    Simon Farrell
    Psychonomic Bulletin & Review, 2004, 11 : 192 - 196
  • [3] Quantifying model selection uncertainty via bootstrapping and Akaike weights
    Rigdon, Edward
    Sarstedt, Marko
    Moisescu, Ovidiu-Ioan
    INTERNATIONAL JOURNAL OF CONSUMER STUDIES, 2023, 47 (04) : 1596 - 1608
  • [4] Screening and selection for quantile regression using an alternative measure of variable importance
    Kong, Yinfei
    Li, Yujie
    Zerom, Dawit
    JOURNAL OF MULTIVARIATE ANALYSIS, 2019, 173 : 435 - 455
  • [5] Variables Selection for Aboveground Biomass Estimations Using Satellite Data: A Comparison between Relative Importance Approach and Stepwise Akaike's Information Criterion
    Libertad Adame-Campos, Rita
    Ghilardi, Adrian
    Gao, Yan
    Paneque-Galvez, Jaime
    Mas, Jean-Francois
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2019, 8 (06)
  • [6] Industrial PLS model variable selection using moving window variable importance in projection
    Lu, Bo
    Castillo, Ivan
    Chiang, Leo
    Edgar, Thomas F.
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2014, 135 : 90 - 109
  • [7] Cloud Manufacturing Service Selection Model Based on Adaptive Variable Evaluation Metrics
    Cui, Jin
    Ren, Lei
    Zhang, Lin
    THEORY, METHODOLOGY, TOOLS AND APPLICATIONS FOR MODELING AND SIMULATION OF COMPLEX SYSTEMS, PT III, 2016, 645 : 13 - 19
  • [8] Leveraging Model Inherent Variable Importance for Stable Online Feature Selection
    Haug, Johannes
    Pawelczyk, Martin
    Broelemann, Klaus
    Kasneci, Gjergji
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 1478 - 1488
  • [9] Statistical model choice including variable selection based on variable importance: A relevant way for biomarkers selection to predict meat tenderness
    M. P. Ellies-Oury
    M. Chavent
    A. Conanec
    M. Bonnet
    B. Picard
    J. Saracco
    Scientific Reports, 9
  • [10] Statistical model choice including variable selection based on variable importance: A relevant way for biomarkers selection to predict meat tenderness
    Ellies-Oury, M. P.
    Chavent, M.
    Conanec, A.
    Bonnet, M.
    Picard, B.
    Saracco, J.
    SCIENTIFIC REPORTS, 2019, 9 (1)