The R Package Ecosystem for Robust Statistics

Cited by: 0
Author
Todorov, Valentin [1]
Affiliation
[1] United Nations Industrial Development Organization (UNIDO), Vienna, Austria
Source
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS | 2024 / Volume 16 / Issue 06
Keywords
high dimensions; multivariate; outlier; R; robust; PRINCIPAL COMPONENT ANALYSIS; PROJECTION-PURSUIT APPROACH; MULTIVARIATE LOCATION; OUTLIER DETECTION; FAST ALGORITHM; REGRESSION; ESTIMATORS; COVARIANCE; DISPERSION; SCATTER;
DOI
10.1002/wics.70007
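Chinese Library Classification (CLC) number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Discipline classification code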
020208 ; 070103 ; 0714 ;
Abstract
In the last few years, the number of R packages implementing different robust statistical methods has increased substantially. There are now numerous packages for computing robust multivariate location and scatter, robust multivariate analysis such as principal components and discriminant analysis, robust linear models, and other algorithms designed to cope with outliers and other irregularities in the data. This abundance of package options may be overwhelming for both beginners and more experienced R users. Here we provide an overview of the 25 most important R packages for different tasks. As metrics for the importance of each package, we consider its maturity and history, the total and average monthly number of downloads from CRAN (The Comprehensive R Archive Network), and the number of reverse dependencies. We then briefly describe what each of these packages does. After that we elaborate on the above-mentioned topics of robust statistics, presenting the methodology and its implementation in R and illustrating the application with real data examples. Particular attention is paid to robust methods and algorithms suitable for high-dimensional data. The code for all examples is accessible on the GitHub repository.
Pages: 30