The R Package Ecosystem for Robust Statistics

被引:0
|
作者
Todorov, Valentin [1 ]
机构
[1] United Nations Ind Dev Org UNIDO, Vienna, Austria
来源
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS | 2024年 / 16卷 / 06期
关键词
high dimensions; multivariate; outlier; R; robust; PRINCIPAL COMPONENT ANALYSIS; PROJECTION-PURSUIT APPROACH; MULTIVARIATE LOCATION; OUTLIER DETECTION; FAST ALGORITHM; REGRESSION; ESTIMATORS; COVARIANCE; DISPERSION; SCATTER;
D O I
10.1002/wics.70007
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In the last few years, the number of R packages implementing different robust statistical methods have increased substantially. There are now numerous packages for computing robust multivariate location and scatter, robust multivariate analysis like principal components and discriminant analysis, robust linear models, and other algorithms dedicated to cope with outliers and other irregularities in the data. This abundance of package options may be overwhelming for both beginners and more experienced R users. Here we provide an overview of the most important 25 R packages for different tasks. As metrics for the importance of each package, we consider its maturity and history, the number of total and average monthly downloads from CRAN (The Comprehensive R Archive Network), and the number of reverse dependencies. Then we briefly describe what each of these package does. After that we elaborate on the several above-mentioned topics of robust statistics, presenting the methodology and the implementation in R and illustrating the application on real data examples. Particular attention is paid to the robust methods and algorithms suitable for high-dimensional data. The code for all examples is accessible on the GitHub repository .
引用
收藏
页数:30
相关论文
共 50 条
  • [21] thestats: An Open-Data R Package for Exploring Turkish Higher Education Statistics
    cavus, Mustafa
    Aydin, Olgun
    YUKSEKOGRETIM DERGISI, 2023, 13 (01): : 1 - 7
  • [22] rdrobust: An R Package for Robust Nonparametric Inference in Regression-Discontinuity Designs
    Calonico, Sebastian
    Cattaneo, Matias D.
    Titiunik, Rocio
    R JOURNAL, 2015, 7 (01): : 38 - 51
  • [23] robustlmm: An R Package for Robust Estimation of Linear Mixed-Effects Models
    Koller, Manuel
    JOURNAL OF STATISTICAL SOFTWARE, 2016, 75 (06): : 1 - 24
  • [24] RobPer: An R Package to Calculate Periodograms for Light Curves Based on Robust Regression
    Thieler, Anita M.
    Fried, Roland
    Rathjens, Jonathan
    JOURNAL OF STATISTICAL SOFTWARE, 2016, 69 (09): : 1 - 37
  • [25] A Theoretical Review of Modern Robust Statistics
    Loh, Po-Ling
    ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, 2025, 12 : 477 - 496
  • [26] The R Package HDSpatialScan for the Detection of Clusters of Multivariate and Functional Data using Spatial Scan Statistics
    Frevent, Camille
    Ahmed, Mohamed-Salem
    Soula, Julien
    Smida, Zaineb
    Cucala, Lionel
    Dabo-Niang, Sophie
    Genin, Michael
    R JOURNAL, 2022, 14 (03): : 95 - 120
  • [27] Use Of R in Statistics Lithuania
    Rudys, Tomas
    ROMANIAN STATISTICAL REVIEW, 2016, (02) : 119 - 124
  • [28] metaplus: An R Package for the Analysis of Robust Meta-Analysis and Meta-Regression
    Beath, Ken J.
    R JOURNAL, 2016, 8 (01): : 5 - 16
  • [29] CovSel: An R Package for Covariate Selection When Estimating Average Causal Effects
    Haggstrom, Jenny
    Persson, Emma
    Waernbaum, Ingeborg
    de Luna, Xavier
    JOURNAL OF STATISTICAL SOFTWARE, 2015, 68 (01):
  • [30] Computing the Oja Median in R: The Package OjaNP
    Fischer, Daniel
    Mosler, Karl
    Mottonen, Jyrki
    Nordhausen, Klaus
    Pokotylo, Oleksii
    Vogel, Daniel
    JOURNAL OF STATISTICAL SOFTWARE, 2020, 92 (08): : 1 - 36