The R Package Ecosystem for Robust Statistics

被引:0
|
作者
Todorov, Valentin [1 ]
机构
[1] United Nations Ind Dev Org UNIDO, Vienna, Austria
来源
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS | 2024年 / 16卷 / 06期
关键词
high dimensions; multivariate; outlier; R; robust; PRINCIPAL COMPONENT ANALYSIS; PROJECTION-PURSUIT APPROACH; MULTIVARIATE LOCATION; OUTLIER DETECTION; FAST ALGORITHM; REGRESSION; ESTIMATORS; COVARIANCE; DISPERSION; SCATTER;
D O I
10.1002/wics.70007
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In the last few years, the number of R packages implementing different robust statistical methods have increased substantially. There are now numerous packages for computing robust multivariate location and scatter, robust multivariate analysis like principal components and discriminant analysis, robust linear models, and other algorithms dedicated to cope with outliers and other irregularities in the data. This abundance of package options may be overwhelming for both beginners and more experienced R users. Here we provide an overview of the most important 25 R packages for different tasks. As metrics for the importance of each package, we consider its maturity and history, the number of total and average monthly downloads from CRAN (The Comprehensive R Archive Network), and the number of reverse dependencies. Then we briefly describe what each of these package does. After that we elaborate on the several above-mentioned topics of robust statistics, presenting the methodology and the implementation in R and illustrating the application on real data examples. Particular attention is paid to the robust methods and algorithms suitable for high-dimensional data. The code for all examples is accessible on the GitHub repository .
引用
收藏
页数:30
相关论文
共 50 条
  • [1] Anomaly detection by robust statistics
    Rousseeuw, Peter J.
    Hubert, Mia
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 8 (02)
  • [2] Robust statistics for outlier detection
    Rousseeuw, Peter J.
    Hubert, Mia
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (01) : 73 - 79
  • [3] Robust Estimation of the Generalized Loggamma Model: The R Package robustloggamma
    Agostinelli, Claudio
    Marazzi, Alfio
    Yohai, Victor J.
    Randriamiharisoa, Alex
    JOURNAL OF STATISTICAL SOFTWARE, 2016, 70 (07): : 1 - 21
  • [4] geneHapR: an R package for gene haplotypic statistics and visualization
    Renliang Zhang
    Guanqing Jia
    Xianmin Diao
    BMC Bioinformatics, 24
  • [5] Robust Mediation Analysis: The R Package robmed
    Alfons, Andreas
    Ates, Nufer Y.
    Groenen, Patrick J. F.
    JOURNAL OF STATISTICAL SOFTWARE, 2022, 103 (13): : 1 - 45
  • [6] geneHapR: an R package for gene haplotypic statistics and visualization
    Zhang, Renliang
    Jia, Guanqing
    Diao, Xianmin
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [7] Robust Analysis of Sample Selection Models through the R Package ssmrob
    Zhelonkin, Mikhail
    Ronchetti, Elvezio
    JOURNAL OF STATISTICAL SOFTWARE, 2021, 99 (04): : 1 - 35
  • [8] Meta-Statistics for Variable Selection: The R Package BioMark
    Wehrens, Ron
    Franceschi, Pietro
    JOURNAL OF STATISTICAL SOFTWARE, 2012, 51 (10): : 1 - 18
  • [9] GWASBrewer: An R Package for Simulating Realistic GWAS Summary Statistics
    Morrison, Jean
    GENETIC EPIDEMIOLOGY, 2025, 49 (01)
  • [10] Evolution and prospects of the Comprehensive R Archive Network (CRAN) package ecosystem
    Mora-Cantallops, Marcal
    Sicilia, Miguel-angel
    Garcia-Barriocanal, Elena
    Sanchez-Alonso, Salvador
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2020, 32 (11)