LIC: An R package for optimal subset selection for distributed data

被引:0
|
作者
Chang, Di [1 ]
Guo, Guangbao [1 ]
机构
[1] Shandong Univ Technol, Sch Math & Stat, Zibo, Peoples R China
关键词
R package; Optimal subset selection; Redundant information; Parallel computing; REGRESSION;
D O I
10.1016/j.softx.2024.101909
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The goal of the Length and Information Optimization Criterion (LIC) is to handle datasets containing redundant information, identify and select the most informative subsets, and ensure that a large portion of the information from the dataset is retained. The proposed R package, called LIC, is specifically designed for optimal subset selection in distributed redundant data. It achieves this by minimizing the length of the final interval estimator while maximizing the amount of information retained from the selected data subset. This functionality is highly useful across various fields such as economics, industry, and medicine. For example, in studies involving the prediction of nitrogen oxide emissions from gas turbines, self-noise of airfoils under stochastic wind conditions, and real estate valuation predictions, LIC can be used to explore the performance of random distributed block methods in parallel computing environments.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] LIC criterion for optimal subset selection in distributed interval estimation
    Guo, Guangbao
    Sun, Yue
    Qian, Guoqi
    Wang, Qian
    JOURNAL OF APPLIED STATISTICS, 2023, 50 (09) : 1900 - 1920
  • [2] TLIC: An R package for the LIC for T distribution regression analysis
    Jing, Guofu
    Guo, Guangbao
    SOFTWAREX, 2025, 30
  • [3] The COR criterion for optimal subset selection in distributed estimation
    Guo, Guangbao
    Song, Haoyue
    Zhu, Lixing
    STATISTICS AND COMPUTING, 2024, 34 (05)
  • [4] Communication-efficient estimation for distributed subset selection
    Chen, Yan
    Dong, Ruipeng
    Wen, Canhong
    STATISTICS AND COMPUTING, 2023, 33 (06)
  • [5] Wrapped: An R package for circular data
    Nadarajah, Saralees
    Zhang, Yuanyuan
    PLOS ONE, 2017, 12 (12):
  • [6] Optimal column subset selection for image classification by genetic algorithms
    Kroemer, Pavel
    Platos, Jan
    Nowakova, Jana
    Snasel, Vaclav
    ANNALS OF OPERATIONS RESEARCH, 2018, 265 (02) : 205 - 222
  • [7] good : An R package for modelling count data
    Agis, David
    Tur, Jordi
    Morina, David
    Puig, Pedro
    Fernandez-Fontelo, Amanda
    METHODS IN ECOLOGY AND EVOLUTION, 2024, 15 (12): : 2192 - 2197
  • [8] MonteCarloSEM: An R Package to Simulate Data for SEM
    Orcan, Fatih
    INTERNATIONAL JOURNAL OF ASSESSMENT TOOLS IN EDUCATION, 2021, 8 (03): : 704 - 713
  • [9] Meta-Statistics for Variable Selection: The R Package BioMark
    Wehrens, Ron
    Franceschi, Pietro
    JOURNAL OF STATISTICAL SOFTWARE, 2012, 51 (10): : 1 - 18
  • [10] Optimal Subset Selection of Stochastic Model Using Statistical Hypothesis Test
    Choi, Seon Han
    Kim, Tag Gon
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2018, 48 (04): : 557 - 564