Strategies to improve variable selection performance

被引:0
|
作者
Wang, HJ [1 ]
Parrish, A [1 ]
Smith, RK [1 ]
Vrbsky, S [1 ]
机构
[1] Univ Alabama, Dept Comp Sci, Tuscaloosa, AL 35487 USA
来源
IKE '05: PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE ENGINEERING | 2005年
关键词
variable selection; disk storage; column major order; performance;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The increasingly large datasets available for data mining and machine learning task are placing a premium on algorithm performance. The datasets are increasing in both rows (i.e., records) and columns (i.e., variables). One critical item that impacts the performance of these algorithms is the approach taken for storing and processing the data elements. This paper examines the performance tradeoffs between row major order and column major order in the context of heuristic variable selection for the case when variable selection is frequently used in some applications. The column major order approach is empirically shown to provide better runtime performance than the row major order.
引用
收藏
页码:209 / 214
页数:6
相关论文
共 50 条
  • [31] Benchmarking Variable Selection in QSAR
    Eklund, Martin
    Norinder, Ulf
    Boyer, Scott
    Carlsson, Lars
    MOLECULAR INFORMATICS, 2012, 31 (02) : 173 - 179
  • [32] Variable selection by genetic algorithms
    Zhang, Y
    Zhu, EY
    Zhuang, ZX
    Wang, XR
    CHEMICAL JOURNAL OF CHINESE UNIVERSITIES-CHINESE, 1999, 20 (09): : 1371 - 1375
  • [33] Variable Selection for Clustering and Classification
    Andrews, Jeffrey L.
    McNicholas, Paul D.
    JOURNAL OF CLASSIFICATION, 2014, 31 (02) : 136 - 153
  • [34] Variable Selection for Clustering and Classification
    Jeffrey L. Andrews
    Paul D. McNicholas
    Journal of Classification, 2014, 31 : 136 - 153
  • [35] Using Kendall's tau(b) correlations to improve variable selection methods in case-control studies
    OGorman, TW
    Woolson, RF
    BIOMETRICS, 1995, 51 (04) : 1451 - 1460
  • [36] Variable selection for mode regression
    Chen, Yingzhen
    Ma, Xuejun
    Zhou, Jingke
    JOURNAL OF APPLIED STATISTICS, 2018, 45 (06) : 1077 - 1084
  • [37] Variable selection in linear regression
    Lindsey, Charles
    Sheather, Simon
    STATA JOURNAL, 2010, 10 (04) : 650 - 669
  • [38] STABILIZING VARIABLE SELECTION AND REGRESSION
    Pfister, Niklas
    Williams, Evan G.
    Peters, Jonas
    Aebersold, Ruedi
    Buehlmann, Peter
    ANNALS OF APPLIED STATISTICS, 2021, 15 (03) : 1220 - 1246
  • [39] VARIABLE SELECTION IN QUANTILE REGRESSION
    Wu, Yichao
    Liu, Yufeng
    STATISTICA SINICA, 2009, 19 (02) : 801 - 817
  • [40] Variable selection by ensembles for the Cox
    Zhu, Mu
    Fan, Guangzhe
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2011, 81 (12) : 1983 - 1992