Strategies to improve variable selection performance

被引:0
|
作者
Wang, HJ [1 ]
Parrish, A [1 ]
Smith, RK [1 ]
Vrbsky, S [1 ]
机构
[1] Univ Alabama, Dept Comp Sci, Tuscaloosa, AL 35487 USA
来源
IKE '05: PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE ENGINEERING | 2005年
关键词
variable selection; disk storage; column major order; performance;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The increasingly large datasets available for data mining and machine learning task are placing a premium on algorithm performance. The datasets are increasing in both rows (i.e., records) and columns (i.e., variables). One critical item that impacts the performance of these algorithms is the approach taken for storing and processing the data elements. This paper examines the performance tradeoffs between row major order and column major order in the context of heuristic variable selection for the case when variable selection is frequently used in some applications. The column major order approach is empirically shown to provide better runtime performance than the row major order.
引用
收藏
页码:209 / 214
页数:6
相关论文
共 50 条
  • [21] PROVIDING AN APPLIED MODEL FOR ALIGNING HUMAN RESOURCE STRATEGIES WITH THE CORPORATE STRATEGIES OF CUSTOMS IN IRAN TO IMPROVE THE PERFORMANCE
    Rezaeian, Mohammad Reza
    Salajegheh, Sanjar
    Sayadi, Saeed
    REVISTA INCLUSIONES, 2019, 6 : 12 - 29
  • [22] Simultaneous variable selection
    Turlach, BA
    Venables, WN
    Wright, SJ
    TECHNOMETRICS, 2005, 47 (03) : 349 - 363
  • [23] Performance of variable selection methods in regression using variations of the Bayesian information criterion
    Burr, Tom
    Fry, Herb
    McVey, Brian
    Sander, Eric
    Cavanaugh, Joseph
    Neath, Andrew
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2008, 37 (03) : 507 - 520
  • [24] Variable Selection for Credit Risk Scoring on Loan Performance Using Regression Analysis
    Calibo, Dawn Iris
    Ballera, Melvin A.
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2019), 2019, : 746 - 750
  • [25] Performance of several variable-selection methods applied to real ecological data
    Murtaugh, Paul A.
    ECOLOGY LETTERS, 2009, 12 (10) : 1061 - 1068
  • [26] Regularization and variable selection in Heckman selection model
    Emmanuel O. Ogundimu
    Statistical Papers, 2022, 63 : 421 - 439
  • [27] Regularization and variable selection in Heckman selection model
    Ogundimu, Emmanuel O.
    STATISTICAL PAPERS, 2022, 63 (02) : 421 - 439
  • [28] Comparison of the variable importance in projection (VIP) and of the selectivity ratio (SR) methods for variable selection and interpretation
    Farres, Mireia
    Platikanov, Stefan
    Tsakovski, Stefan
    Tauler, Roma
    JOURNAL OF CHEMOMETRICS, 2015, 29 (10) : 528 - 536
  • [29] Boosting the Performance of Genetic Algorithms for Variable Selection in Partial Least Squares Spectral Calibrations
    Lavine, Barry K.
    White, Collin G.
    APPLIED SPECTROSCOPY, 2017, 71 (09) : 2092 - 2101
  • [30] A new criterion for variable selection
    Philips, R
    Guttman, I
    STATISTICS & PROBABILITY LETTERS, 1998, 38 (01) : 11 - 19