A comparative study of different variable selection methods based on numerical simulation and empirical analysis

被引:9
|
作者
Hou, Dake [1 ]
Zhou, Wenli [2 ]
Zhang, Qiuxia [3 ]
Zhang, Kun [3 ]
Fang, Jiaqi [2 ]
机构
[1] Shandong Univ, Sch Math, Jinan, Peoples R China
[2] Wenzhou Univ, Sch Business, Wenzhou, Peoples R China
[3] Beijing Jiaotong Univ, Sch Econ & Management, Beijing, Peoples R China
关键词
Linear random effect model; Variable selection; Coefficient consistency; Prediction accuracy; Boxplot; Stability; LASSO;
D O I
10.7717/peerj-cs.1522
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study employs the principles of computer science and statistics to evaluate the efficacy of the linear random effect model, utilizing Lasso variable selection techniques (including Lasso, Elastic-Net, Adaptive-Lasso, and SCAD) through numerical simulation and empirical research. The analysis focuses on the model's consistency in variable selection, prediction accuracy, stability, and efficiency. This study employs a novel approach to assess the consistency of variable selection across models. Specifically, the angle between the actual coefficient vector beta and the estimated coefficient vector (beta) over cap is computed to determine the degree of consistency. Additionally, the boxplot tool of statistical analysis is utilized to visually represent the distribution of model prediction accuracy data and variable selection consistency. The comparative stability of each model is assessed based on the frequency of outliers. This study conducts comparative experiments of numerical simulation to evaluate a proposed model evaluation method against commonly used analysis methods. The results demonstrate the effectiveness and correctness of the proposed method, highlighting its ability to conveniently analyze the stability and efficiency of each fitting model.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] Empirical Bayes methods in variable selection
    Bar, Haim
    Liu, Kangyan
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2019, 11 (02)
  • [2] SIMULATION STUDY ON SOME VARIABLE SELECTION PENALIZED METHODS
    Ali, Hayder Nadhim Mohamed
    Flaih, Ahmad Naeem
    INTERNATIONAL JOURNAL OF AGRICULTURAL AND STATISTICAL SCIENCES, 2023, 19 : 1277 - 1284
  • [3] The Two Comparative Study of Methods Based on the Financial Risk Empirical Analysis
    Wang Xiuwen
    Chen Xizhen
    PROCEEDINGS OF THE 3RD (2011) INTERNATIONAL CONFERENCE ON FINANCIAL RISK AND CORPORATE FINANCE MANAGEMENT, VOLS 1 AND 2, 2011, : 796 - 799
  • [4] Server selection methods in personal metasearch: a comparative empirical study
    Thomas, Paul
    Hawking, David
    INFORMATION RETRIEVAL, 2009, 12 (05): : 581 - 604
  • [5] Server selection methods in personal metasearch: a comparative empirical study
    Paul Thomas
    David Hawking
    Information Retrieval, 2009, 12 : 581 - 604
  • [6] ADEQUACY OF NUMERICAL TAXONOMIC METHODS - A COMPARATIVE-STUDY BASED ON SIMULATION EXPERIMENTS
    HEIJERMAN, T
    ZEITSCHRIFT FUR ZOOLOGISCHE SYSTEMATIK UND EVOLUTIONSFORSCHUNG, 1992, 30 (01): : 1 - 20
  • [7] Comparative Study of Synthetic Jet Numerical Simulation Methods
    Tang, Shang Qin
    Huang, Chang Qiang
    APPLIED MECHANICS AND MATERIALS I, PTS 1-3, 2013, 275-277 : 486 - 490
  • [8] An Analysis of Different Variable Selection Methods in the Context of a Banking Institution
    Velazquez Juarez, Jaime
    Lage Ramirez, Ana Elisa
    INDUSTRIAL ENGINEERING AND OPERATIONS MANAGEMENT, XXVIII IJCIEOM, 2022, 400 : 423 - 432
  • [9] Comparative Study of Different Techniques for Numerical Reservoir Simulation
    Kasiri, N.
    Bashiri, A.
    PETROLEUM SCIENCE AND TECHNOLOGY, 2010, 28 (05) : 494 - 503
  • [10] A comparative analysis of feature selection methods for ensembles with different combination methods
    Santana, Laura Enunanuella A.
    de Oliveira, Diogo F.
    Canuto, Anne M. P.
    de Soutol, Marcilio C. P.
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 643 - 648