Filter-Based Factor Selection Methods in Partial Least Squares Regression

被引:9
作者
Mehmood, Tahir [1 ]
Sadiq, Maryam [2 ,3 ]
Aslam, Muhammad [3 ]
机构
[1] NUST, SNS, Islamabad 44000, Pakistan
[2] Univ Azad Jammu & Kashmir, Dept Stat, Muzaffarabad 13100, Pakistan
[3] Riphah Int Univ, Dept Math & Stat, Islamabad 45210, Pakistan
关键词
Factor selection; filter; partial least squares; regression; VARIABLE IMPORTANCE; NUTRITIONAL-STATUS; CHILD;
D O I
10.1109/ACCESS.2019.2948782
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Factor discovery of high-dimensional data is a crucial problem and extremely challenging from a scientific viewpoint with enormous applications in research studies. In this study, the main focus is to introduce the improved subset factor selection method and hence, 9 subset selection methods for partial least squares regression (PLSR) based on filter factor subset selection approach are proposed. Existing and proposed methods are compared in terms of accuracy, sensitivity, F1 score and number of selected factors over the simulated data set. Further, these methods are practiced on a real data set of nutritional status of children obtained from Pakistan Demographic and Health Survey (PDHS) by addressing performance using a Monte Carlo algorithm. The optimal method is implemented to assess the important factors of nutritional status of children. Dispersion importance (DIMP) factor selection index for PLSR is observed to be a more efficient method regarding accuracy and number of selected factors. The recommended factors contain key information for the nutritional status of children and could be useful in related research.
引用
收藏
页码:153499 / 153508
页数:10
相关论文
共 50 条
  • [1] A partition-based variable selection in partial least squares regression
    Li, Chuan-Quan
    Fang, Zhaoyu
    Xu, Qing-Song
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2020, 198
  • [2] Use of Partial Least Squares Regression for Variable Selection and Quality Prediction
    Jun, Chi-Hyuck
    Lee, Sang-Ho
    Park, Hae-Sang
    Lee, Jeong-Hwa
    CIE: 2009 INTERNATIONAL CONFERENCE ON COMPUTERS AND INDUSTRIAL ENGINEERING, VOLS 1-3, 2009, : 1302 - 1307
  • [3] Study of partial least squares and ridge regression methods
    Firinguetti, Luis
    Kibria, Golam
    Araya, Rodrigo
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2017, 46 (08) : 6631 - 6644
  • [4] Model selection for partial least squares based dimension reduction
    Li, Guo-Zheng
    Zhao, Rui-Wei
    Qu, Hai-Ni
    You, Mingyu
    PATTERN RECOGNITION LETTERS, 2012, 33 (05) : 524 - 529
  • [5] Partial least squares improvement and research principal component regression extraction methods
    Xiong, Wangping
    Du, Jianqiang
    Nie, Wang
    2014 IEEE 11TH INTL CONF ON UBIQUITOUS INTELLIGENCE AND COMPUTING AND 2014 IEEE 11TH INTL CONF ON AUTONOMIC AND TRUSTED COMPUTING AND 2014 IEEE 14TH INTL CONF ON SCALABLE COMPUTING AND COMMUNICATIONS AND ITS ASSOCIATED WORKSHOPS, 2014, : 583 - 585
  • [7] Partial least trimmed squares regression
    Xie, Zhonghao
    Feng, Xi'an
    Chen, Xiaojing
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2022, 221
  • [8] Envelopes and partial least squares regression
    Cook, R. D.
    Helland, I. S.
    Su, Z.
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2013, 75 (05) : 851 - 877
  • [9] Partial least median of squares regression
    Xie, Zhonghao
    Feng, Xi'an
    Li, Limin
    Chen, Xiaojing
    JOURNAL OF CHEMOMETRICS, 2022, 36 (08)
  • [10] Computing Frechet derivatives in partial least squares regression
    Elden, Lars
    LINEAR ALGEBRA AND ITS APPLICATIONS, 2015, 473 : 316 - 338