Filter-Based Factor Selection Methods in Partial Least Squares Regression

被引:11
作者
Mehmood, Tahir [1 ]
Sadiq, Maryam [2 ,3 ]
Aslam, Muhammad [3 ]
机构
[1] NUST, SNS, Islamabad 44000, Pakistan
[2] Univ Azad Jammu & Kashmir, Dept Stat, Muzaffarabad 13100, Pakistan
[3] Riphah Int Univ, Dept Math & Stat, Islamabad 45210, Pakistan
关键词
Factor selection; filter; partial least squares; regression; VARIABLE IMPORTANCE; NUTRITIONAL-STATUS; CHILD;
D O I
10.1109/ACCESS.2019.2948782
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Factor discovery of high-dimensional data is a crucial problem and extremely challenging from a scientific viewpoint with enormous applications in research studies. In this study, the main focus is to introduce the improved subset factor selection method and hence, 9 subset selection methods for partial least squares regression (PLSR) based on filter factor subset selection approach are proposed. Existing and proposed methods are compared in terms of accuracy, sensitivity, F1 score and number of selected factors over the simulated data set. Further, these methods are practiced on a real data set of nutritional status of children obtained from Pakistan Demographic and Health Survey (PDHS) by addressing performance using a Monte Carlo algorithm. The optimal method is implemented to assess the important factors of nutritional status of children. Dispersion importance (DIMP) factor selection index for PLSR is observed to be a more efficient method regarding accuracy and number of selected factors. The recommended factors contain key information for the nutritional status of children and could be useful in related research.
引用
收藏
页码:153499 / 153508
页数:10
相关论文
共 50 条
[31]   Tensor Envelope Partial Least-Squares Regression [J].
Zhang, Xin ;
Li, Lexin .
TECHNOMETRICS, 2017, 59 (04) :426-436
[32]   Some theoretical aspects of partial least squares regression [J].
Helland, IS .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2001, 58 (02) :97-107
[33]   Partial least squares regression and projection on latent structure regression (PLS Regression) [J].
Abdi, Herve .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (01) :97-106
[34]   STABILITY OF TWO DIRECT METHODS FOR BIDIAGONALIZATION AND PARTIAL LEAST SQUARES [J].
Bjorck, Ake .
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2014, 35 (01) :279-291
[35]   Sparse matrix transform based weight updating in partial least squares regression [J].
Jiangtao Peng .
Journal of Mathematical Chemistry, 2014, 52 :2197-2209
[36]   Sparse matrix transform based weight updating in partial least squares regression [J].
Peng, Jiangtao .
JOURNAL OF MATHEMATICAL CHEMISTRY, 2014, 52 (08) :2197-2209
[37]   Use of the bootstrap and permutation methods for a more robust variable importance in the projection metric for partial least squares regression [J].
Afanador, N. L. ;
Tran, T. N. ;
Buydens, L. M. C. .
ANALYTICA CHIMICA ACTA, 2013, 768 :49-56
[38]   Sparse partial least squares regression for on-line variable selection with multivariate data streams [J].
McWilliams B. ;
Montana G. .
Statistical Analysis and Data Mining, 2010, 3 (03) :170-193
[39]   Toward Power Analysis for Partial Least Squares-Based Methods [J].
Andreella, Angela ;
Finos, Livio ;
Scarpa, Bruno ;
Stocchero, Matteo .
BIOMETRICAL JOURNAL, 2025, 67 (02)
[40]   Selection of variables based on most stable normalised partial least squares regression coefficients in an ensemble Monte Carlo procedure [J].
Esquerre, Carlos ;
Gowen, Aoife A. ;
Downey, Gerard ;
O'Donnell, Calm P. .
JOURNAL OF NEAR INFRARED SPECTROSCOPY, 2011, 19 (06) :443-450