Filter-Based Factor Selection Methods in Partial Least Squares Regression

被引:11
作者
Mehmood, Tahir [1 ]
Sadiq, Maryam [2 ,3 ]
Aslam, Muhammad [3 ]
机构
[1] NUST, SNS, Islamabad 44000, Pakistan
[2] Univ Azad Jammu & Kashmir, Dept Stat, Muzaffarabad 13100, Pakistan
[3] Riphah Int Univ, Dept Math & Stat, Islamabad 45210, Pakistan
关键词
Factor selection; filter; partial least squares; regression; VARIABLE IMPORTANCE; NUTRITIONAL-STATUS; CHILD;
D O I
10.1109/ACCESS.2019.2948782
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Factor discovery of high-dimensional data is a crucial problem and extremely challenging from a scientific viewpoint with enormous applications in research studies. In this study, the main focus is to introduce the improved subset factor selection method and hence, 9 subset selection methods for partial least squares regression (PLSR) based on filter factor subset selection approach are proposed. Existing and proposed methods are compared in terms of accuracy, sensitivity, F1 score and number of selected factors over the simulated data set. Further, these methods are practiced on a real data set of nutritional status of children obtained from Pakistan Demographic and Health Survey (PDHS) by addressing performance using a Monte Carlo algorithm. The optimal method is implemented to assess the important factors of nutritional status of children. Dispersion importance (DIMP) factor selection index for PLSR is observed to be a more efficient method regarding accuracy and number of selected factors. The recommended factors contain key information for the nutritional status of children and could be useful in related research.
引用
收藏
页码:153499 / 153508
页数:10
相关论文
共 50 条
[41]   FEATURE SELECTION/VISUALISATION OF ADNI DATA WITH ITERATIVE PARTIAL LEAST SQUARES [J].
Wasterlid, Torbjorn ;
Bai, Li .
2014 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIG DATA (CIBD), 2014, :46-53
[42]   Distance-Based Partial Least Squares Analysis [J].
Krishnan, Anjali ;
Kriegeskorte, Nikolaus ;
Abdi, Herve .
NEW PERSPECTIVES IN PARTIAL LEAST SQUARES AND RELATED METHODS, 2013, 56 :131-145
[43]   ENVELOPE-BASED SPARSE PARTIAL LEAST SQUARES [J].
Zhu, Guangyu ;
Su, Zhihua .
ANNALS OF STATISTICS, 2020, 48 (01) :161-182
[44]   Variable selection in random calibration of near-infrared instruments: ridge regression and partial least squares regression settings [J].
Gusnanto, A ;
Pawitan, Y ;
Huang, J ;
Lane, B .
JOURNAL OF CHEMOMETRICS, 2003, 17 (03) :174-185
[45]   Partial least squares based dimension reduction with gene selection for tumor classification [J].
Li, Guo-Zheng ;
Zeng, Xue-Qiang ;
Yang, Jack Y. ;
Yang, Mary Qu .
PROCEEDINGS OF THE 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, VOLS I AND II, 2007, :1439-+
[46]   Feature selection and partial least squares based dimension reduction for tumor classification [J].
Bu, Hua-Long ;
Li, Guo-Zheng ;
Zeng, Xue-Qiang ;
Yang, Jack Y. ;
Yang, Mary Qu .
PROCEEDINGS OF THE 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, VOLS I AND II, 2007, :967-+
[47]   Bankruptcy prediction using Partial Least Squares Logistic Regression [J].
Ben Jabeur, Sami .
JOURNAL OF RETAILING AND CONSUMER SERVICES, 2017, 36 :197-202
[48]   Stacked interval sparse partial least squares regression analysis [J].
Poerio, Dominic V. ;
Brown, Steven D. .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2017, 166 :49-60
[49]   Sensory profiling data studied by partial least squares regression [J].
Martens, M ;
Bredie, WLP ;
Martens, H .
FOOD QUALITY AND PREFERENCE, 2000, 11 (1-2) :147-149
[50]   Partial Least-squares Regression for Identification of Liquid Materials [J].
Li, Wei ;
Zhong, Yu ;
Zhang, Yu ;
Yu, Daoyang ;
Sun, Bai ;
Li, Minqiang ;
Liu, Jinhuai .
2010 SYMPOSIUM ON SECURITY DETECTION AND INFORMATION PROCESSING, 2010, 7 :130-134