Feature selection for semi-supervised multi-target regression using genetic algorithm

被引:0
作者
Farrukh Hasan Syed
Muhammad Atif Tahir
Muhammad Rafi
Mir Danish Shahab
机构
[1] National University of Computer and Emerging Sciences,
来源
Applied Intelligence | 2021年 / 51卷
关键词
Multi-target learning; Feature selection; Regression; Semi-supervised learning; Genetic algorithm;
D O I
暂无
中图分类号
学科分类号
摘要
Multi-target regression (MTR) is an exciting area of machine learning where the challenge is to predict the values of more than one target variables which can take on continuous values. These variables may or may not be correlated. Such problems commonly occur in real life scenarios, and therefore, interest and research in this area has increased in recent times. Some examples of applications include analyzing brain-activity data gathered using multimedia sensors, stock information from continuous web data, data related to characteristics of the vegetation at a certain site, etc. For a real-world multi-target learning system, the problem can be further complicated when new issues emerge with very little data available. In such cases, a semi-supervised approach can be adopted. This paper proposes a Genetic Algorithm (GA) based semi-supervised technique on multi-target regression problems to predict new targets, using very small number of labelled examples by incorporating GA with MTR-SAFER. Experiments are carried out on real world MTR data sets. The proposed method isexplored with different variations and also compared with the state of the art MTR methods. Results have indicated a significantly better performance with the further benefit of having a reduced feature set.
引用
收藏
页码:8961 / 8984
页数:23
相关论文
共 189 条
[1]  
Altman N(2018)The curse (s) of dimensionality Nat Methods 15 399-400
[2]  
Krzywinski M(2020)Novel nonlinear hypothesis for the delta parallel robot modeling IEEE Access 8 46324-46334
[3]  
Aquino G(2020)Benchmark for filter methods for feature selection in high-dimensional classification data Computational Statistics & Data Analysis 143 106839-233
[4]  
Rubio JDJ(2015)A survey on multi-output regression Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 5 216-14
[5]  
Pacheco J(2013)Hybrid evolutionary particle swarm optimization and ant colony optimization for variable selection Series 3rd World Conference on Information Technology (WCIT-2012) 3 7-28
[6]  
Gutierrez GJ(2014)A survey on feature selection methods Computers & Electrical Engineering 40 16-542
[7]  
Ochoa G(2009)Semi-supervised learning (Chapelle, O. et al., Eds.; 2006) IEEE Transactions on Neural Networks 20 542-76048
[8]  
Balcazar R(2018)Feature selection for high dimensional data using monte carlo tree search IEEE Access 6 76036-4
[9]  
Cruz DR(2011)Multi-label problem transformation methods: a case study CLEI Electronic Journal 14 4-1309
[10]  
Garcia E(2009)Sofmls: online self-organizing fuzzy modified least-squares network Trans Fuz Sys 17 1296-407