Feature selection based on an improved cat swarm optimization algorithm for big data classification

被引:0
作者
Kuan-Cheng Lin
Kai-Yuan Zhang
Yi-Hung Huang
Jason C. Hung
Neil Yen
机构
[1] National Chung Hsing University,Department of Management Information Systems
[2] National Taichung University of Education,Department of Mathematics Education
[3] Overseas Chinese University,Department of Information Technology
[4] The University of Aizu,School of Computer Science and Engineering
来源
The Journal of Supercomputing | 2016年 / 72卷
关键词
Cat swarm optimization; Feature selection; Support vector machine; Big data classification;
D O I
暂无
中图分类号
学科分类号
摘要
Feature selection, which is a type of optimization problem, is generally achieved by combining an optimization algorithm with a classifier. Genetic algorithms and particle swarm optimization (PSO) are two commonly used optimal algorithms. Recently, cat swarm optimization (CSO) has been proposed and demonstrated to outperform PSO. However, CSO is limited by long computation times. In this paper, we modify CSO to present an improved algorithm, ICSO. We then apply the ICSO algorithm to select features in a text classification experiment for big data. Results show that the proposed ICSO outperforms traditional CSO. For big data classification, the results show that using term frequency-inverse document frequency (TF-IDF) with ICSO for feature selection is more accurate than using TF-IDF alone.
引用
收藏
页码:3210 / 3221
页数:11
相关论文
共 14 条
[1]  
Chu SC(2007)Computational intelligence based on the behavior of cats Int J Innov Comput Inf Control 3 163-173
[2]  
Tsai PW(2013)A novel cat swarm optimization algorithm for unconstrained optimization problems Inf Technol Comput Sci 5 32-41
[3]  
Orouskhani M(1995)Support-vector networks Mach Learn 20 273-297
[4]  
Orouskhani Y(1997)On comparing classifiers: pitfalls to avoid and a recommended approach Data Min Knowl Discov 1 317-328
[5]  
Mansouri M(1995)Support-vector networks Mach Learn 20 273-297
[6]  
Teshnehlab M(2012)Adaptive SVM-based classification systems based on the improved endocrine-based PSO algorithm Lect Notes Comput Sci 7669 543-552
[7]  
Cortes C(undefined)undefined undefined undefined undefined-undefined
[8]  
Vapnik V(undefined)undefined undefined undefined undefined-undefined
[9]  
Salzberg SL(undefined)undefined undefined undefined undefined-undefined
[10]  
Cortes C(undefined)undefined undefined undefined undefined-undefined