Bacterial colony algorithm with adaptive attribute learning strategy for feature selection in classification of customers for personalized recommendation

被引:11
作者
Wang, Hong [1 ]
Niu, Ben [1 ,2 ,3 ]
Tan, Lijing [4 ]
机构
[1] Shenzhen Univ, Coll Management, Shenzhen, Peoples R China
[2] Shenzhen Univ, Inst Big Data Intelligent Management & Decis, Shenzhen, Peoples R China
[3] Shenzhen Univ, Great Bay Area Int Inst Innovat, Shenzhen, Peoples R China
[4] Shenzhen Inst Informat Technol, Dept Business Management, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; Bacterial Colony Optimization; Recommendation system; Classification; OPTIMIZATION; INFORMATION; FRAMEWORK; MAX;
D O I
10.1016/j.neucom.2020.07.142
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates a new bacterial colony-based feature selection algorithm to improve the classifi-cation accuracy of customers for personalized products recommendation. An attribute learning strategy is developed in this study to update the feature related population. Specifically, the features can be weighted according to their historic contributions to both the individual-and group-based subsets. Additionally, the frequency of appearance is also recorded for the feature candidates to improve the diversity of feature distribution and avoid the over-fitting. Based on the weight-based feature indexes and frequency of appearance records, the performance of feature subsets are enhanced by replacing the features being repeatedly appeared in a same vector. To explore the feasibility of the proposed method for the missing feature problems, the objective of the optimization is to minimize the classifica-tion error using the acceptable number of features. K-Nearest Neighbor is employed as the learning tech-nique to cooperate with the proposed feature selection method. The effectiveness of the proposed feature selection method is demonstrated by performing test on the datasets from UCI machine learning repos-itory and real-world data from Amazon customer reviews of products. Compared with other seven fea-ture selections methods, the proposed feature selection algorithm outperforms the other algorithms by achieving higher classification accuracy rate using smaller features. (c) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:747 / 755
页数:9
相关论文
共 32 条
[1]  
[Anonymous], 2011, INT J ARTIFICIAL INT
[2]  
Ben Niu, 2012, Intelligent Computing Technology. Proceedings 8th International Conference, ICIC 2012, P649, DOI 10.1007/978-3-642-31588-6_83
[3]  
Brown G, 2012, J MACH LEARN RES, V13, P27
[4]   Optimized Parameter Search for Large Datasets of the Regularization Parameter and Feature Selection for Ridge Regression [J].
Buteneers, Pieter ;
Caluwaerts, Ken ;
Dambre, Joni ;
Verstraeten, David ;
Schrauwen, Benjamin .
NEURAL PROCESSING LETTERS, 2013, 38 (03) :403-416
[5]  
Cho J., 2011, COMMUN COMPUT PHYS, V199, P238, DOI DOI 10.1007/978-3-642-23312-8_30
[6]   Feature selection techniques with class separability for multivariate time series [J].
Han, Min ;
Liu, Xiaoxin .
NEUROCOMPUTING, 2013, 110 :29-34
[7]  
Huang D., 2009, The Study of Data Mining Methods for Gene Expression Profiles
[8]   Object-oriented feature selection of high spatial resolution images using an improved Relief algorithm [J].
Jia, Jianhua ;
Yang, Ning ;
Zhang, Chao ;
Yue, Anzhi ;
Yang, Jianyu ;
Zhu, Dehai .
MATHEMATICAL AND COMPUTER MODELLING, 2013, 58 (3-4) :619-626
[9]   Random Walk-Based Solution to Triple Level Stochastic Point Location Problem [J].
Jiang, Wen ;
Huang, De-Shuang ;
Li, Shenghong .
IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (06) :1438-1451
[10]   Hybrid Bacterial Foraging and Particle Swarm Optimization for detecting Bundle Branch Block [J].
Kora, Padmavathi ;
Kalva, Sri Ramakrishna .
SPRINGERPLUS, 2015, 4