Customer purchasing behavior prediction using machine learning classification techniques

被引:19
作者
Chaubey G. [1 ]
Gavhane P.R. [2 ]
Bisen D. [3 ]
Arjaria S.K. [1 ]
机构
[1] Department of Information Technology, Rajkiya Engineering College, Atarra, UP, Banda
[2] Dr. D Y Patil School of Engineering and Technology, Charoli (BK), via Lohegaon, Maharastra, Pune
[3] Department of Information Technology, Madhav Institute of Technology and Science, MP, Gwalior
关键词
Customer purchasing behavior prediction; E-commerce; Machine learning; Supervised classification algorithms;
D O I
10.1007/s12652-022-03837-6
中图分类号
学科分类号
摘要
Many sales and service-providing companies need to talk up related customers while launching the new products, services, and updated versions of existing products. While doing so, they need to target their existing customers. The behavior of these customers gives companies information about how to sell products. This paper presents a comparative study of different machine learning techniques that have been applied to the problem of customer purchasing behavior prediction. Experiments are done using supervised classification machine learning techniques like logistic regression, decision tree, k-nearest neighbors (KNN), Naïve Bayes, SVM, random forest, stochastic gradient descent (SGD), ANN, AdaBoost, XgBoost, and dummy classifier, as well as some hybrid algorithms that use stacking like SvmAda, RfAda, and KnnSgd. Models are evaluated using the cross-validation technique. Furthermore, the confusion matrix and ROC curve are used to calculate the accuracy of each model. Finally, the best classifier is a hybrid classifier using the ensemble stacking technique (KnnSgd), with an accuracy of 92.42%. KnnSgd gives the highest accuracy with maximum features because the error of the KNN and SGD are minimized by the KNN at the end. © 2022, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.
引用
收藏
页码:16133 / 16157
页数:24
相关论文
共 40 条
[1]  
Adebola Orogun B.O., Predicting consumer behaviour in digital market: a machine learning approach, Int J Innov Res Sci Eng Technol, 8, 8, pp. 8391-8402, (2019)
[2]  
Adeniyi D., Wei Z., Yongquan Y., Automated web usage data mining and recommendation system using k-nearest neighbor (KNN) classification method, Appl Comput Inform, 12, 1, pp. 90-108, (2016)
[3]  
Agatonovic-Kustrin S., Beresford R., Basic concepts of artificial neural network (ANN) modeling and its application in pharmaceutical research, J Pharm Biomed Anal, 22, 5, pp. 717-727, (2000)
[4]  
Ali J., Khan R., Ahmad N., Maqsood I., Random forests and decision trees, Int J Comp Sci, 9, 5, (2012)
[5]  
Alloghani M., Al-Jumeily D., Baker T., Hussain A., Mustafina J., Aljaaf A.J., Applications of machine learning techniques for software engineering learning and early prediction of students’ performance, . in Communications in Computer and Information Science, pp. 246-258, (2018)
[6]  
Amin A., Shah B., Khattak A.M., Baker T., Ur Rahman Durani H., Anwar S., Just-in-time customer churn prediction: Eith and without data transformation, 2018 IEEE Congress on Evolutionary Computation (CEC). IEEE, (2018)
[7]  
Bala R., Kumar D., Classification using ANN: a review, Int J Comput Intell Res, 13, 7, pp. 1811-1820, (2017)
[8]  
Bottou L., Large-scale machine learning with stochastic gradient descent, Proceedings of COMPSTAT’2010, pp. 177-186, (2010)
[9]  
Breiman L., Random forests, Mach Learn, 45, 1, pp. 5-32, (2001)
[10]  
Uci Machine Learning Repository, (2014)