Explainable Artificial Intelligence Solution for Online Retail

被引:2
作者
Javaid, Kumail [1 ]
Siddiqa, Ayesha [2 ]
Naqvi, Syed Abbas Zilqurnain [2 ]
Ditta, Allah [3 ]
Ahsan, Muhammad [2 ]
Khan, M. A. [4 ]
Mahmood, Tariq [5 ]
Khan, Muhammad Adnan [6 ]
机构
[1] Univ South Asia, Dept Comp Sci, Lahore 54000, Pakistan
[2] Univ Engn & Technol, Dept Mechatron & Control Engn, Lahore 54000, Pakistan
[3] Univ Educ, Div Sci & Technol, Dept Informat Sci, Lahore 54000, Pakistan
[4] Riphah Int Univ, Fac Comp, Riphah Sch Comp & Innovat, Lahore Campus, Lahore 54000, Pakistan
[5] Univ Educ, Div Sci & Technol, Lahore 54000, Pakistan
[6] Gachon Univ, Dept Software, Pattern Recognit & Machine Learning Lab, Seongnam 13557, South Korea
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2022年 / 71卷 / 03期
关键词
Explainable artificial intelligence; online retail; neural network; random forest regression; BEHAVIOR;
D O I
10.32604/cmc.2022.022984
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Artificial intelligence (AI) and machine learning (ML) help in making predictions and businesses to make key decisions that are beneficial for them. In the case of the online shopping business, it's very important to find trends in the data and get knowledge of features that helps drive the success of the business. In this research, a dataset of 12,330 records of customers has been analyzedwho visited an online shoppingwebsite over a period of one year. The main objective of this research is to find features that are relevant in terms of correctly predicting the purchasing decisions made by visiting customers and build ML models which could make correct predictions on unseen data in the future. The permutation feature importance approach has been used to get the importance of features according to the output variable (Revenue). Five ML models i.e., decision tree (DT), random forest (RF), extra tree (ET) classifier, Neural networks (NN), and Logistic regression (LR) have been used to make predictions on the unseen data in the future. The performance of each model has been discussed in detail using performance measurement techniques such as accuracy score, precision, recall, F1 score, and ROC-AUC curve. RF model is the bestmodel among all five chosen based on accuracy score of 90% and F1 score of 79% followed by extra tree classifier. Hence, our study indicates that RF model can be used by online retailing businesses for predicting consumer buying behaviour. Our research also reveals the importance of page value as a key feature for capturing online purchasing trends. This may give a clue to future businesses who can focus on this specific feature and can find key factors behind page value success which in turn will help the online shopping business.
引用
收藏
页码:4425 / 4442
页数:18
相关论文
共 22 条
  • [1] Performance study of classification algorithms for consumer online shopping attitudes and behavior using data mining
    Ahmed, Rana Alaa El-Deen
    Shehab, M. Elemam.
    Morsy, Shereen
    Mekawie, Nermeen
    [J]. 2015 FIFTH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT2015), 2015, : 1344 - 1349
  • [2] Permutation importance: a corrected feature importance measure
    Altmann, Andre
    Tolosi, Laura
    Sander, Oliver
    Lengauer, Thomas
    [J]. BIOINFORMATICS, 2010, 26 (10) : 1340 - 1347
  • [3] Real-Time Prediction of Online Shoppers' Purchasing Intention Using Random Forest
    Baati, Karim
    Mohsil, Mouad
    [J]. ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2020, PT I, 2020, 583 : 43 - 51
  • [4] Customer behavior in an online ordering application: A decision scoring model
    Boyer, KK
    Hult, GTM
    [J]. DECISION SCIENCES, 2005, 36 (04) : 569 - 598
  • [5] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [6] Charbuty B., 2021, J APPL SCI TECHNOL T, V2, P20, DOI DOI 10.38094/JASTT20165
  • [7] Predicting and explaining patronage behavior toward web and traditional stores using neural networks: a comparative analysis with logistic regression
    Chiang, WYK
    Zhang, DS
    Zhou, L
    [J]. DECISION SUPPORT SYSTEMS, 2006, 41 (02) : 514 - 531
  • [8] Effendy Veronikha, 2014, 2014 2nd International Conference on Information and Communication Technology (ICoICT), P325, DOI 10.1109/ICoICT.2014.6914086
  • [9] Hadden J., 2006, International Journal of Intelligent Systems and Technologies, V1, P104
  • [10] Classification of Retail Products: From Probabilistic Ranking to Neural Networks
    Hafez, Manar Mohamed
    Fernandez Vilas, Ana
    Redondo, Rebeca P. Diaz
    Pazo, Hector Olivera
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (09):