Analysis of the Customer Churn Prediction Project in the Hotel Industry Based on Text Mining and the Random Forest Algorithm

Cited by: 1
Authors
Taherkhani, Leila [1]
Daneshvar, Amir [2]
Khalili, Hossein Amoozad [3]
Sanaei, Mohamad Reza [4]
Affiliations
[1] Islamic Azad Univ, Dept Informat Technol Management, Sci & Res Branch, Tehran, Iran
[2] Islamic Azad Univ, Dept Ind Management, Sci & Res Branch, Tehran, Iran
[3] Islamic Azad Univ, Dept Ind Engn, Sari Branch, Sari, Iran
[4] Islamic Azad Univ, Qazvin Branch, Coll Management & Econ, Dept Informat & Technol Management, Qazvin, Iran
DOI
10.1155/2023/6029121
Chinese Library Classification
TU [Architectural Science];
Discipline classification code
0813;
Abstract
A hotel's ability to differentiate itself from competitors and remain profitable depends on retaining customers through long-term relationships. Recent technological developments allow companies to access customers' opinions more quickly, predict their behavior, and prevent them from churning. Managing customer churn prediction projects has therefore become an important issue, especially in the hotel industry. This research analyzes projects that predict hotel customer churn in order to provide a model that helps hotel managers in this field. It presents an approach based on text mining of customers' comments in the Persian language, using the random forest algorithm for classification, which was considered the most effective method for this problem. To increase the efficiency of the proposed method compared with existing works, the gravitational search algorithm was used to select useful features, and the differential evolution algorithm was used to tune the parameters of the classifier. The dataset consists of data collected from the customer database on social networks and hotels' websites, especially for hotels on Kish Island in Iran. The results showed that after preprocessing, parameter tuning, and removal of unimportant features, the model's accuracy increased significantly. The precision, recall, F1, and accuracy were 0.77, 0.76, 0.76, and 0.77, respectively.
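As a brief illustration of the four evaluation criteria named in the abstract, the sketch below computes precision, recall, F1, and accuracy from true and predicted labels. It is a minimal example, not the authors' code: the function name classification_metrics, the choice of label 1 for "churn", and the toy label lists are all assumptions made for illustration, and the abstract does not state which averaging scheme the authors used.

```python
def classification_metrics(y_true, y_pred, positive=1):
    """Compute binary precision, recall, F1, and accuracy.

    `positive` marks the class of interest (assumed here: 1 = churned).
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)

    # Guard against division by zero when a class is never predicted or absent.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0

    return {"precision": precision, "recall": recall,
            "f1": f1, "accuracy": accuracy}


# Toy labels, invented for illustration only (not data from the study).
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0]
metrics = classification_metrics(y_true, y_pred)
print(metrics)
```

With these toy labels there are three true positives, one false positive, one false negative, and three true negatives, so all four criteria equal 0.75.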
Pages: 8