Analysis of the Customer Churn Prediction Project in the Hotel Industry Based on Text Mining and the Random Forest Algorithm

被引:1
|
作者
Taherkhani, Leila [1 ]
Daneshvar, Amir [2 ]
Khalili, Hossein Amoozad [3 ]
Sanaei, Mohamad Reza [4 ]
机构
[1] Islamic Azad Univ, Dept Informat Technol Management, Sci & Res Branch, Tehran, Iran
[2] Islamic Azad Univ, Dept Ind Management, Sci & Res Branch, Tehran, Iran
[3] Islamic Azad Univ, Dept Ind Engn, Sari Branch, Sari, Iran
[4] Islamic Azad Univ, Qazvin Branch, Coll Management & Econ, Dept Informat & Technol Management, Qazvin, Iran
关键词
D O I
10.1155/2023/6029121
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The ability of hotels to differentiate themselves from competitors and continue to operate profitably depends on their ability to retain their customers by building long-term and permanent customer relationships. Technological developments in recent years have made it possible for companies to predict their customers' behavior by accessing their opinions faster and preventing them from churning. Managing customer churn prediction projects has become an important issue today, especially in the hotel industry. Therefore, this research seeks to analyze projects that predict the churn of hotel customers to provide a model to help hotel managers in this field. In this research, an approach based on text mining on customers' comments in the Persian language is presented, which uses the random forest algorithm for classification that was considered the most effective method to solve this problem. In this model, to increase the efficiency of the proposed method in compare with existing works, the gravitational search algorithm was used to select the useful features, and the differential evolution algorithm was used to adjust the parameters of the classification method. The dataset of this research is the collected data from the customer database on social networks and hotels' websites, especially the hotels on Kish Island in Iran. The results of this research showed that after the implementation of the preprocessing operations, the method of adjusting the parameters and removing the unimportant features, the model's accuracy increased significantly. The precision, recall, F1, and accuracy criteria were 0.77, 0.76, 0.76, and 0.77, respectively.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Mobile Communications Customer Churn Prediction Algorithm Based on Improved GA-BP Network
    Yu R.-Y.
    Xue L.
    An X.-M.
    Xia X.-Y.
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2019, 40 (02): : 180 - 185
  • [42] Fracture zone prediction based on random forest algorithm
    He J.
    Wen X.
    Nie W.-L.
    Li L.
    Yang J.
    Shiyou Diqiu Wuli Kantan/Oil Geophysical Prospecting, 2020, 55 (01): : 161 - 166
  • [43] A novel classification algorithm for customer churn prediction based on hybrid Ensemble-Fusion model
    He, Chenggang
    Ding, Chris H. Q.
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [44] Research on the Role of Influencing Factors on Hotel Customer Satisfaction Based on BP Neural Network and Text Mining
    Wang, Jiaying
    Zhao, Zhijie
    Liu, Yang
    Guo, Yiqi
    INFORMATION, 2021, 12 (03) : 1 - 19
  • [45] CDR Analysis Based Telco Churn Prediction and Customer Behavior Insights: A Case Study
    Modani, Natwar
    Dey, Kuntal
    Gupta, Ritesh
    Godbole, Shantanu
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2013, PT II, 2013, 8181 : 256 - 269
  • [46] The comparative analysis and study of Mobile-based customer data churn prediction model
    Lei Jin-hui
    He Jian-jun
    2009 WRI WORLD CONGRESS ON SOFTWARE ENGINEERING, VOL 4, PROCEEDINGS, 2009, : 524 - 528
  • [47] Customer Churn Prediction Modelling Based on Behavioural Patterns Analysis using Deep Learning
    Agrawal, Sanket
    Das, Aditya
    Gaikwad, Amit
    Dhage, Sudhir
    2018 INTERNATIONAL CONFERENCE ON SMART COMPUTING AND ELECTRONIC ENTERPRISE (ICSCEE), 2018,
  • [48] Analysis of English Writing Text Features Based on Random Forest and Logistic Regression Classification Algorithm
    Sun, Chuan
    Luo, Bo
    MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [49] A Text Classification Algorithm for Power Equipment Defects Based on Random Forest
    Zhu, Longzhu
    Tian, Nuo
    Li, Wei
    Yang, Jing
    INTERNATIONAL JOURNAL OF RELIABILITY QUALITY AND SAFETY ENGINEERING, 2022, 29 (05)
  • [50] Collapse column prediction model based on random forest algorithm
    Hao Shuai
    Wang Huaixiu
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 1045 - 1048