Airbnb Price Prediction Using Machine Learning and Sentiment Analysis

被引:12
作者
Kalehbasti, Pouya Rezazadeh [1 ]
Nikolenko, Liubov [1 ]
Rezaei, Hoormazd [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
来源
MACHINE LEARNING AND KNOWLEDGE EXTRACTION (CD-MAKE 2021) | 2021年 / 12844卷
关键词
AirBNB; Rental property pricing; Machine learning; Sentiment analysis;
D O I
10.1007/978-3-030-84060-0_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pricing a property and evaluating the proposed price for a property are challenges that, respectively, owners and customers of Airbnb rentals face on a daily basis. This paper aims to create a model for predicting the price of an Airbnb listing using property specifications, owner information, and customer reviews for the listing. Owners and customers can use the resulting model to estimate the expected value of an Airbnb listing. Linear regression, tree-based models, K-means Clustering, Support Vector Regression (SVR), and neural networks are trained and tuned on a dataset of Airbnb listings from New York city, and the resulting models are compared in terms of Mean Squared Error, Mean Absolute Error, and R-2 score. Sentiment analysis is used to extract features from the customer reviews which help enhance the performance of the selected predictive models. Feature importance analysis is also used to select the most representative features for predicting the price of the listings. Experimentation shows that SVR model can achieve an R-2 score of 69% and a MSE of 0.147 (defined on ln(price)) on the test set, outperforming the other models considered in the paper. [Link to the repository: github.com/PouyaREZ/AirBnbPricePrediction].
引用
收藏
页码:173 / 184
页数:12
相关论文
共 27 条
[1]  
Aggarwal K., 2019, ARXIV190512868CSSTAT
[2]  
[Anonymous], 2014, SECONDARY TEXTBLOB S
[3]  
[Anonymous], Ultralytics-Python library operating the YOLO models
[4]  
[Anonymous], 2011, Acm T. Intel. Syst. Tec., DOI DOI 10.1145/1961189.1961199
[5]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[6]   A Client-Centric Evaluation System to Evaluate Guest's Satisfaction on Airbnb Using Machine Learning and NLP [J].
Chiny, Mohamed ;
Bencharef, Omar ;
Hadi, Moulay Youssef ;
Chihab, Younes .
APPLIED COMPUTATIONAL INTELLIGENCE AND SOFT COMPUTING, 2021, 2021
[7]   Application of deep learning to large scale riverine flow velocity estimation [J].
Forghani, Mojtaba ;
Qian, Yizhou ;
Lee, Jonghyun ;
Farthing, Matthew W. ;
Hesser, Tyler ;
Kitanidis, Peter K. ;
Darve, Eric F. .
STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2021, 35 (05) :1069-1088
[8]  
Hall MA, 1998, AUST COMP S, V20, P181
[9]  
Johansson R., An intuitive explanation of gradient boosting
[10]  
Kalehbasti Pouya Rezazadeh, 2021, GECCO '21: Proceedings of the Genetic and Evolutionary Computation Conference Companion, P1879, DOI 10.1145/3449726.3463675