Towards understanding and detecting fake reviews in app stores

被引:0
作者
Daniel Martens
Walid Maalej
机构
[1] University of Hamburg,Department of Informatics
来源
Empirical Software Engineering | 2019年 / 24卷
关键词
Fake reviews; App reviews; User feedback; App stores;
D O I
暂无
中图分类号
学科分类号
摘要
App stores include an increasing amount of user feedback in form of app ratings and reviews. Research and recently also tool vendors have proposed analytics and data mining solutions to leverage this feedback to developers and analysts, e.g., for supporting release decisions. Research also showed that positive feedback improves apps’ downloads and sales figures and thus their success. As a side effect, a market for fake, incentivized app reviews emerged with yet unclear consequences for developers, app users, and app store operators. This paper studies fake reviews, their providers, characteristics, and how well they can be automatically detected. We conducted disguised questionnaires with 43 fake review providers and studied their review policies to understand their strategies and offers. By comparing 60,000 fake reviews with 62 million reviews from the Apple App Store we found significant differences, e.g., between the corresponding apps, reviewers, rating distribution, and frequency. This inspired the development of a simple classifier to automatically detect fake reviews in app stores. On a labelled and imbalanced dataset including one-tenth of fake reviews, as reported in other domains, our classifier achieved a recall of 91% and an AUC/ROC value of 98%. We discuss our findings and their impact on software engineering, app users, and app store operators.
引用
收藏
页码:3316 / 3355
页数:39
相关论文
共 57 条
[1]  
Chawla NV(2004)Special issue on learning from imbalanced data sets ACM Sigkdd Explor Newsl 6 1-6
[2]  
Japkowicz N(1995)Support-vector networks Mach Learn 20 273-297
[3]  
Kotcz A(2012)Distributional footprints of deceptive product reviews ICWSM 12 98-105
[4]  
Cortes C(2017)Investigating the relationship between price, rating, and popularity in the blackberry world app store Inf Softw Technol 87 119-139
[5]  
Vapnik V(2012)Effect size estimates: current use, calculations, and interpretation J Exp Psychol Gen 141 2-3427
[6]  
Feng S(2016)Fake it till you make it: reputation, competition, and yelp review fraud Manag Sci 62 3412-331
[7]  
Xing L(2016)On the automatic classification of app reviews Requirements Engineering 21 311-54
[8]  
Gogar A(2016)Toward Data-Driven Requirements Engineering IEEE Software 33 48-1
[9]  
Choi Y(2016)A survey of app store analysis for software engineering IEEE Trans Softw Eng PP 1-2830
[10]  
Finkelstein A(2011)Scikit-learn: machine learning in Python J Mach Learn Res 12 2825-4:29