Towards understanding and detecting fake reviews in app stores

被引:82
作者
Martens, Daniel [1 ]
Maalej, Walid [1 ]
机构
[1] Univ Hamburg, Dept Informat, Hamburg, Germany
基金
欧盟地平线“2020”;
关键词
Fake reviews; App reviews; User feedback; App stores;
D O I
10.1007/s10664-019-09706-9
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
App stores include an increasing amount of user feedback in form of app ratings and reviews. Research and recently also tool vendors have proposed analytics and data mining solutions to leverage this feedback to developers and analysts, e.g., for supporting release decisions. Research also showed that positive feedback improves apps' downloads and sales figures and thus their success. As a side effect, a market for fake, incentivized app reviews emerged with yet unclear consequences for developers, app users, and app store operators. This paper studies fake reviews, their providers, characteristics, and how well they can be automatically detected. We conducted disguised questionnaires with 43 fake review providers and studied their review policies to understand their strategies and offers. By comparing 60,000 fake reviews with 62 million reviews from the Apple App Store we found significant differences, e.g., between the corresponding apps, reviewers, rating distribution, and frequency. This inspired the development of a simple classifier to automatically detect fake reviews in app stores. On a labelled and imbalanced dataset including one-tenth of fake reviews, as reported in other domains, our classifier achieved a recall of 91% and an AUC/ROC value of 98%. We discuss our findings and their impact on software engineering, app users, and app store operators.
引用
收藏
页码:3316 / 3355
页数:40
相关论文
共 60 条
[1]  
9to5Mac, 2017, BELL FAC 1 25M FIN P
[2]  
[Anonymous], 2017, TIMES NY
[3]  
[Anonymous], 2014, LNICST, DOI DOI 10.1007/978-3-319-05452-04
[4]  
[Anonymous], 2013, CHI 13 EXTENDED ABST
[5]  
[Anonymous], 1964, Automation and Remote Control
[6]  
Apple, 2017, APP STOR REV GUID
[7]  
Bird S., 2009, Natural language processing with Python: analyzing text with the natural language toolkit
[8]  
Calefato F., 2017, ARXIV170803892
[9]  
Carreño LVG, 2013, PROCEEDINGS OF THE 35TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2013), P582, DOI 10.1109/ICSE.2013.6606604
[10]   AR-Miner: Mining Informative Reviews for Developers from Mobile App Marketplace [J].
Chen, Ning ;
Lin, Jialiu ;
Hoi, Steven C. H. ;
Xiao, Xiaokui ;
Zhang, Boshen .
36TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2014), 2014, :767-778