Towards understanding and detecting fake reviews in app stores

被引:82
作者
Martens, Daniel [1 ]
Maalej, Walid [1 ]
机构
[1] Univ Hamburg, Dept Informat, Hamburg, Germany
基金
欧盟地平线“2020”;
关键词
Fake reviews; App reviews; User feedback; App stores;
D O I
10.1007/s10664-019-09706-9
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
App stores include an increasing amount of user feedback in form of app ratings and reviews. Research and recently also tool vendors have proposed analytics and data mining solutions to leverage this feedback to developers and analysts, e.g., for supporting release decisions. Research also showed that positive feedback improves apps' downloads and sales figures and thus their success. As a side effect, a market for fake, incentivized app reviews emerged with yet unclear consequences for developers, app users, and app store operators. This paper studies fake reviews, their providers, characteristics, and how well they can be automatically detected. We conducted disguised questionnaires with 43 fake review providers and studied their review policies to understand their strategies and offers. By comparing 60,000 fake reviews with 62 million reviews from the Apple App Store we found significant differences, e.g., between the corresponding apps, reviewers, rating distribution, and frequency. This inspired the development of a simple classifier to automatically detect fake reviews in app stores. On a labelled and imbalanced dataset including one-tenth of fake reviews, as reported in other domains, our classifier achieved a recall of 91% and an AUC/ROC value of 98%. We discuss our findings and their impact on software engineering, app users, and app store operators.
引用
收藏
页码:3316 / 3355
页数:40
相关论文
共 60 条
[11]  
Cohen JW., 1988, STAT POWER ANAL BEHA, DOI 10.4324/9780203771587
[12]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[13]  
Dickerson JP, 2014, 2014 PROCEEDINGS OF THE IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2014), P620, DOI 10.1109/ASONAM.2014.6921650
[14]  
DigitalTrends, 2018, CAN YOU REALL TRUST
[15]  
Drummond C., 2003, P ICML 03 WORKSH LEA, P1
[16]  
Feng S., 2012, ICWSM, V12, P98
[17]  
Ferrara E, 2014, CORR ARXIV 1407 5225
[18]   Investigating the relationship between price, rating, and popularity in the Blackberry World App Store [J].
Finkelstein, Anthony ;
Harman, Mark ;
Jia, Yue ;
Martin, William ;
Sarro, Federica ;
Zhang, Yuanyuan .
INFORMATION AND SOFTWARE TECHNOLOGY, 2017, 87 :119-139
[19]  
Fritz CO, 2012, J EXP PSYCHOL GEN, V141, P2, DOI 10.1037/a0024338
[20]  
Fu B, 2013, 19TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'13), P1276