Caspar: Extracting and Synthesizing User Stories of Problems from App Reviews

被引:26
作者
Guo, Hui [1 ]
Singh, Munindar P. [1 ]
机构
[1] North Carolina State Univ, Secure Comp Inst, Raleigh, NC 27695 USA
来源
2020 ACM/IEEE 42ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2020) | 2020年
关键词
D O I
10.1145/3377811.3380924
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
A user's review of an app often describes the user's interactions with the app. These interactions, which we interpret as mini stories, are prominent in reviews with negative ratings. In general, a story in an app review would contain at least two types of events: user actions and associated app behaviors. Being able to identify such stories would enable an app's developer in better maintaining and improving the app's functionality and enhancing user experience. We present Caspar, a method for extracting and synthesizing user-reported mini stories regarding app problems from reviews. By extending and applying natural language processing techniques, Caspar extracts ordered events from app reviews, classifies them as user actions or app problems, and synthesizes action-problem pairs. Our evaluation shows that Caspar is effective in finding action-problem pairs from reviews. First, Caspar classifies the events with an accuracy of 82.0% on manually labeled data. Second, relative to human evaluators, Caspar extracts event pairs with 92.9% precision and 34.2% recall. In addition, we train an inference model on the extracted action-problem pairs that automatically predicts possible app problems for different use cases. Preliminary evaluation shows that our method yields promising results. Caspar illustrates the potential for a deeper understanding of app reviews and possibly other natural language artifacts arising in software engineering.
引用
收藏
页码:628 / 640
页数:13
相关论文
共 42 条
[1]  
Beamer B, 2009, LECT NOTES COMPUT SC, V5449, P430, DOI 10.1007/978-3-642-00382-0_35
[2]  
Beatrice Santorini, 1995, PART OF SPEECH TAGGI
[3]  
Cer D, 2018, CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, P169
[4]  
Chambers Nathanael, 2008, P ACL 08, P789
[5]   AR-Miner: Mining Informative Reviews for Developers from Mobile App Marketplace [J].
Chen, Ning ;
Lin, Jialiu ;
Hoi, Steven C. H. ;
Xiao, Xiaokui ;
Zhang, Boshen .
36TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2014), 2014, :767-778
[6]  
De Marneffe Marie-Catherine, 2008, Stanford typed dependencies manual
[7]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[8]   App Review Analysis via Active Learning Reducing Supervision Effort without Compromising Classification Accuracy [J].
Dhinakaran, Venkatesh T. ;
Pulle, Raseshwari ;
Ajmeri, Nirav ;
Murukannaiah, Pradeep K. .
2018 IEEE 26TH INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE (RE 2018), 2018, :170-181
[9]   What Would Users Change in My App? Summarizing App Reviews for Recommending Software Changes [J].
Di Sorbo, Andrea ;
Panichella, Sebastiano ;
Alexandru, Carol V. ;
Shimagaki, Junji ;
Visaggio, Corrado A. ;
Canfora, Gerardo ;
Gall, Harald C. .
FSE'16: PROCEEDINGS OF THE 2016 24TH ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON FOUNDATIONS OF SOFTWARE ENGINEERING, 2016, :499-510
[10]   INTRODUCTION TO MODERN INFORMATION-RETRIEVAL - SALTON,G, MCGILL,M [J].
DILLON, M .
INFORMATION PROCESSING & MANAGEMENT, 1983, 19 (06) :402-403