Domain-Specific Analysis of Mobile App Reviews Using Keyword-Assisted Topic Models

Cited by: 15
Authors
Tushev, Miroslav [1]
Ebrahimi, Fahimeh [1]
Mahmoud, Anas [1]
Affiliations
[1] Louisiana State Univ, Div Comp Sci & Engn, Baton Rouge, LA 70803 USA
Source
2022 ACM/IEEE 44TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2022) | 2022
Funding
U.S. National Science Foundation;
Keywords
DOI
10.1145/3510003.3510201
Chinese Library Classification number
TP31 [Computer Software];
Discipline classification code
081202; 0835;
Abstract
Mobile application (app) reviews contain valuable information for app developers. A plethora of supervised and unsupervised techniques have been proposed in the literature to synthesize useful user feedback from app reviews. However, traditional supervised classification algorithms require extensive manual effort to label ground truth data, while unsupervised text mining techniques, such as topic models, often produce suboptimal results due to the sparsity of useful information in the reviews. To overcome these limitations, in this paper, we propose a fully automatic and unsupervised approach for extracting useful information from mobile app reviews. The proposed approach is based on keyATM, a keyword-assisted approach for generating topic models. keyATM overcomes the problem of data sparsity by using seeding keywords extracted directly from the review corpus. These keywords are then used to generate meaningful domain-specific topics. Our approach is evaluated over two datasets of mobile app reviews sampled from the domains of Investing and Food Delivery apps. The results show that our approach produces significantly more coherent topics than traditional topic modeling techniques.
Pages: 762-773
Page count: 12