Python']Python data odyssey: Mining user feedback from google play store

被引:0
作者
Yasin, Affan [1 ]
Fatima, Rubia [2 ]
Ghazi, Ahmad Nauman [3 ]
Wei, Ziqi [4 ]
机构
[1] Northwestern Polytech Univ, Sch Software, Xian 710072, Shaanxi, Peoples R China
[2] Emerson Univ, Dept Comp Sci, Multan, Pakistan
[3] Blekinge Inst Technol, Dept Software Engn, SE-37179 Karlskrona, Sweden
[4] Chinese Acad Sci, Inst Automat, Beijing, Peoples R China
来源
DATA IN BRIEF | 2024年 / 54卷
关键词
Data mining; App reviews; User reviews; Crowd-source data; NLP;
D O I
10.1016/j.dib.2024.110499
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Context: The Google Play Store is widely recognized as one of the largest platforms for downloading applications, both free and paid 1 . On a daily basis, millions of users avail themselves of this marketplace, sharing their thoughts through various means such as star ratings, user comments, suggestions, and feedback. These insights, in the form of comments and feedback, constitute a valuable resource for organizations, competitors, and emerging companies seeking to expand their market presence. These comments provide insights into app deficiencies, suggestions for new features, identified issues, and potential enhancements. Unlocking the potential of this repository of suggestions holds significant value. Objective: This study sought to gather and analyze user reviews from the Google Play store for leading game apps. The primary aim was to construct a dataset for subsequent analysis utilizing requirements engineering, machine learning, and competitive assessment. Methodology: The authors employed a Python-based web scraping method to extract a comprehensive set of over 429,000+ reviews from the Google Play pages of selected apps. The scraped data encompassed reviewer names (removed due to privacy), ratings, and the textual content of the reviews. Results: The outcome was a dataset comprising the extracted user reviews, ratings, and associated metadata. A total of 429,0 0 0 + reviews were acquired through the scraping process for popular apps like Subway Surfers, Candy Crush Saga, PUBG Mobile, among others. This dataset not only serves as a valuable educational resource for instructors, aiding in the training of students in data analysis, but also offers practitioners the opportunity for in-depth examination and insights (in the past data of top apps). (c) 2024 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license ( http://creativecommons.org/licenses/by/4.0/ )
引用
收藏
页数:7
相关论文
共 7 条
  • [1] RE-SWOT: From User Feedback to Requirements via Competitor Analysis
    Dalpiaz, Fabiano
    Parente, Micaela
    [J]. REQUIREMENTS ENGINEERING: FOUNDATION FOR SOFTWARE QUALITY (REFSQ 2019), 2019, 11412 : 55 - 70
  • [2] Valuating requirements arguments in the online user's forum for requirements decision-making: The CrowdRE-VArg framework
    Khan, Javed Ali
    Yasin, Affan
    Fatima, Rubia
    Vasan, Danish
    Khan, Arif Ali
    Khan, Abdul Wahid
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 2022, 52 (12) : 2537 - 2573
  • [3] Requirements Engineering for Machine Learning: A Review and Reflection
    Pei, Zhongyi
    Liu, Lin
    Wang, Chen
    Wang, Jianmin
    [J]. 2022 IEEE 30TH INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE WORKSHOPS (REW), 2022, : 166 - 175
  • [4] Using App Reviews for Competitive Analysis: Tool Support
    Shah, Faiz Ali
    Sirts, Kairit
    Pfahl, Dietmar
    [J]. PROCEEDINGS OF THE 3RD ACM SIGSOFT INTERNATIONAL WORKSHOP ON APP MARKET ANALYTICS (WAMA '19), 2019, : 40 - 46
  • [5] Venkatakrishnan S., 2020, Appl. Mach. Learn., V1, P15, DOI DOI 10.1007/978-981-15-3357-0_2
  • [6] Wei JL, 2022, Arxiv, DOI arXiv:2206.14669
  • [7] On the utilization of non-quality assessed literature in software engineering research
    Yasin, Affan
    Fatima, Rubia
    Liu, Lin
    Ali Khan, Javed
    Ali, Raian
    Wang, Jianmin
    [J]. JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2022, 34 (07)