Evaluating Unsupervised Text Embeddings on Software User Feedback

Cited by: 11
Authors
Devine, Peter [1]
Koh, Yun Sing [1]
Blincoe, Kelly [1]
Affiliations
[1] Univ Auckland, Auckland, New Zealand
Source
29TH IEEE INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE WORKSHOPS (REW 2021) | 2021
Keywords
REVIEWS; MODELS;
DOI
10.1109/REW53955.2021.00020
Chinese Library Classification
TP31 [Computer software];
Discipline classification code
081202 ; 0835 ;
Abstract
User feedback on software products has been shown to be useful for development and can be exceedingly abundant online. Many approaches have been developed to elicit requirements from this large volume of feedback, including unsupervised clustering underpinned by text embeddings. Methods for embedding text vary significantly within the literature, highlighting the lack of consensus as to which approaches are best able to cluster user feedback into requirements-relevant groups. This work proposes a methodology for comparing text embeddings of user feedback using existing labelled datasets. Using seven diverse datasets from the literature, we apply this methodology to evaluate both established text embedding techniques from the user feedback analysis literature (including topic modelling and word embeddings) and text embeddings from state-of-the-art deep text embedding models. Results demonstrate that text embeddings produced by state-of-the-art models, most notably the Universal Sentence Encoder (USE), group feedback with similar requirements-relevant characteristics together better than the other evaluated techniques across all seven datasets. These results can help researchers select appropriate embedding techniques when developing future unsupervised clustering approaches within user feedback analysis.
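The abstract does not say which score the authors use to compare embeddings, so the following is only a minimal sketch of the general idea: embed labelled feedback, then check whether each item's nearest neighbour in embedding space carries the same label. Everything in it is an assumption made for illustration. The `embed` function is a toy bag-of-words stand-in (not USE or any model from the paper), `neighbour_label_agreement` is a hypothetical helper, and the four example texts and their labels are invented.

```python
import math
from collections import Counter


def embed(text):
    """Toy stand-in embedding (assumption): a bag-of-words count vector.

    Swap in any real embedding function that maps a string to a vector.
    """
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def neighbour_label_agreement(texts, labels, embed_fn=embed):
    """Fraction of items whose nearest neighbour shares their label.

    Hypothetical evaluation score, not the metric from the paper.
    Requires at least two texts.
    """
    vectors = [embed_fn(t) for t in texts]
    hits = 0
    for i, v in enumerate(vectors):
        # Nearest neighbour by cosine similarity, excluding the item itself.
        best = max(
            (j for j in range(len(vectors)) if j != i),
            key=lambda j: cosine(v, vectors[j]),
        )
        hits += labels[best] == labels[i]
    return hits / len(texts)


# Invented example feedback with invented labels.
texts = [
    "app crashes when I open the camera",
    "the app crashes on the camera screen",
    "please add a dark mode option",
    "would love a dark mode option please",
]
labels = ["bug", "bug", "feature", "feature"]
score = neighbour_label_agreement(texts, labels)
```

Running the same function with different `embed_fn` arguments on the same labelled data gives one number per embedding technique, which is the kind of side-by-side comparison the abstract describes.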
Pages: 87-95
Number of pages: 9
Related papers
50 in total
  • [31] Applying short text topic models to instant messaging communication of software developers
    Silva, Camila Costa
    Galster, Matthias
    Gilson, Fabian
    JOURNAL OF SYSTEMS AND SOFTWARE, 2024, 216
  • [32] Query-Based Configuration of Text Retrieval Solutions for Software Engineering Tasks
    Moreno, Laura
    Bavota, Gabriele
    Haiduc, Sonia
    Di Penta, Massimiliano
    Oliveto, Rocco
    Russo, Barbara
    Marcus, Andrian
    2015 10TH JOINT MEETING OF THE EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND THE ACM SIGSOFT SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE 2015) PROCEEDINGS, 2015, : 567 - 578
  • [33] User preferences based software defect detection algorithms selection using MCDM
    Peng, Yi
    Wang, Guoxun
    Wang, Honggang
    INFORMATION SCIENCES, 2012, 191 : 3 - 13
  • [34] Analyzing Customer Experience Feedback Using Text Mining: A Linguistics-Based Approach
    Ordenes, Francisco Villarroel
    Theodoulidis, Babis
    Burton, Jamie
    Gruber, Thorsten
    Zaki, Mohamed
    JOURNAL OF SERVICE RESEARCH, 2014, 17 (03) : 278 - 295
  • [35] Enhancing Feedback for Clinical Use: Creating and Evaluating Profiles of Clients Seeking Counseling
    Nordberg, Samuel S.
    Castonguay, Louis G.
    McAleavey, Andrew A.
    Locke, Benjamin D.
    Hayes, Jeffrey A.
    JOURNAL OF COUNSELING PSYCHOLOGY, 2016, 63 (03) : 278 - 293
  • [36] RETRACTED: The Statistical Analysis of Multidimensional Psychological Characteristics and User Feedback Willingness (Retracted Article)
    Wang, Haiying
    Li, Yaning
    Zhou, Chang
    Jin, Haizhe
    Wang, Lin
    ADVANCES IN MATHEMATICAL PHYSICS, 2021, 2021
  • [37] Evaluating Online Products Using Text Mining: A Reliable Evidence-Based Approach
    Xu, Haiping
    Wei, Ran
    Degroof, Richard
    Carberry, Joshua
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2022, 16 (04) : 585 - 611
  • [38] Evaluating Pred(p) and standardized accuracy criteria in software development effort estimation
    Idri, Ali
    Abnane, Ibtissam
    Abran, Alain
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2018, 30 (04)
  • [39] Evaluating filter fuzzy analogy homogenous ensembles for software development effort estimation
    Hosni, Mohamed
    Idri, Ali
    Abran, Alain
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2019, 31 (02)
  • [40] Evaluating the understandability and user acceptance of Attack-Defense Trees: Original experiment and replication
    Broccia, Giovanna
    ter Beek, Maurice H.
    Lafuente, Alberto Lluch
    Spoletini, Paola
    Fantechi, Alessandro
    Ferrari, Alessio
    INFORMATION AND SOFTWARE TECHNOLOGY, 2025, 178