Evaluating Unsupervised Text Embeddings on Software User Feedback

Cited by: 11

Authors:

Devine, Peter [1]

Koh, Yun Sing [1]

Blincoe, Kelly [1]

Affiliations:

[1] Univ Auckland, Auckland, New Zealand

Source:

29TH IEEE INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE WORKSHOPS (REW 2021) | 2021

Keywords:

REVIEWS; MODELS

DOI:

10.1109/REW53955.2021.00020

Chinese Library Classification (CLC) number:

TP31 [Computer Software]

Discipline classification codes:

081202; 0835

Abstract:

User feedback on software products has been shown to be useful for development and can be exceedingly abundant online. Many approaches have been developed to elicit requirements from this large volume of feedback, including unsupervised clustering underpinned by text embeddings. Methods for embedding text vary significantly within the literature, highlighting the lack of consensus as to which approaches are best able to cluster user feedback into requirements-relevant groups. This work proposes a methodology for comparing text embeddings of user feedback using existing labelled datasets. Using seven diverse datasets from the literature, we apply this methodology to evaluate both established text embedding techniques from the user feedback analysis literature (including topic modelling and word embeddings) and text embeddings from state-of-the-art deep text embedding models. Results demonstrate that text embeddings produced by state-of-the-art models, most notably the Universal Sentence Encoder (USE), group feedback with similar requirements-relevant characteristics together better than the other evaluated techniques across all seven datasets. These results can help researchers select appropriate embedding techniques when developing future unsupervised clustering approaches within user feedback analysis.

Pages: 87-95

Page count: 9
