Detecting relevant app reviews for software evolution and maintenance through multimodal one-class learning

被引:4
作者
Golo, Marcos P. S. [1 ]
Araujo, Adailton F. [1 ]
Rossi, Rafael G. [2 ]
Marcacini, Ricardo M. [1 ]
机构
[1] Univ Sao Paulo, Inst Math & Comp Sci, POB 668, BR-13560970 Sao Carlos, SP, Brazil
[2] FACOM Fed Univ Mato Grosso do Sul, BR-79070900 Campo Grande, MS, Brazil
基金
巴西圣保罗研究基金会;
关键词
One-class classification; App reviews classification; Multimodal Autoencoders;
D O I
10.1016/j.infsof.2022.106998
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Context: Mobile app reviews are a rich source of information for software evolution and maintenance. Several studies have shown the effectiveness of exploring relevant reviews in the software development lifecycle, such as release planning and requirements engineering tasks. Popular apps receive even millions of reviews, thereby making manual extraction of relevant information an impractical task. The literature presents several machine learning approaches to detect relevant reviews. However, these approaches use multi-class learning, implying more user effort for data labeling since users must label a significant set of relevant and irrelevant reviews. Objective: This article investigates methods for detecting relevant app reviews considering scenarios with small sets of labeled data. We evaluated unimodal and multimodal representations, different labeling levels, as well as different app review domains and languages. Method: We present a one-class multimodal learning method for detecting relevant reviews. Our approaches have two main contributions. First, we use one-class learning that requires only the labeling of relevant app reviews, thereby minimizing the labeling effort. Second, to handle the smaller amount of labeled reviews without harming classification performance, we also present methods to improve feature extraction and reviews representation. We propose the Multimodal Autoencoder and the Multimodal Variational Autoencoder. The methods learn representations which explore both textual data and visual information based on the density of the reviews. Density information can be interpreted as a summary of the main topics or clusters extracted from the reviews. Results: Our methods achieved competitive results even using only 25% of labeled reviews compared to models that used the entire training set. Also, our multimodal approaches obtain the highest F-1-Score and AUC-ROC in twenty-three out of twenty-four scenarios. Conclusion: Our one-class multimodal methods proved to be a competitive alternative for detecting relevant reviews and promising for practical scenarios involving data-driven software evolution and maintenance.
引用
收藏
页数:12
相关论文
共 51 条
[1]  
Aggarwal CC., 2018, Neural networks and deep learning, DOI DOI 10.1007/978-3-319-94463-0
[2]  
Aggarwal CC., 2018, MACHINE LEARNING TEX, DOI [10.1007/978-3-319-73531-3, DOI 10.1007/978-3-319-73531-3]
[3]  
Al Kilani N, 2019, 2019 SIXTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORKS ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), P541, DOI [10.1109/SNAMS.2019.8931820, 10.1109/snams.2019.8931820]
[4]   One-class support vector classifiers: A survey [J].
Alam, Shamshe ;
Sonbhadra, Sanjay Kumar ;
Agarwal, Sonali ;
Nagabhushan, P. .
KNOWLEDGE-BASED SYSTEMS, 2020, 196
[5]  
[Anonymous], 2001, ASCI Dissertation Series
[6]  
Araujo Adailton., 2020, Anais do Encontro Nacional de Inteligencia Artificial e Computacional (ENIAC 2020), P378, DOI DOI 10.5753/ENIAC.2020.12144
[7]   Opinion mining for app reviews: an analysis of textual representation and predictive models [J].
Araujo, Adailton F. ;
Golo, Marcos P. S. ;
Marcacini, Ricardo M. .
AUTOMATED SOFTWARE ENGINEERING, 2022, 29 (01)
[8]   A graph-based approach for positive and unlabeled learning [J].
Carnevali, Julio Cesar ;
Rossi, Rafael Geraldeli ;
Milios, Evangelos ;
Lopes, Alneu de Andrade .
INFORMATION SCIENCES, 2021, 580 :655-672
[9]   AR-Miner: Mining Informative Reviews for Developers from Mobile App Marketplace [J].
Chen, Ning ;
Lin, Jialiu ;
Hoi, Steven C. H. ;
Xiao, Xiaokui ;
Zhang, Boshen .
36TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2014), 2014, :767-778
[10]   RE-BERT: Automatic Extraction of Software Requirements from App Reviews using BERT Language Model [J].
de Araujo, Adailton Ferreira ;
Marcacini, Ricardo Marcondes .
36TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2021, 2021, :1321-1327