Automatic query reformulations for feature location in a model-based family of software products

被引:8
作者
Perez, Francisca [1 ]
Font, Jaime [1 ,2 ]
Arcega, Lorena [1 ,2 ]
Cetina, Carlos [1 ]
机构
[1] Univ San Jorge, SVIT Res Grp, Autovia A-23 Zaragoza Huesca Km 299, Zaragoza 50830, Spain
[2] Univ Oslo, Dept Informat, Postboks 1080 Blindern, N-0316 Oslo, Norway
关键词
Conceptual modeling; Information retrieval; Feature location; Query reformulation; Software maintenance and evolution; Families of software products; SOURCE CODE; TESTS;
D O I
10.1016/j.datak.2018.06.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
No maintenance activity can be completed without Feature Location (FL), which is finding the set of software artifacts that realize a particular functionally. Despite the importance of FL, the vast majority of work has been focused on retrieving code, whereas other software artifacts such as the models have been neglected. Furthermore, locating a piece of information from a query in a large repository is a challenging task as it requires knowledge of the vocabulary used in the software artifacts. This can be alleviated by automatically reformulating the query (adding or removing terms). In this paper, we test four existing query reformulation techniques, which perform the best for FL in code but have never been used for FL in models. Specifically, we test these techniques in two industrial domains: a model-based family of firmwares for induction hobs, and a model-based family of PLC software to control trains. We compare the results provided by our FL approach using the query and the reformulated queries by means of statistical analysis. Our results show that reformulated queries do not improve the performance in models, which could lead towards a new direction in the creation or reconsideration of these techniques to be applied in models.
引用
收藏
页码:159 / 176
页数:18
相关论文
共 50 条
[1]   An Exploratory Study of Information Retrieval Techniques in Domain Analysis [J].
Alves, Vander ;
Schwanninger, Christa ;
Barbosa, Luciano ;
Rashid, Awais ;
Sawyer, Peter ;
Rayson, Paul ;
Pohl, Christoph ;
Rummler, Andreas .
SPLC 2008: 12TH INTERNATIONAL SOFTWARE PRODUCT LINE CONFERENCE, PROCEEDINGS, 2008, :67-76
[2]  
[Anonymous], 1971, The SMART Retrieval System-Experiments in Automatic Document Processing
[3]  
[Anonymous], 2008, P 31 ANN INT ACM SIG
[4]   A Hitchhiker's guide to statistical tests for assessing randomized algorithms in software engineering [J].
Arcuri, Andrea ;
Briand, Lionel .
SOFTWARE TESTING VERIFICATION & RELIABILITY, 2014, 24 (03) :219-250
[5]   Parameter tuning or default values? An empirical investigation in search-based software engineering [J].
Arcuri, Andrea ;
Fraser, Gordon .
EMPIRICAL SOFTWARE ENGINEERING, 2013, 18 (03) :594-623
[6]   The Oracle Problem in Software Testing: A Survey [J].
Barr, Earl T. ;
Harman, Mark ;
McMinn, Phil ;
Shahbaz, Muzammil ;
Yoo, Shin .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2015, 41 (05) :507-525
[7]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[8]   A Survey of Automatic Query Expansion in Information Retrieval [J].
Carpineto, Claudio ;
Romano, Giovanni .
ACM COMPUTING SURVEYS, 2012, 44 (01)
[9]  
CAVALCANTI YC, 2014, P 29 ACM IEEE INT C, P325, DOI DOI 10.1145/2642937.2642964
[10]  
Conover WJ, 1999, Practical nonparametric statistics, V350