Extracting features from online software reviews to aid requirements reuse

被引:34
作者
Bakar, Noor Hasrina [1 ]
Kasirun, Zarinah M. [1 ]
Salleh, Norsaremah [2 ]
Jalab, Hamid A. [1 ]
机构
[1] Univ Malaya, Fac Comp Sci & IT, Dept Software Engn, Kuala Lumpur 50603, Malaysia
[2] Int Islamic Univ Malaysia, Kulliyyah ICT, Dept Comp Sci, Kuala Lumpur 50728, Malaysia
关键词
Requirements reuse; Software engineering; Natural language processing; Unsupervised learning; Latent semantic analysisa; PRODUCT LINES;
D O I
10.1016/j.asoc.2016.07.048
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sets of common features are essential assets to be reused in fulfilling specific needs in software product line methodology. In Requirements Reuse (RR), the extraction of software features from Software Requirement Specifications (SRS) is viable only to practitioners who have access to these software artefacts. Due to organisational privacy, SRS are always kept confidential and not easily available to the public. As alternatives, researchers opted to use the publicly available software descriptions such as product brochures and online software descriptions to identify potential software features to initiate the RR process. The aim of this paper is to propose a semi-automated approach, known as Feature Extraction for Reuse of Natural Language requirements (FENL), to extract phrases that can represent software features from software reviews in the absence of SRS as a way to initiate the RR process. FENL is composed of four stages, which depend on keyword occurrences from several combinations of nouns, verbs, and/or adjectives. In the experiment conducted, phrases that could reflect software features, which reside within online software reviews were extracted by utilising the techniques from information retrieval (IR) area. As a way to demonstrate the feature groupings phase, a semi-automated approach to group the extracted features were then conducted with the assistance of a modified word overlap algorithm. As for the evaluation, the proposed extraction approach is evaluated through experiments against the truth data set created manually. The performance results obtained from the feature extraction phase indicates that the proposed approach performed comparably with related works in terms of recall, precision, and F-Measure (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:1297 / 1315
页数:19
相关论文
共 31 条
[1]   An Exploratory Study of Information Retrieval Techniques in Domain Analysis [J].
Alves, Vander ;
Schwanninger, Christa ;
Barbosa, Luciano ;
Rashid, Awais ;
Sawyer, Peter ;
Rayson, Paul ;
Pohl, Christoph ;
Rummler, Andreas .
SPLC 2008: 12TH INTERNATIONAL SOFTWARE PRODUCT LINE CONFERENCE, PROCEEDINGS, 2008, :67-76
[2]   Requirements engineering for software product lines: A systematic literature review [J].
Alves, Vander ;
Niu, Nan ;
Alves, Carina ;
Valenca, George .
INFORMATION AND SOFTWARE TECHNOLOGY, 2010, 52 (08) :806-820
[3]  
[Anonymous], 2001, ADAP COMP MACH LEARN
[4]   Feature extraction approaches from natural language requirements for reuse in software product lines: A systematic literature review [J].
Bakar, Noor Hasrina ;
Kasirun, Zarinah M. ;
Salleh, Norsaremah .
JOURNAL OF SYSTEMS AND SOFTWARE, 2015, 106 :132-149
[5]   FCM - THE FUZZY C-MEANS CLUSTERING-ALGORITHM [J].
BEZDEK, JC ;
EHRLICH, R ;
FULL, W .
COMPUTERS & GEOSCIENCES, 1984, 10 (2-3) :191-203
[6]  
Carreño LVG, 2013, PROCEEDINGS OF THE 35TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2013), P582, DOI 10.1109/ICSE.2013.6606604
[7]  
Chen K, 2005, 13TH IEEE INTERNATIONAL CONFERENCE ON REQUIREMENTS ENGINEERING, PROCEEDINGS, P31
[8]   Efficient software clustering technique using an adaptive and preventive dendrogram cutting approach [J].
Chong, Chun Yong ;
Lee, Sai Peck ;
Ling, Teck Chaw .
INFORMATION AND SOFTWARE TECHNOLOGY, 2013, 55 (11) :1994-2012
[9]  
Clements P., 2002, SEI Series in Software Engineering
[10]  
Cohen S., 1990, Tech. Rep. CMU/SEI-90-TR-021