Mining domain knowledge from app descriptions

被引:26
作者
Liu, Yuzhou [1 ,2 ]
Liu, Lei [1 ,2 ]
Liu, Huaxiao [1 ,2 ]
Wang, Xiaoyu [1 ,2 ]
Yang, Hongji [3 ]
机构
[1] Minist Educ, Key Lab Symbol Computat & Knowledge Engn, Changchun 130012, Jilin, Peoples R China
[2] Jilin Univ, Coll Comp Sci & Technol, Changchun, Jilin, Peoples R China
[3] Bath Spa Univ, Ctr Creat Comp, Corsham SN13 0BZ, England
关键词
Domain analysis; Feature extraction; App descriptions; Data analysis; REQUIREMENTS; VARIABILITY; REUSE; EXTRACTION; SYSTEMS;
D O I
10.1016/j.jss.2017.08.024
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Domain analysis aims at gaining knowledge to a particular domain in the early stage of software development. A key challenge in domain analysis is to extract features automatically from related product artifacts. Compared with other kinds of artifacts, high volume of descriptions can be collected from App marketplaces (such as Google Play and Apple Store) easily when developing a new mobile application (App), so it is essential for the success of domain analysis to gain features and relationships from them using data analysis techniques. In this paper, we propose an approach to mine domain knowledge from App descriptions automatically, where the information of features in a single App description is firstly extracted and formally described by a Concern-based Description Model (CDM), which is based on predefined rules of feature extraction and a modified topic modeling method; then the overall knowledge in the domain is identified by classifying, clustering and merging the knowledge in the set of CDMs and topics, and the results are formalized by a Data-based Raw Domain Model (DRDM). Furthermore, we propose a quantified evaluation method for prioritizing the knowledge in DRDM. The proposed approach is validated by a series of experiments. (C) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页码:126 / 144
页数:19
相关论文
共 40 条
  • [1] Acher M., 2012, P 6 INT WORKSH VAR M, P45, DOI DOI 10.1145/2110147.2110153
  • [2] Extraction and evolution of architectural variability models in plugin-based systems
    Acher, Mathieu
    Cleve, Anthony
    Collet, Philippe
    Merle, Philippe
    Duchien, Laurence
    Lahire, Philippe
    [J]. SOFTWARE AND SYSTEMS MODELING, 2014, 13 (04) : 1367 - 1394
  • [3] [Anonymous], 2015, MINING USER OPINIONS
  • [4] [Anonymous], 2008, Introduction to information retrieval
  • [5] Antkiewicz M., 2004, Proc. of the Wksp on Eclipse Technology eXchange, P67, DOI DOI 10.1145/1066129.1066143
  • [6] Decision support for the software product line domain engineering lifecycle
    Bagheri, Ebrahim
    Ensan, Faezeh
    Gasevic, Dragan
    [J]. AUTOMATED SOFTWARE ENGINEERING, 2012, 19 (03) : 335 - 377
  • [7] Bakar N. H., 2016, APPL SOFT COMPUT
  • [8] Feature extraction approaches from natural language requirements for reuse in software product lines: A systematic literature review
    Bakar, Noor Hasrina
    Kasirun, Zarinah M.
    Salleh, Norsaremah
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2015, 106 : 132 - 149
  • [9] Benavides D., 2007, INT WORKSH VAR MOD S
  • [10] Commonality and variability in software engineering
    Coplien, J
    Hoffman, D
    Weiss, D
    [J]. IEEE SOFTWARE, 1998, 15 (06) : 37 - +