A learning-based module extraction method for object-oriented systems

被引:14
作者
Erdemir, Ural [1 ]
Buzluca, Feza [2 ]
机构
[1] Informat & Informat Secur Res Ctr, Kocaeli, Turkey
[2] Istanbul Tech Univ, Dept Comp Engn, TR-80626 Istanbul, Turkey
关键词
Software architecture recovery; Software modularization; SOA; RECOVERING SOFTWARE ARCHITECTURE; CLUSTERING ALGORITHMS; INFORMATION; QUALITY; MODEL;
D O I
10.1016/j.jss.2014.07.038
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Developers apply object-oriented (OO) design principles to produce modular, reusable software. Therefore, service-specific groups of related software classes called modules arise in OO systems. Extracting the modules is critical for better software comprehension, efficient architecture recovery, determination of service candidates to migrate legacy software to a service-oriented architecture, and transportation of such services to cloud-based distributed systems. In this study, we propose a novel approach to automatic module extraction to identify services in OO software systems. In our approach, first we create a weighted and directed graph of the software system in which vertices and edges represent the classes and their relations, respectively. Then, we apply a clustering algorithm over the graph to extract the modules. We calculate the weight of an edge by considering its probability of being within a module or between modules. To estimate these positional probabilities, we propose a machine-learning-based classification system that we train with data gathered from a real-world OO reference system. We have implemented an automatic module extraction tool and evaluated the proposed approach on several open-source and industrial projects. The experimental results show that the proposed approach generates highly accurate decompositions that are close to authoritative module structures and outperforms existing methods. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:156 / 177
页数:22
相关论文
共 65 条
  • [1] Andreopoulos B, 2005, WCRE: 12TH WORKING CONFERENCE ON REVERSE ENGINEERING 2005, PROCEEDINGS, P79
  • [2] Information-theoretic software clustering
    Andritsos, P
    Tzerpos, V
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2005, 31 (02) : 150 - 165
  • [3] [Anonymous], 8 IEEE INT C SOFTW E
  • [4] [Anonymous], 2007, DYNAM CONT DIS SER B
  • [5] [Anonymous], ENTERP SYST J
  • [6] [Anonymous], INF SOFTW TECHNOL
  • [7] [Anonymous], MOR KAUF D
  • [8] [Anonymous], EM ALGORITHM EXTENSI
  • [9] [Anonymous], EC IMP IN INFR SOFTW
  • [10] [Anonymous], P 18 INT C PROGR COM