RefDetect: A Multi-Language Refactoring Detection Tool Based on String Alignment

被引:5
|
作者
Moghadam, Iman Hemati [1 ]
Cinneide, Mel O. [2 ]
Zarepour, Faezeh [3 ]
Jahanmir, Mohamad Aref [1 ]
机构
[1] Vali E Asr Univ Rafsanjan, Dept Comp Engn, Rafsanjan 7718897111, Iran
[2] Natl Univ Ireland, Sch Comp Sci, Dublin D04 V1W8, Ireland
[3] Allameh Jafari Inst Rafsanjan, Dept Comp Engn, Rafsanjan 77181, Iran
来源
IEEE ACCESS | 2021年 / 9卷
基金
爱尔兰科学基金会;
关键词
Tools; !text type='Java']Java[!/text; Software; C plus plus languages; Measurement; Unified modeling language; Software algorithms; Refactoring; refactoring detection; alignment algorithm; empirical studies; C plus plus; CODE;
D O I
10.1109/ACCESS.2021.3086689
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Refactoring is performed to improve software quality while leaving the behaviour of the software unchanged. Identifying refactorings applied to a software system is an important activity that leads to a better understanding of the evolution of the software system, and several techniques have been proposed and implemented to address this issue. The vast majority of existing refactoring detection techniques are language-specific, including the accepted state of the art, RMiner, which is exclusively Java-based. Although impressive performance has been achieved to date, there is scope for improvement in refactoring detection and such improvement would enhance both refactoring research and practice. In this paper, we propose a novel, language-neutral technique to identify refactorings in commit histories. Our approach is motivated by a desire to explore the use of string alignment algorithms in refactoring detection, and to determine if such approaches are competitive with the state of the art. The proposed approach has been implemented in a tool called RefDetect, evaluated, and compared with the current state-of-the-art refactoring detection tool: RMiner. In experiments we applied RefDetect to 514 commits of 185 Java applications containing 5,058 true refactoring instances, achieving an f-score slightly better than that achieved by RMiner (87.3% vs. 86%). RefDetect clearly outperformed RMiner in method and class based refactorings, achieving f-scores respectively of 87.7% vs. 81.7% for method-level refactorings and 92.1% vs. 86.9% for class-level refactorings. To demonstrate the language-independence of RefDetect, we conducted a further study with four C++ applications, achieving high values for both precision (96.1%) and recall (94.1%). The achieved results indicate that RefDetect performs better than the current state of the art in refactoring detection and is demonstrably capable of handling different programming languages.
引用
收藏
页码:86698 / 86727
页数:30
相关论文
共 50 条
  • [31] CDCAT: A Multi-Language Cross-Document Entity and Event Coreference Annotation Tool
    Yang Xu
    Boming Xia
    Yueliang Wan
    Fan Zhang
    Jiabo Xu
    Huansheng Ning
    TsinghuaScienceandTechnology, 2022, 27 (03) : 589 - 598
  • [32] CDCAT: A multi-language cross-document entity and event coreference annotation tool
    Xu, Yang
    Xia, Boming
    Wan, Yueliang
    Zhang, Fan
    Xu, Jiabo
    Ning, Huansheng
    TSINGHUA SCIENCE AND TECHNOLOGY, 2022, 27 (03) : 589 - 598
  • [33] ANALYSIS OF THE DNN-BASED SRE SYSTEMS IN MULTI-LANGUAGE CONDITIONS
    Novotny, Ondrej
    Matejka, Pavel
    Glembek, Ondrej
    Plchot, Oldrich
    Grezl, Frantisek
    Burget, Lukas
    Cernocky, Jan
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 199 - 204
  • [34] Multi-language: ensemble learning-based speech emotion recognition
    Sruthi, Anumula
    Kumar, Anumula Kalyan
    Dasari, Kishore
    Sivaramaiah, Yenugu
    Divya, Garikapati
    Kumar, Gunupudi Sai Chaitanya
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024, 19 (3) : 453 - 467
  • [35] Fast multi-language LSTM-based online handwriting recognition
    Carbune, Victor
    Gonnet, Pedro
    Deselaers, Thomas
    Rowley, Henry A.
    Daryin, Alexander
    Calvo, Marcos
    Wang, Li-Lun
    Keysers, Daniel
    Feuz, Sandro
    Gervais, Philippe
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2020, 23 (02) : 89 - 102
  • [36] Fast multi-language LSTM-based online handwriting recognition
    Victor Carbune
    Pedro Gonnet
    Thomas Deselaers
    Henry A. Rowley
    Alexander Daryin
    Marcos Calvo
    Li-Lun Wang
    Daniel Keysers
    Sandro Feuz
    Philippe Gervais
    International Journal on Document Analysis and Recognition (IJDAR), 2020, 23 : 89 - 102
  • [37] Multi-Language Handwritten Digits Recognition based on Novel Structural Features
    Alghazo, Jaafar M.
    Latif, Ghazanfar
    Alzubaidi, Loay
    Elhassan, Ammar
    JOURNAL OF IMAGING SCIENCE AND TECHNOLOGY, 2019, 63 (02)
  • [38] An Efficient New Multi-Language Clone Detection Approach from Large Source Code
    Rehman, Saif Ur
    Khan, Kamran
    Fong, Simon
    Biuk-Aghai, Robert
    PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 937 - 940
  • [39] Experience Report on Building ASTM Based Tools for Multi-Language Reverse Engineering
    Fleck, Guenter
    Kirchmayr, Wilhelm
    Moser, Michael
    Nocke, Ludwig
    Pichler, Josef
    Tober, Rudolf
    Witlatschil, Michael
    2016 IEEE 23RD INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION, AND REENGINEERING (SANER), VOL 1, 2016, : 683 - 687
  • [40] Multi-language Person Social Relation Extraction Model Based on Distant Supervision
    Huang, Yangchen
    Jia, Yan
    Huang, Jiuming
    He, Zhonghe
    2018 IEEE 3RD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA), 2018, : 368 - 374