Towards Training Set Reduction for Bug Triage

被引:23
|
作者
Zou, Weiqin [1 ]
Hu, Yan [1 ]
Xuan, Jifeng [2 ]
Jiang, He [1 ]
机构
[1] Dalian Univ Technol, Sch Software, Dalian, Peoples R China
[2] Dalian Univ Technol, Sch Mat Sci, Dalian, Peoples R China
基金
新加坡国家研究基金会;
关键词
bug triage; training set reduction; feature selection; instance selection; software quality; NATURAL-LANGUAGE;
D O I
10.1109/COMPSAC.2011.80
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Bug triage is an important step in the process of bug fixing. The goal of bug triage is to assign a new-coming bug to the correct potential developer. The existing bug triage approaches are based on machine learning algorithms, which build classifiers from the training sets of bug reports. In practice, these approaches suffer from the large-scale and low-quality training sets. In this paper, we propose the training set reduction with both feature selection and instance selection techniques for bug triage. We combine feature selection with instance selection to improve the accuracy of bug triage. The feature selection algorithm. chi(2)-test, instance selection algorithm Iterative Case Filter, and their combinations are studied in this paper. We evaluate the training set reduction on the bug data of Eclipse. For the training set, 70% words and 50% bug reports are removed after the training set reduction. The experimental results show that the new and small training sets can provide better accuracy than the original one.
引用
收藏
页码:576 / 581
页数:6
相关论文
共 50 条
  • [1] Towards Effective Bug Triage with Software Data Reduction Techniques
    Xuan, Jifeng
    Jiang, He
    Hu, Yan
    Ren, Zhilei
    Zou, Weiqin
    Luo, Zhongxuan
    Wu, Xindong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (01) : 264 - 280
  • [2] Enhancing Bug Report Assignment with an Optimized Reduction of Training Set
    Wei, Miaomiao
    Guo, Shikai
    Chen, Rong
    Gao, Jian
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2018, PT II, 2018, 11062 : 36 - 47
  • [3] Improving the Robustness of the Bug Triage Model through Adversarial Training
    Kim, Min-ha
    Wang, Dae-sung
    Wang, Sheng-tsai
    Park, Seo-Hyeon
    Lee, Chan-gun
    36TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2022), 2022, : 478 - 481
  • [4] DATA REDUCTION FOR BUG TRIAGE USING EFFECTIVE PREDICTION OF REDUCTION ORDER TECHNIQUES
    Govindasamy, V.
    Akila, V.
    Anjanadevi, G.
    Deepika, H.
    Sivasankari, G.
    2016 INTERNATIONAL CONFERENCE ON COMPUTATION OF POWER, ENERGY INFORMATION AND COMMUNICATION (ICCPEIC), 2016, : 85 - 90
  • [5] T-REC: Towards accurate bug triage for technical groups
    De Lara Pahins, Cicero Augusto
    D'Morison, Fabricio
    Rocha, Thiago M.
    Almeida, Larissa M.
    Batista, Arthur F.
    Souza, Diego F.
    Proceedings - 18th IEEE International Conference on Machine Learning and Applications, ICMLA 2019, 2019, : 889 - 895
  • [6] High-Dimensional Hybrid Data Reduction for Effective Bug Triage
    Ge, Xin
    Zheng, Shengjie
    Wang, Jiahui
    Li, Hui
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [7] Bug Prioritization to Facilitate Bug Report Triage
    Jaweria Kanwal
    Onaiza Maqbool
    JournalofComputerScience&Technology, 2012, 27 (02) : 397 - 412
  • [8] Bug Prioritization to Facilitate Bug Report Triage
    Jaweria Kanwal
    Onaiza Maqbool
    Journal of Computer Science and Technology, 2012, 27 : 397 - 412
  • [9] Improving Bug Triage with Bug Tossing Graphs
    Jeong, Gaeul
    Kim, Sunghun
    Zimmerman, Thomas
    7TH JOINT MEETING OF THE EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND THE ACM SIGSOFT SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, 2009, : 111 - 120
  • [10] Bug Prioritization to Facilitate Bug Report Triage
    Kanwal, Jaweria
    Maqbool, Onaiza
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2012, 27 (02) : 397 - 412