Demo: Automatically Retrainable Self Improving Model for the Automated Classification of Software Incidents into Multiple Classes

被引:2
作者
Agrawal, Badal [1 ]
Mishra, Mohit [2 ]
机构
[1] Microsoft India, Hyderabad, India
[2] IIIT Guwahati, Gauhati, India
来源
2021 IEEE 41ST INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2021) | 2021年
关键词
Bug reports; NLP; Machine learning; Bug Triaging; Automatic retraining; Feedback;
D O I
10.1109/ICDCS51616.2021.00113
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Developers across most of the organizations face the issue of manually dealing with the classification of the software bug reports. Software bug reports often contain text and other useful information that are common for a particular type of bug. This information can be extracted using the techniques of Natural Language Processing and combined with the manual classification done by the developers until now to create a properly labelled data set for training a supervised learning model for automatically classifying the bug reports into their respective categories. Previous studies have only focused on binary classification of software incident reports as bug and non-bug. Our novel approach achieves an accuracy of 76.94% for a 10-factor classification problem on the bug repository created by Microsoft Dynamics 365 team. In addition, we propose a novel method for automatically retraining the model and updating it with developer feedback in case of misclassification that will significantly reduce the maintenance cost and effort.
引用
收藏
页码:1110 / 1113
页数:4
相关论文
共 7 条
[1]  
Drucker H, 1996, ADV NEUR IN, V8, P479
[2]  
Gujral S, 2015, 2015 1ST INTERNATIONAL CONFERENCE ON FUTURISTIC TRENDS ON COMPUTATIONAL ANALYSIS AND KNOWLEDGE MANAGEMENT (ABLAZE), P599, DOI 10.1109/ABLAZE.2015.7154933
[3]  
Herzig K, 2013, PROCEEDINGS OF THE 35TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2013), P392, DOI 10.1109/ICSE.2013.6606585
[4]   A Novel Deep-Learning-Based Bug Severity Classification Technique Using Convolutional Neural Networks and Random Forest with Boosting [J].
Kukkar, Ashima ;
Mohana, Rajni ;
Nayyar, Anand ;
Kim, Jeamin ;
Kang, Byeong-Gwon ;
Chilamkurti, Naveen .
SENSORS, 2019, 19 (13)
[5]  
Mikolov T., 2013, 1 INT C LEARN REPR I, DOI DOI 10.48550/ARXIV.1301.3781
[6]   Automated classification of software issue reports using machine learning techniques: an empirical study [J].
Pandey N. ;
Sanyal D.K. ;
Hudait A. ;
Sen A. .
Innovations in Systems and Software Engineering, 2017, 13 (04) :279-297
[7]  
Sainin MS, 2012, 2012 4TH CONFERENCE ON DATA MINING AND OPTIMIZATION (DMO), P59, DOI 10.1109/DMO.2012.6329799