Ticket Tagger: Machine Learning Driven Issue Classification

被引:77
作者
Kallis, Rafael [1 ]
Di Sorbo, Andrea [2 ]
Canfora, Gerardo [2 ]
Panichella, Sebastiano [3 ]
机构
[1] Univ Zurich, Dept Informat, Zurich, Switzerland
[2] Univ Sannio, Dept Engn, Benevento, Italy
[3] Zurich Univ Appl Sci, Sch Engn, Zurich, Switzerland
来源
2019 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2019) | 2019年
关键词
Software maintenance and evolution; Issue Processing; Unstructured Data Labeling;
D O I
10.1109/ICSME.2019.00070
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Software maintenance is crucial for software projects evolution and success: code should be kept up-to-date and error-free, this with little effort and continuous updates for the end-users. In this context, issue trackers are essential tools for creating, managing and addressing the several (often hundreds of) issues that occur in software systems. A critical aspect for handling and prioritizing issues involves the assignment of labels to them (e.g., for projects hosted on GitHub), in order to determine the type (e.g., bug report, feature request and so on) of each specific issue. Although this labeling process has a positive impact on the effectiveness of issue processing, the current labeling mechanism is scarcely used on GitHub. In this demo, we introduce a tool, called Ticket Tagger, which leverages machine learning strategies on issue titles and descriptions for automatically labeling GitHub issues. Ticket Tagger automatically predicts the labels to assign to issues, with the aim of stimulating the use of labeling mechanisms in software projects, this to facilitate the issue management and prioritization processes. Along with the presentation of the tool's architecture and usage, we also evaluate its effectiveness in performing the issue labeling/classification process, which is critical to help maintainers to keep control of their workloads by focusing on the most critical issue tickets.
引用
收藏
页码:406 / 409
页数:4
相关论文
共 14 条
[1]   On the Usage of Python']Pythonic Idioms [J].
Alexandru, Carol V. ;
Merchante, Jose J. ;
Panichella, Sebastiano ;
Proksch, Sebastian ;
Gall, Harald C. ;
Robles, Gregorio .
ONWARD!'18: PROCEEDINGS OF THE 2018 ACM SIGPLAN INTERNATIONAL SYMPOSIUM ON NEW IDEAS, NEW PARADIGMS, AND REFLECTIONS ON PROGRAMMING AND SOFTWARE, 2018, :1-11
[2]  
Antoniol Giuliano, 2008, P 2008 C CTR ADV STU
[3]  
Bissyandé TF, 2013, PROC INT SYMP SOFTW, P188, DOI 10.1109/ISSRE.2013.6698918
[4]  
Cabot J, 2015, 2015 22ND INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION, AND REENGINEERING (SANER), P550, DOI 10.1109/SANER.2015.7081875
[5]  
Izquierdo JLC, 2015, 2015 22ND INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION, AND REENGINEERING (SANER), P479, DOI 10.1109/SANER.2015.7081860
[6]   DECA: Development Emails Content Analyzer [J].
Di Sorbo, Andrea ;
Panichella, Sebastiano ;
Visaggio, Corrado A. ;
Di Penta, Massimiliano ;
Canfora, Gerardo ;
Gall, Harald .
2016 IEEE/ACM 38TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING COMPANION (ICSE-C), 2016, :641-644
[7]   Where is the Road for Issue Reports Classification Based on Text Mining? [J].
Fan, Qiang ;
Yu, Yue ;
Yin, Gang ;
Wang, Tao ;
Wang, Huaimin .
11TH ACM/IEEE INTERNATIONAL SYMPOSIUM ON EMPIRICAL SOFTWARE ENGINEERING AND MEASUREMENT (ESEM 2017), 2017, :121-130
[8]  
Floris P., 2010, SOURCE 2 VALUE
[9]  
Herzig K, 2013, PROCEEDINGS OF THE 35TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2013), P392, DOI 10.1109/ICSE.2013.6606585
[10]  
Joulin A., 2016, P ACL