New Methodology for Contextual Features Usage in Duplicate Bug Reports Detection

Cited by: 0
Authors
Neysiani, Behzad Soleimani [1]
Babamir, Seyed Morteza [1]
Affiliation
[1] Univ Kashan, Fac Comp & Elect Engn, Dept Software Engn, Kashan, Esfahan, Iran
Source
2019 5TH INTERNATIONAL CONFERENCE ON WEB RESEARCH (ICWR) | 2019
Keywords
Information Retrieval; Natural Language Processing; Duplicate Detection; Bug Reports; Topic; Feature Expansion
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Duplicate bug report detection is one of the major problems that software triage systems such as Bugzilla face when handling end-user requests. A user request contains some categorical fields and, above all, textual fields, which require feature extraction before duplicates can be detected. Contextual and topical features are obtained by computing the cosine similarity between the term frequency, inverse document frequency, or BM25F representations of a pair of bug reports against a set of topics. This research proposes using an individual Manhattan distance similarity for every topic in the contextual features, instead of the cosine similarity, in order to expand the feature dimension, which can increase the accuracy of duplicate bug report detection. Four well-known bug report datasets (Android, Eclipse, Mozilla, and OpenOffice) were used to evaluate the proposed method, and the experimental results indicate a performance improvement for four contextual features covering general, cryptography, network, and Java topics.
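The contrast the abstract draws between one cosine value per context and one distance value per topic can be sketched as follows. This is only one possible reading of the abstract, not the authors' implementation: the topic word lists, the function names, and the use of raw term counts (rather than inverse document frequency or BM25F weights) are all assumptions made for illustration.

```python
import math
from collections import Counter

# Hypothetical topic word lists; the paper's actual lists are not given here.
TOPICS = {
    "network": ["socket", "http", "timeout"],
    "cryptography": ["key", "cipher", "hash"],
}


def topic_scores(text, topics=TOPICS):
    """Score a report against each topic by summing raw term counts (assumed weighting)."""
    counts = Counter(text.lower().split())
    return [sum(counts[word] for word in words) for words in topics.values()]


def cosine_feature(a, b):
    """Baseline: a single cosine similarity over the whole topic-score vector."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def per_topic_features(a, b):
    """Expanded features: one absolute difference per topic.

    The values are the individual terms of the Manhattan distance,
    so their sum equals the Manhattan distance between a and b.
    """
    return [abs(x - y) for x, y in zip(a, b)]


report_1 = "HTTP timeout when socket closes"
report_2 = "socket timeout after cipher key exchange"

scores_1 = topic_scores(report_1)
scores_2 = topic_scores(report_2)

print(cosine_feature(scores_1, scores_2))      # one feature for the whole context
print(per_topic_features(scores_1, scores_2))  # one feature per topic
```

Under this reading, a context with k topics contributes k features to the classifier instead of one, which is the sense in which the feature dimension is expanded. Whether the paper converts each per-topic distance into a similarity score is not stated in the abstract, so the sketch leaves the raw differences as they are.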
Pages: 178-183
Page count: 6