Mining expertise of developers from software repositories

被引:0
|
作者
Hammad, Maen [1 ]
Hijazi, Haneen [2 ]
Hammad, Mustafa [3 ]
Otoom, Ahmed Fawzi [1 ]
机构
[1] Hashemite Univ, Dept Software Engn, Zarqa, Jordan
[2] Hashemite Univ, Dept Comp Informat Syst, Zarqa, Jordan
[3] Mutah Univ, Dept Comp Sci, Mutah, Al Karak, Jordan
关键词
software maintenance and evolution; mining software repositories; expertise mining; ASSIGNMENT;
D O I
10.1504/IJCAT.2020.106581
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper presents a technique to mine the developers' contributions to explore their expertise in open source projects. The technique analyses the keywords that appear in the textual content of commits. It is a lightweight technique since the text in commits is analysed without making any syntactic code differencing. Each developer is linked with a list of keywords, with their frequencies, that appeared in his commits. Based on these keywords, three types of expertise are defined: unique, common and frequent. The identified expertise can help in identifying topics or issues that are handled by specific or group of developers. A tool is developed to automatically mine and analyse committed code changes to support expertise identification. A case study is presented on three open source projects to show how the proposed techniques can be applied. The observations of the study showed that frequent terms provide useful information about developers' expertise.
引用
收藏
页码:227 / 239
页数:13
相关论文
共 50 条
  • [31] Extracting enhanced artificial intelligence model metadata from software repositories
    Jason Tsay
    Alan Braz
    Martin Hirzel
    Avraham Shinnar
    Todd Mummert
    Empirical Software Engineering, 2022, 27
  • [32] Assigning change requests to software developers
    Kagdi, Huzefa
    Gethers, Malcom
    Poshyvanyk, Denys
    Hammad, Maen
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2012, 24 (01) : 3 - 33
  • [33] The Emotional Side of Software Developers in JIRA
    Ortu, Marco
    Murgia, Alessandro
    Destefanis, Giuseppe
    Tourani, Parastou
    Tonelli, Roberto
    Marchesi, Michele
    Adams, Bram
    13TH WORKING CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2016), 2016, : 480 - 483
  • [34] CodeCV: Mining Expertise of GitHub Users from Coding Activities
    Atzberger, Daniel
    Scordialo, Nico
    Cech, Tim
    Scheibel, Willy
    Trapp, Matthias
    Doellner, Juergen
    2022 IEEE 22ND INTERNATIONAL WORKING CONFERENCE ON SOURCE CODE ANALYSIS AND MANIPULATION (SCAM 2022), 2022, : 143 - 147
  • [35] Using Pig as a data preparation language for large-scale mining software repositories studies: An experience report
    Shang, Weiyi
    Adams, Bram
    Hassan, Ahmed E.
    JOURNAL OF SYSTEMS AND SOFTWARE, 2012, 85 (10) : 2195 - 2204
  • [36] Detecting API Usage Patterns from Software Repositories Using Method Categorization
    Akbar, Rizky Januar
    Omori, Takayuki
    Maruyama, Katsuhisa
    KNOWLEDGE-BASED SOFTWARE ENGINEERING, 2012, 240 : 237 - +
  • [37] On the reproducibility of empirical software engineering studies based on data retrieved from development repositories
    Gonzalez-Barahona, Jesus M.
    Robles, Gregorio
    EMPIRICAL SOFTWARE ENGINEERING, 2012, 17 (1-2) : 75 - 89
  • [38] On the reproducibility of empirical software engineering studies based on data retrieved from development repositories
    Jesús M. González-Barahona
    Gregorio Robles
    Empirical Software Engineering, 2012, 17 : 75 - 89
  • [39] Changeset-Based Topic Modeling of Software Repositories
    Corley, Christopher S.
    Damevski, Kostadin
    Kraft, Nicholas A.
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2020, 46 (10) : 1068 - 1080
  • [40] Exploring the software repositories of embedded systems: An industrial experience
    Polaczek, Jakub
    Sosnowski, Janusz
    INFORMATION AND SOFTWARE TECHNOLOGY, 2021, 131