The Qualitas Corpus: A Curated Collection of Java']Java Code for Empirical Studies

被引:196
|
作者
Tempero, Ewan [1 ]
Anslow, Craig [4 ]
Dietrich, Jens [2 ]
Han, Ted [1 ]
Li, Jing [1 ]
Lumpe, Markus [3 ]
Melton, Hayden [1 ]
Noble, James [4 ]
机构
[1] Univ Auckland, Dept Comp Sci, Auckland, New Zealand
[2] Massey Univ, Sch Engn & Adv Technol, Palmerston North, New Zealand
[3] Swinburne Univ Technol, Fac Informat & Commun Technol, Hawthorn, Vic, Australia
[4] Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington, New Zealand
来源
17TH ASIA PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC 2010) | 2010年
关键词
Empirical studies; curated code corpus; experimental infrastructure; SOFTWARE; METRICS;
D O I
10.1109/APSEC.2010.46
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In order to increase our ability to use measurement to support software development practise we need to do more analysis of code. However, empirical studies of code are expensive and their results are difficult to compare. We describe the Qualitas Corpus, a large curated collection of open source Java systems. The corpus reduces the cost of performing large empirical studies of code and supports comparison of measurements of the same artifacts. We discuss its design, organisation, and issues associated with its development.
引用
收藏
页码:336 / 345
页数:10
相关论文
共 38 条
  • [1] Empirical analysis of the relationship between CC and SLOC in a large corpus of Java']Java methods
    Landman, Davy
    Serebrenik, Alexander
    Vinju, Jurgen
    2014 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME), 2014, : 221 - 230
  • [2] Predicting Defects in SAP Java']Java Code: An Experience Report
    Holschuh, Tilman
    Paeuser, Markus
    Herzig, Kim
    Zimmermann, Thomas
    Premraj, Rahul
    Zeller, Andreas
    2009 31ST INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, COMPANION VOLUME, 2009, : 172 - 181
  • [3] An Empirical Study on the Practice of Maintaining Object-Relational Mapping Code in Java']Java Systems
    Chen, Tse-Hsun
    Shang, Weiyi
    Yang, Jinqiu
    Hassan, Ahmed E.
    Godfrey, Michael W.
    Nasser, Mohamed
    Flora, Parminder
    13TH WORKING CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2016), 2016, : 165 - 176
  • [4] Empirical analysis of the relationship between CC and SLOC in a large corpus of Java']Java methods and C functions
    Landman, Davy
    Serebrenik, Alexander
    Bouwers, Eric
    Vinju, Jurgen J.
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2016, 28 (07) : 589 - 618
  • [5] A large study on the effect of code obfuscation on the quality of java']java code
    Ceccato, Mariano
    Capiluppi, Andrea
    Falcarin, Paolo
    Boldyreff, Cornelia
    EMPIRICAL SOFTWARE ENGINEERING, 2015, 20 (06) : 1486 - 1524
  • [6] Injecting Shortcuts for Faster Running Java']Java Code
    Brownlee, Alexander E., I
    Petke, Justyna
    Rasburn, Anna F.
    2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2020,
  • [7] Towards a Curated Collection of Code Clones
    Tempero, Ewan
    2013 7TH INTERNATIONAL WORKSHOP ON SOFTWARE CLONES (IWSC), 2013, : 53 - 59
  • [8] An Empirical Study on Maintainable Method Size in Java']Java
    Chowdhury, Shaiful Alam
    Uddin, Gias
    Holmes, Reid
    2022 MINING SOFTWARE REPOSITORIES CONFERENCE (MSR 2022), 2022, : 252 - 264
  • [9] Automatic Source Code Summarization of Context for Java']Java Methods
    McBurney, Paul W.
    McMillan, Collin
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2016, 42 (02) : 103 - 119
  • [10] An Empirical Study on the Use and Misuse of Java']Java 8 Streams
    Khatchadourian, Raffi
    Tang, Yiming
    Bagherzadeh, Mehdi
    Ray, Baishakhi
    FUNDAMENTAL APPROACHES TO SOFTWARE ENGINEERING (FASE 2020), 2020, 12076 : 97 - 118