A Graph-based Dataset of Commit History of Real-World Android apps

被引:29
作者
Geiger, Franz-Xaver [1 ]
Malavolta, Ivano [1 ]
Pascarella, Luca [2 ]
Palomba, Fabio [3 ]
Di Nucci, Dario [4 ]
Bacchelli, Alberto [3 ]
机构
[1] Vrije Univ Amsterdam, Amsterdam, Netherlands
[2] Delft Univ Technol, Delft, Netherlands
[3] Univ Zurich, Zurich, Switzerland
[4] Vrije Univ Brussel, Brussels, Belgium
来源
2018 IEEE/ACM 15TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR) | 2018年
基金
瑞士国家科学基金会;
关键词
Android; Mining Software Repositories; Dataset;
D O I
10.1145/3196398.3196460
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Obtaining a good dataset to conduct empirical studies on the engineering of Android apps is an open challenge. To start tackling this challenge, we present AndroidTimeMachine, the first, self-contained, publicly available dataset weaving spread-out data sources about real-world, open-source Android apps. Encoded as a graph-based database, AndroidTimeMachine concerns 8,431 real open-source Android apps and contains: (i) metadata about the apps' GitHub projects, (ii) Git repositories with full commit history and (iii) metadata extracted from the Google Play store, such as app ratings and permissions.
引用
收藏
页码:30 / 33
页数:4
相关论文
共 15 条
  • [1] [Anonymous], 2018, 5 IEEE ACM INT C MOB
  • [2] An Empirical Analysis of the Docker Container Ecosystem on GitHub
    Cito, Jurgen
    Schermann, Gerald
    Witternt, John Erik
    Leitner, Philipp
    Zumberi, Sali
    Gall, Harald C.
    [J]. 2017 IEEE/ACM 14TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2017), 2017, : 323 - 333
  • [3] A Quantitative and Qualitative Investigation of Performance-Related Commits in Android Apps
    Das, Teerath
    Di Penta, Massimiliano
    Malavolta, Ivano
    [J]. 32ND IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2016), 2016, : 443 - 447
  • [4] Di Nucci D, 2017, 2017 IEEE 24TH INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION, AND REENGINEERING (SANER), P103, DOI 10.1109/SANER.2017.7884613
  • [5] Mining Software Engineering Data from GitHub
    Gousios, Georgios
    Spinellis, Diomidis
    [J]. PROCEEDINGS OF THE 2017 IEEE/ACM 39TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING COMPANION (ICSE-C 2017), 2017, : 501 - 502
  • [6] Joorabchi Mona Erfani, 2013, 2013 ACM / IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM), P15, DOI 10.1109/ESEM.2013.9
  • [7] An in-depth study of the promises and perils of mining GitHub
    Kalliamvakou, Eirini
    Gousios, Georgios
    Blincoe, Kelly
    Singer, Leif
    German, Daniel M.
    Damian, Daniela
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2016, 21 (05) : 2035 - 2071
  • [8] A Dataset of Open-Source Android Applications
    Krutz, Daniel E.
    Mirakhorli, Mehdi
    Malachowsky, Samuel A.
    Ruiz, Andres
    Peterson, Jacob
    Filipski, Andrew
    Smith, Jared
    [J]. 12TH WORKING CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2015), 2015, : 522 - 525
  • [9] Krutz Daniel E., 2017, WHO ADDED PERMISSION, P165, DOI [10.1109/MOBILESoft.2017.5, DOI 10.1109/MOBILESOFT.2017.5]
  • [10] Mining AndroZoo: A Retrospect
    Li, Li
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME), 2017, : 675 - 680