A graph-based dataset of commit history of real-world Android apps

Franz Xaver Geiger, Ivano Malavolta, Luca Pascarella, Fabio Palomba, Dario Di Nucci, Alberto Bacchelli

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

Obtaining a good dataset to conduct empirical studies on the engineering of Android apps is an open challenge. To start tackling this challenge, we present AndroidTimeMachine, the first, self-contained, publicly available dataset weaving spread-out data sources about real-world, open-source Android apps. Encoded as a graph-based database, AndroidTimeMachine concerns 8,431 real open-source Android apps and contains: (i) metadata about the apps' GitHub projects, (ii) Git repositories with full commit history and (iii) metadata extracted from the Google Play store, such as app ratings and permissions.

LanguageEnglish
Title of host publicationProceedings - 2018 ACM/IEEE 15th International Conference on Mining Software Repositories, MSR 2018
Place of PublicationNew York, NY
PublisherACM, IEEE Computer Society
Pages30-33
Number of pages4
ISBN (Print)9781450357166
DOIs
StatePublished - 28 May 2018
Event15th ACM/IEEE International Conference on Mining Software Repositories, MSR 2018, co-located with the 40th International Conference on Software Engineering, ICSE 2018 - Gothenburg, Sweden
Duration: 28 May 201829 May 2018

Conference

Conference15th ACM/IEEE International Conference on Mining Software Repositories, MSR 2018, co-located with the 40th International Conference on Software Engineering, ICSE 2018
CountrySweden
CityGothenburg
Period28/05/1829/05/18

Fingerprint

Application programs
Metadata
Android (operating system)

Keywords

  • Android
  • dataset
  • mining software repositories

Cite this

Geiger, F. X., Malavolta, I., Pascarella, L., Palomba, F., Di Nucci, D., & Bacchelli, A. (2018). A graph-based dataset of commit history of real-world Android apps. In Proceedings - 2018 ACM/IEEE 15th International Conference on Mining Software Repositories, MSR 2018 (pp. 30-33). New York, NY: ACM, IEEE Computer Society. DOI: 10.1145/3196398.3196460
Geiger, Franz Xaver ; Malavolta, Ivano ; Pascarella, Luca ; Palomba, Fabio ; Di Nucci, Dario ; Bacchelli, Alberto. / A graph-based dataset of commit history of real-world Android apps. Proceedings - 2018 ACM/IEEE 15th International Conference on Mining Software Repositories, MSR 2018. New York, NY : ACM, IEEE Computer Society, 2018. pp. 30-33
@inproceedings{03564a7c2f4b4d1a8c110f203ac9c218,
title = "A graph-based dataset of commit history of real-world Android apps",
abstract = "Obtaining a good dataset to conduct empirical studies on the engineering of Android apps is an open challenge. To start tackling this challenge, we present AndroidTimeMachine, the first, self-contained, publicly available dataset weaving spread-out data sources about real-world, open-source Android apps. Encoded as a graph-based database, AndroidTimeMachine concerns 8,431 real open-source Android apps and contains: (i) metadata about the apps' GitHub projects, (ii) Git repositories with full commit history and (iii) metadata extracted from the Google Play store, such as app ratings and permissions.",
keywords = "Android, dataset, mining software repositories",
author = "Geiger, {Franz Xaver} and Ivano Malavolta and Luca Pascarella and Fabio Palomba and {Di Nucci}, Dario and Alberto Bacchelli",
year = "2018",
month = "5",
day = "28",
doi = "10.1145/3196398.3196460",
language = "English",
isbn = "9781450357166",
pages = "30--33",
booktitle = "Proceedings - 2018 ACM/IEEE 15th International Conference on Mining Software Repositories, MSR 2018",
publisher = "ACM, IEEE Computer Society",

}

Geiger, FX, Malavolta, I, Pascarella, L, Palomba, F, Di Nucci, D & Bacchelli, A 2018, A graph-based dataset of commit history of real-world Android apps. in Proceedings - 2018 ACM/IEEE 15th International Conference on Mining Software Repositories, MSR 2018. ACM, IEEE Computer Society, New York, NY, pp. 30-33, 15th ACM/IEEE International Conference on Mining Software Repositories, MSR 2018, co-located with the 40th International Conference on Software Engineering, ICSE 2018, Gothenburg, Sweden, 28/05/18. DOI: 10.1145/3196398.3196460

A graph-based dataset of commit history of real-world Android apps. / Geiger, Franz Xaver; Malavolta, Ivano; Pascarella, Luca; Palomba, Fabio; Di Nucci, Dario; Bacchelli, Alberto.

Proceedings - 2018 ACM/IEEE 15th International Conference on Mining Software Repositories, MSR 2018. New York, NY : ACM, IEEE Computer Society, 2018. p. 30-33.

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

TY - GEN

T1 - A graph-based dataset of commit history of real-world Android apps

AU - Geiger,Franz Xaver

AU - Malavolta,Ivano

AU - Pascarella,Luca

AU - Palomba,Fabio

AU - Di Nucci,Dario

AU - Bacchelli,Alberto

PY - 2018/5/28

Y1 - 2018/5/28

N2 - Obtaining a good dataset to conduct empirical studies on the engineering of Android apps is an open challenge. To start tackling this challenge, we present AndroidTimeMachine, the first, self-contained, publicly available dataset weaving spread-out data sources about real-world, open-source Android apps. Encoded as a graph-based database, AndroidTimeMachine concerns 8,431 real open-source Android apps and contains: (i) metadata about the apps' GitHub projects, (ii) Git repositories with full commit history and (iii) metadata extracted from the Google Play store, such as app ratings and permissions.

AB - Obtaining a good dataset to conduct empirical studies on the engineering of Android apps is an open challenge. To start tackling this challenge, we present AndroidTimeMachine, the first, self-contained, publicly available dataset weaving spread-out data sources about real-world, open-source Android apps. Encoded as a graph-based database, AndroidTimeMachine concerns 8,431 real open-source Android apps and contains: (i) metadata about the apps' GitHub projects, (ii) Git repositories with full commit history and (iii) metadata extracted from the Google Play store, such as app ratings and permissions.

KW - Android

KW - dataset

KW - mining software repositories

UR - http://www.scopus.com/inward/record.url?scp=85051647148&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85051647148&partnerID=8YFLogxK

U2 - 10.1145/3196398.3196460

DO - 10.1145/3196398.3196460

M3 - Conference contribution

SN - 9781450357166

SP - 30

EP - 33

BT - Proceedings - 2018 ACM/IEEE 15th International Conference on Mining Software Repositories, MSR 2018

PB - ACM, IEEE Computer Society

CY - New York, NY

ER -

Geiger FX, Malavolta I, Pascarella L, Palomba F, Di Nucci D, Bacchelli A. A graph-based dataset of commit history of real-world Android apps. In Proceedings - 2018 ACM/IEEE 15th International Conference on Mining Software Repositories, MSR 2018. New York, NY: ACM, IEEE Computer Society. 2018. p. 30-33. Available from, DOI: 10.1145/3196398.3196460