JTeC: A Large Collection of Java Test Classes for Test Code Analysis and Processing

Federico Corò, Roberto Verdecchia, Emilio Cruciani, Breno Miranda†, Antonia Bertolino

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

The recent push towards test automation and test-driven development continues to scale up the dimensions of test code that needs to be maintained, analysed, and processed side-by-side with production code. As a consequence, on the one side regression testing techniques, e.g., for test suite prioritization or test case selection, capable to handle such large-scale test suites become indispensable; on the other side, as test code exposes own characteristics, specific techniques for its analysis and refactoring are actively sought. We present JTeC, a large-scale dataset of test cases that researchers can use for benchmarking the above techniques or any other type of tool expressly targeting test code. JTeC collects more than 2.5M test classes belonging to 31K+ GitHub projects and summing up to more than 430 Million SLOCs of ready-to-use real-world test code.

Original languageEnglish
Title of host publicationMSR 2020
Subtitle of host publicationProceedings of the 17th International Conference on Mining Software Repositories
PublisherAssociation for Computing Machinery, Inc
Pages578-582
Number of pages5
ISBN (Electronic)9781450379571
DOIs
Publication statusPublished - 29 Jun 2020
Event17th IEEE/ACM International Conference on Mining Software Repositories, MSR 2020, co-located with the 42nd International Conference on Software Engineering. ICSE 2020 - Virtual, Online, Korea, Republic of
Duration: 29 Jun 202030 Jun 2020

Conference

Conference17th IEEE/ACM International Conference on Mining Software Repositories, MSR 2020, co-located with the 42nd International Conference on Software Engineering. ICSE 2020
Country/TerritoryKorea, Republic of
CityVirtual, Online
Period29/06/2030/06/20

Keywords

  • GitHub
  • Java
  • Large Scale
  • Software Testing
  • Test Suite

Fingerprint

Dive into the research topics of 'JTeC: A Large Collection of Java Test Classes for Test Code Analysis and Processing'. Together they form a unique fingerprint.

Cite this