Log Parsing Evaluation in the Era of Modern Software Systems

Stefan Petrescu, Floris Den Hengst, Alexandru Uta, Jan S. Rellermeyer

Research output: Chapter in Book / Report / Conference proceeding › Conference contribution › Academic › peer-review

Abstract

Due to the complexity and size of modern software systems, the volume of logs generated is tremendous. Hence, it is infeasible to manually investigate these data in a reasonable time, which makes automated log analysis necessary to derive insights about the functioning of the systems. Motivated by an industry use case, we zoom in on one integral part of automated log analysis, log parsing, which is the prerequisite to deriving any insights from logs. Our investigation reveals problematic aspects within the log parsing field, particularly its inefficiency in handling heterogeneous real-world logs. We show this by assessing the 14 most-recognized log parsing approaches in the literature using (i) nine publicly available datasets, (ii) one dataset composed of combined publicly available data, and (iii) one dataset generated within the infrastructure of a large bank. Subsequently, toward improving log parsing robustness in real-world production scenarios, we propose a tool, LOGCHIMERA, that enables estimating log parsing performance in industry contexts by generating synthetic log data that resemble industry logs. Our contributions serve as a foundation to consolidate past research efforts, facilitate future research advancements, and establish a strong link between research and industry log parsing.

Original language: English
Title of host publication: 2023 IEEE 34th International Symposium on Software Reliability Engineering (ISSRE)
Subtitle of host publication: [Proceedings]
Publisher: Institution of Electrical Engineers (IEE)
Pages: 379-390
Number of pages: 12
ISBN (Electronic): 9798350315943
ISBN (Print): 9798350315950
Publication status: Published - 2023
Event: IEEE 34th International Symposium on Software Reliability Engineering - Florence, Italy
Duration: 9 Oct 2023 to 12 Oct 2023
https://issre.github.io/2023/

Conference

Conference: IEEE 34th International Symposium on Software Reliability Engineering
Abbreviated title: ISSRE 2023
Country/Territory: Italy
City: Florence
Period: 9/10/23 to 12/10/23
