Skip to main navigation Skip to search Skip to main content

Improving Model Inference in Industry by Combining Active and Passive Learning

  • N. Yang
  • , K. Aslam
  • , R. Schiffelers
  • , L. Lensink
  • , D. Hendriks
  • , L. Cleophas
  • , A. Serebrenik

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

© 2019 IEEE.Inferring behavioral models (e.g., state machines) of software systems is an important element of re-engineering activities. Model inference techniques can be categorized as active or passive learning, constructing models by (dynamically) interacting with systems or (statically) analyzing traces, respectively. Application of those techniques in the industry is, however, hindered by the trade-off between learning time and completeness achieved (active learning) or by incomplete input logs (passive learning). We investigate the learning time/completeness achieved trade-off of active learning with a pilot study at ASML, provider of lithography systems for the semiconductor industry. To resolve the trade-off we advocate extending active learning with execution logs and passive learning results.We apply the extended approach to eighteen components used in ASML TWINSCAN lithography machines. Compared to traditional active learning, our approach significantly reduces the active learning time. Moreover, it is capable of learning the behavior missed by the traditional active learning approach.
Original languageEnglish
Title of host publicationSANER 2019 - Proceedings of the 2019 IEEE 26th International Conference on Software Analysis, Evolution, and Reengineering
EditorsE. Shihab, D. Lo, X. Wang
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages253-263
ISBN (Electronic)9781728105918
ISBN (Print)9781728105918
DOIs
Publication statusPublished - 15 Mar 2019
Externally publishedYes
Event26th IEEE International Conference on Software Analysis, Evolution, and Reengineering, SANER 2019 - Hangzhou, China
Duration: 24 Feb 201927 Feb 2019

Publication series

NameSANER 2019 - Proceedings of the 2019 IEEE 26th International Conference on Software Analysis, Evolution, and Reengineering

Conference

Conference26th IEEE International Conference on Software Analysis, Evolution, and Reengineering, SANER 2019
Country/TerritoryChina
CityHangzhou
Period24/02/1927/02/19

Funding

We consider several directions as future work. First, we plan to perform a more comprehensive study with the sequential equivalence oracle on a richer set of components (i.e., more components from different types of systems). This will allow us to investigate more variables and provide users a guideline to adapt our techniques in their own context. For example, it would be valuable to study what features of traces can complement the Wp-method better, which passive learning algorithms work best for our PL-based oracle, and in which scenario one oracle set-up performs better than others. Second, while in this study we apply the techniques to MDSE-based software, we plan to replicate our work on legacy software. Moreover, the current approach is limited to the class of systems where values of data parameters do not influence the behavior. Learning data-dependent behavior is still a challenge for model inference techniques in terms of scalability [28]. It might require us to open the black-box of the SUL (cf. Howar et al. [15]) and integrate static analysis techniques. Finally, one can integrate the distance-metric approach of Smetsers et al. [31], [35] with the use of logs advocated in the current work by employing recently proposed log-log and log-model comparison techniques of Gupta et al. [13] or Amar et al. [1]. ACKNOWLEDGEMENT This research was partially supported by The Dutch Ministry of Economic Affairs, ESI (part of TNO) and ASML Netherlands B.V., carried out as part of the TKI-project ‘Transposition’; and partially supported by Eindhoven University of Technology and ASML Netherlands B.V., carried out as part of the IMPULS II project.

Funders
Ministerie van Economische Zaken
Technische Universiteit Eindhoven
Erwin Schrödinger International Institute for Mathematics and Physics

    Fingerprint

    Dive into the research topics of 'Improving Model Inference in Industry by Combining Active and Passive Learning'. Together they form a unique fingerprint.

    Cite this