Outlier Detection with Reinforcement Learning for Costly to Verify Data

Michiel Nijhuis*, Iman van Lelyveld

*Corresponding author for this work

Research output: Contribution to JournalArticleAcademicpeer-review

Abstract

Outliers are often present in data and many algorithms exist to find these outliers. Often we can verify these outliers to determine whether they are data errors or not. Unfortunately, checking such points is time-consuming and the underlying issues leading to the data error can change over time. An outlier detection approach should therefore be able to optimally use the knowledge gained from the verification of the ground truth and adjust accordingly. With advances in machine learning, this can be achieved by applying reinforcement learning on a statistical outlier detection approach. The approach uses an ensemble of proven outlier detection methods in combination with a reinforcement learning approach to tune the coefficients of the ensemble with every additional bit of data. The performance and the applicability of the reinforcement learning outlier detection approach are illustrated using granular data reported by Dutch insurers and pension funds under the Solvency II and FTK frameworks. The application shows that outliers can be identified by the ensemble learner. Moreover, applying the reinforcement learner on top of the ensemble model can further improve the results by optimising the coefficients of the ensemble learner.

Original languageEnglish
Article number842
Pages (from-to)1-17
Number of pages17
JournalEntropy
Volume25
Issue number6
DOIs
Publication statusPublished - Jun 2023

Bibliographical note

Publisher Copyright:
© 2023 by the authors.

Keywords

  • algorithm design
  • anomaly detection
  • outlier detection
  • outlier ensembles
  • reinforcement learning

Fingerprint

Dive into the research topics of 'Outlier Detection with Reinforcement Learning for Costly to Verify Data'. Together they form a unique fingerprint.

Cite this