
First Go, then Post-Explore: The Benefits of Post-Exploration in Intrinsic Motivation

  • Zhao Yang
  • Thomas M. Moerland
  • Mike Preuss
  • Aske Plaat

Research output: Chapter in Book / Report / Conference proceeding; Conference contribution; Academic; peer-review

Abstract

Go-Explore achieved breakthrough performance on challenging reinforcement learning (RL) tasks with sparse rewards. The key insight of Go-Explore was that successful exploration requires an agent to first return to an interesting state (‘Go’), and only then explore into unknown terrain (‘Explore’). We refer to such exploration after a goal is reached as ‘post-exploration’. In this paper, we present a clear ablation study of post-exploration in a general intrinsically motivated goal exploration process (IMGEP) framework, which the Go-Explore paper did not provide. We study the isolated potential of post-exploration by turning it on and off within the same algorithm, in both tabular and deep RL settings, on both discrete navigation and continuous control tasks. Experiments on a range of MiniGrid and MuJoCo environments show that post-exploration indeed helps IMGEP agents reach more diverse states and boosts their performance. In short, our work suggests that RL researchers should consider using post-exploration in IMGEP when possible, since it is effective, method-agnostic, and easy to implement.
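
The on/off comparison described in the abstract can be illustrated with a toy tabular sketch. This is not the authors' implementation: the grid world, the uniform goal sampling from visited states, the greedy walk standing in for a learned goal-conditioned policy, and all names and parameters (`run_imgep`, `post_steps`, `epsilon`, and so on) are assumptions made for illustration only.

```python
import random

# Four grid moves: up, down, right, left (hypothetical toy environment).
MOVES = [(0, 1), (0, -1), (1, 0), (-1, 0)]


def step(state, move, size):
    """Apply a move and clip the result to the grid borders."""
    x, y = state
    dx, dy = move
    return (min(max(x + dx, 0), size - 1), min(max(y + dy, 0), size - 1))


def greedy_move(state, goal):
    """One cell toward the goal; a stand-in for a learned goal-conditioned policy."""
    x, y = state
    gx, gy = goal
    if x != gx:
        return (1 if gx > x else -1, 0)
    return (0, 1 if gy > y else -1)


def run_imgep(post_explore, episodes=200, size=11, post_steps=5,
              epsilon=0.0, max_go_steps=50, seed=0):
    """Return the number of distinct states visited by a toy goal-exploration loop."""
    rng = random.Random(seed)
    start = (size // 2, size // 2)
    visited = {start}
    for _episode in range(episodes):
        # Assumption: goals are sampled uniformly from states seen so far.
        goal = rng.choice(sorted(visited))
        state = start
        # 'Go' phase: head for the goal, with optional epsilon-random moves.
        for _t in range(max_go_steps):
            if state == goal:
                break
            if rng.random() < epsilon:
                move = rng.choice(MOVES)
            else:
                move = greedy_move(state, goal)
            state = step(state, move, size)
            visited.add(state)
        # Post-exploration: random steps taken only after the goal is reached.
        if post_explore and state == goal:
            for _t in range(post_steps):
                state = step(state, rng.choice(MOVES), size)
                visited.add(state)
    return len(visited)


if __name__ == "__main__":
    print("without post-exploration:", run_imgep(post_explore=False))
    print("with post-exploration:   ", run_imgep(post_explore=True))
```

With `epsilon=0.0`, the variant without post-exploration can only ever select the start state as its goal, so it never leaves it, while the variant with post-exploration keeps adding newly reached states to its goal set. Setting `epsilon` above zero gives the baseline some exploration of its own, which is closer in spirit to the comparison the abstract describes.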
Original language: English
Title of host publication: ICAART 2023 - Proceedings of the 15th International Conference on Agents and Artificial Intelligence
Subtitle of host publication: Volume 2
Editors: A. Rocha, L. Steels, J. van den Herik
Publisher: SciTePress
Pages: 27-34
Number of pages: 8
Volume: 2
ISBN (Print): 9789897586231
DOIs
Publication status: Published - 2023
Externally published: Yes
Event: 15th International Conference on Agents and Artificial Intelligence, ICAART 2023 - Lisbon, Portugal
Duration: 22 Feb 2023 to 24 Feb 2023

Publication series

Name: International Conference on Agents and Artificial Intelligence
ISSN (Print): 2184-3589
ISSN (Electronic): 2184-433X

Conference

Conference: 15th International Conference on Agents and Artificial Intelligence, ICAART 2023
Country/Territory: Portugal
City: Lisbon
Period: 22/02/23 to 24/02/23
