Skip to main navigation Skip to search Skip to main content

Fear and hope emerge from anticipation in model-based reinforcement learning

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

Social agents and robots will require both learning and emotional capabilities to successfully enter society. This paper connects both challenges, by studying models of emotion generation in sequential decision-making agents. Previous work in this field has focussed on model-free reinforcement learning (RL). However, important emotions like hope and fear need anticipation, which requires a model and forward simulation. Taking inspiration from the psychological Belief-Desire Theory of Emotions (BDTE), our work specifies models of hope and fear based on best and worst forward traces. To efficiently estimate these traces, we integrate a well-known Monte Carlo Tree Search procedure (UCT) into a model based RL architecture. Test results in three known RL domains illustrate emotion dynamics, dependencies on policy and environmental stochasticity, and plausibility in individual Pacman game settings. Our models enable agents to naturally elicit hope and fear during learning, and moreover, explain what anticipated event caused this.
Original languageEnglish
Title of host publicationProceedings of the 25th International Joint Conference on Artificial Intelligence, IJCAI 2016
EditorsS. Kambhampati
PublisherInternational Joint Conferences on Artificial Intelligence
Pages848-854
Publication statusPublished - 2016
Externally publishedYes
Event25th International Joint Conference on Artificial Intelligence, IJCAI 2016 - New York, United States
Duration: 9 Jul 201615 Jul 2016

Publication series

NameIJCAI International Joint Conference on Artificial Intelligence
ISSN (Print)1045-0823

Conference

Conference25th International Joint Conference on Artificial Intelligence, IJCAI 2016
Country/TerritoryUnited States
CityNew York
Period9/07/1615/07/16

Fingerprint

Dive into the research topics of 'Fear and hope emerge from anticipation in model-based reinforcement learning'. Together they form a unique fingerprint.

Cite this