TY - GEN
T1 - Fear and hope emerge from anticipation in model-based reinforcement learning
AU - Moerland, Thomas
AU - Broekens, Joost
AU - Jonker, Catholijn
PY - 2016
Y1 - 2016
N2 - Social agents and robots will require both learning and emotional capabilities to successfully enter society. This paper connects both challenges, by studying models of emotion generation in sequential decision-making agents. Previous work in this field has focussed on model-free reinforcement learning (RL). However, important emotions like hope and fear need anticipation, which requires a model and forward simulation. Taking inspiration from the psychological Belief-Desire Theory of Emotions (BDTE), our work specifies models of hope and fear based on best and worst forward traces. To efficiently estimate these traces, we integrate a well-known Monte Carlo Tree Search procedure (UCT) into a model based RL architecture. Test results in three known RL domains illustrate emotion dynamics, dependencies on policy and environmental stochasticity, and plausibility in individual Pacman game settings. Our models enable agents to naturally elicit hope and fear during learning, and moreover, explain what anticipated event caused this.
AB - Social agents and robots will require both learning and emotional capabilities to successfully enter society. This paper connects both challenges, by studying models of emotion generation in sequential decision-making agents. Previous work in this field has focussed on model-free reinforcement learning (RL). However, important emotions like hope and fear need anticipation, which requires a model and forward simulation. Taking inspiration from the psychological Belief-Desire Theory of Emotions (BDTE), our work specifies models of hope and fear based on best and worst forward traces. To efficiently estimate these traces, we integrate a well-known Monte Carlo Tree Search procedure (UCT) into a model based RL architecture. Test results in three known RL domains illustrate emotion dynamics, dependencies on policy and environmental stochasticity, and plausibility in individual Pacman game settings. Our models enable agents to naturally elicit hope and fear during learning, and moreover, explain what anticipated event caused this.
UR - https://www.scopus.com/pages/publications/85006164606
UR - https://www.scopus.com/pages/publications/85006164606#tab=citedBy
M3 - Conference contribution
T3 - IJCAI International Joint Conference on Artificial Intelligence
SP - 848
EP - 854
BT - Proceedings of the 25th International Joint Conference on Artificial Intelligence, IJCAI 2016
A2 - Kambhampati, S.
PB - International Joint Conferences on Artificial Intelligence
T2 - 25th International Joint Conference on Artificial Intelligence, IJCAI 2016
Y2 - 9 July 2016 through 15 July 2016
ER -