Abstract
Enabling reinforcement learning (RL) agents to leverage a knowledge base while learning from experience promises to advance RL in knowledge intensive domains. However, it has proven difficult to leverage knowledge that is not manually tailored to the environment. We propose to use the subclass relationships present in open-source knowledge graphs to abstract away from specific objects. We develop a residual policy gradient method that is able to integrate knowledge across different abstraction levels in the class hierarchy. Our method results in improved sample efficiency and generalisation to unseen objects in commonsense games, but we also investigate failure modes, such as excessive noise in the extracted class knowledge or environments with little class structure.
Original language | English |
---|---|
Title of host publication | Proceedings of the 31st International Joint Conference on Artificial Intelligence (IJCAI 2022) |
Editors | Luc De Raedt |
Publisher | International Joint Conferences on Artificial Intelligence Organization |
Pages | 3050-3056 |
Number of pages | 7 |
ISBN (Electronic) | 9781956792003 |
DOIs | |
Publication status | Published - 2022 |
Event | 31st International Joint Conference on Artificial Intelligence, IJCAI 2022 - Vienna, Austria Duration: 23 Jul 2022 → 29 Jul 2022 |
Publication series
Name | IJCAI International Joint Conference on Artificial Intelligence |
---|---|
ISSN (Print) | 1045-0823 |
Conference
Conference | 31st International Joint Conference on Artificial Intelligence, IJCAI 2022 |
---|---|
Country/Territory | Austria |
City | Vienna |
Period | 23/07/22 → 29/07/22 |
Bibliographical note
Funding Information:This research was (partially) funded by the Hybrid Intelligence Center, a 10-year programme funded by the Dutch Ministry of Education, Culture and Science through the Netherlands Organisation for Scientific Research, https://hybrid-intelligence-centre.nl.
Publisher Copyright:
© 2022 International Joint Conferences on Artificial Intelligence. All rights reserved.
Funding
This research was (partially) funded by the Hybrid Intelligence Center, a 10-year programme funded by the Dutch Ministry of Education, Culture and Science through the Netherlands Organisation for Scientific Research, https://hybrid-intelligence-centre.nl.