Transfer Learning and Curriculum Learning in Sokoban

Zhao Yang, Mike Preuss, Aske Plaat

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

Transfer learning can speed up training in machine learning, and is regularly used in classification tasks. It reuses prior knowledge from other tasks to pre-train networks for new tasks. In reinforcement learning, learning actions for a behavior policy that can be applied to new environments is still a challenge, especially for tasks that involve much planning. Sokoban is a challenging puzzle game. It has been used widely as a benchmark in planning-based reinforcement learning. In this paper, we show how prior knowledge improves learning in Sokoban tasks. We find that reusing feature representations learned previously can accelerate learning new, more complex, instances. In effect, we show how curriculum learning, from simple to complex tasks, works in Sokoban. Furthermore, feature representations learned in simpler instances are more general, and thus lead to positive transfers towards more complex tasks, but not vice versa. We have also studied which part of the knowledge is most important for transfer to succeed, and identify which layers should be used for pre-training (Codes we used for this work can be found at https://github.com/yangzhao-666/TLCLS ).
Original languageEnglish
Title of host publicationArtificial Intelligence and Machine Learning - 33rd Benelux Conference on Artificial Intelligence, BNAIC/Benelearn 2021, Revised Selected Papers
EditorsL.A. Leiva, C. Pruski, R. Markovich, A. Najjar, C. Schommer
PublisherSpringer Science and Business Media Deutschland GmbH
Pages187-200
ISBN (Print)9783030938413
DOIs
Publication statusPublished - 2022
Externally publishedYes
Event33rd Benelux Conference on Artificial Intelligence, BNAIC/ BENELEARN 2021 - Esch-sur-Alzette, Luxembourg
Duration: 10 Nov 202112 Nov 2021

Publication series

NameCommunications in Computer and Information Science
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference33rd Benelux Conference on Artificial Intelligence, BNAIC/ BENELEARN 2021
Country/TerritoryLuxembourg
CityEsch-sur-Alzette
Period10/11/2112/11/21

Funding

The financial support to Zhao Yang is from the China Scholarship Council (CSC). Computation support is from ALICE and DSLab. The authors thank Hui Wang, Matthias Müller-Brockhausen, Michiel van der Meer, Thomas Moer-land and all members from the Leiden Reinforcement Learning Group for helpful discussions. Acknowledgement. The financial support to Zhao Yang is from the China Scholarship Council (CSC). Computation support is from ALICE and DSLab. The authors thank Hui Wang, Matthias Müller-Brockhausen, Michiel van der Meer, Thomas Moer-land and all members from the Leiden Reinforcement Learning Group for helpful discussions.

FundersFunder number
DSLab
Leiden Reinforcement Learning Group
China Scholarship Council

    Fingerprint

    Dive into the research topics of 'Transfer Learning and Curriculum Learning in Sokoban'. Together they form a unique fingerprint.

    Cite this