https://studiegids.vu.nl/en/courses/2025-2026/XM_0093Dynamic Programming (DP) and Reinforcement Learning (RL) are fields concerned with decision making over time. After completing this course, the studentis familiar with the commonly used algorithms for solving dynamic optimization problems; understands the main features of these algorithms, their strengths and weaknesses including their convergence properties; can implement them in an appropriate language; can model real-world decision problems into a DP or RL framework and solve moderately sized problems; has knowledge of the historical development of DP and RL and has an idea of possible future developments.This course is concerned with reinforcement learning and its origin dynamic programming. These are fields dealing with goal-directed decision making over time, such as finding your way in an unknown area, playing a game or pricing airline tickets. We look at these areas from different angles:we deal with full-information "planning" problems, but also with partial-information "learning" problems we consider different algorithms, some of which are guaranteed to find the best solution, but also heuristics we consider high-dimensional problems (such as games) and methods to solve them we look at small toy problems to understand algorithms and sharpen our intuition, but also bigger problems for which we learn how to implement algorithms (in python) we look at different types of applications, both from AI (search problems, games) and ORLectures and practical work integratedProgramming exercises and final exam. The 3 assignments each count for 10% and the exam for 70%. The minimal passing grade for the exam is 5.0.Slides and lecture notesmBA, mAI, mCS, mBa-D, mMathProgramming experience in python