Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Learn Challenge: Dynamic Programming | Dynamic Programming
Introduction to Reinforcement Learning
course content

Course Content

Introduction to Reinforcement Learning

Introduction to Reinforcement Learning

1. RL Core Theory
2. Multi-Armed Bandit Problem
3. Dynamic Programming
4. Monte Carlo Methods
5. Temporal Difference Learning

book
Challenge: Dynamic Programming

Challenge

In this challenge, you will write your own implementations of policy iteration and value iteration, visually observe the agent's behavior, and analyze value function and policy plots.

question-icon

Enter the parts of the key

1.
2.

3.
Everything was clear?

How can we improve it?

Thanks for your feedback!

SectionΒ 3. ChapterΒ 9
We're sorry to hear that something went wrong. What happened?
some-alt