Notice: This page requires JavaScript to function properly.
Please enable JavaScript in your browser settings or update your browser.
Aprenda Challenge: Multi-Armed Bandits | Multi-Armed Bandit Problem
Introduction to Reinforcement Learning
course content

Conteúdo do Curso

Introduction to Reinforcement Learning

Introduction to Reinforcement Learning

1. RL Core Theory
2. Multi-Armed Bandit Problem
3. Dynamic Programming
4. Monte Carlo Methods
5. Temporal Difference Learning

book
Challenge: Multi-Armed Bandits

From this point onward, you will receive coding challenges in Jupyter Notebook format. If you know how to properly setup the environment for such notebooks, feel free to do so. However, we recommend you to use Google Colab, as it offers many advantages over plain Jupyter notebooks. Read this article to learn how Colab works.

How to Open the Notebook and Complete the Tasks?

To open the notebook and complete the tasks, you should follow these steps:

  1. Download the notebook: you can find the download button at the bottom of this page;
  2. Open Google Colab: open the website and login with your Google account;
  3. Open the notebook: use File > Upload notebook or File > Open notebook > Upload and choose the file you downloaded. If you want to continue working on your tasks after a break, use File > Open notebook > Recent and find your notebook in that window;
  4. Follow the instructions: the notebook will contain detailed instructions to the tasks. You should read them before proceeding to the tasks;
  5. Receive the key: after solving each task, you will receive a part of the key, necessary to complete the chapter. Enter these parts inside of a quiz on the bottom of the page after you finish the challenge to verify that you completed all tasks.

Challenge

In this challenge, you will compare different approaches to solving the multi-armed bandit problem. You will write your own implementations for algorithms you learned about, and see how they work both in stationary and in dynamic environments.

question-icon

Enter the parts of the key

1.
2.

3.

4.

5.
Tudo estava claro?

Como podemos melhorá-lo?

Obrigado pelo seu feedback!

Seção 2. Capítulo 6
Sentimos muito que algo saiu errado. O que aconteceu?
some-alt