Introduction to Reinforcement Learning
Course
90 students already enrolled
- How agents learn through trial and error using rewards and feedback.
- How to model environments with Markov decision processes and solve basic decision problems.
- The role of exploration in learning, through the lens of multi-armed bandits.
- Different learning strategies: dynamic programming, Monte Carlo methods, and temporal difference learning.
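To make the exploration theme concrete, here is a minimal sketch of an epsilon-greedy agent on a three-armed bandit. It is an illustration, not course material: the function name `run_epsilon_greedy`, the arm means, the Gaussian reward noise, and the fixed seed are all assumptions chosen for this example.

```python
import random


def run_epsilon_greedy(true_means, epsilon=0.1, steps=5000, seed=0):
    """Play a Gaussian multi-armed bandit with an epsilon-greedy rule.

    true_means is a hypothetical list of expected rewards, one per arm.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # how often each arm was pulled
    estimates = [0.0] * n_arms   # running average reward per arm

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: try a random arm.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm that currently looks best.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # Assumed reward model: noisy sample around the arm's true mean.
        reward = rng.gauss(true_means[arm], 1.0)

        # Incremental average update of the value estimate.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

    return estimates, counts


estimates, counts = run_epsilon_greedy([0.2, 0.5, 1.5])
print("pulls per arm:", counts)
print("estimated values:", [round(e, 2) for e in estimates])
```

With a small epsilon the agent spends most of its pulls on the arm it believes is best, while the occasional random pull keeps the other estimates from going stale.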
There are 5 modules in this course
Reinforcement Learning (RL) is a powerful branch of machine learning focused on training intelligent agents through interaction with their environment. In this course, you'll learn how agents gradually discover effective behaviors through trial and error. Beginning with core concepts like Markov decision processes and multi-armed bandits, you'll work your way through dynamic programming, Monte Carlo methods, and temporal difference learning.
- What is RL?
- RL vs Other Learning Paradigms
- Markov Decision Process
- Episodes and Returns
- Model, Policy, and Values
- Exploration vs Exploitation
- Gymnasium Basics
- Challenge: Setting Up an Environment
- What is Dynamic Programming?
- Bellman Equations
- Optimality Conditions
- Policy Evaluation
- Policy Improvement
- Generalized Policy Iteration
- Policy Iteration
- Value Iteration
- Challenge: Dynamic Programming
- What are Monte Carlo Methods?
- Value Function Estimation
- Monte Carlo Control
- Exploration Approaches
- On-Policy Monte Carlo Control
- Off-Policy Monte Carlo Control
- Incremental Implementations
- Challenge: Monte Carlo Methods
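As a taste of the dynamic programming topics listed above, the following is a minimal sketch of value iteration on a tiny Markov decision process. The four-state chain, the `step` function, the reward of 1 for reaching the last state, and the discount factor of 0.9 are hypothetical choices made for this example and are not taken from the course.

```python
def step(state, action):
    """Deterministic 4-state chain (hypothetical); state 3 is terminal."""
    if state == 3:
        return 3, 0.0  # terminal state: stay put, no further reward
    next_state = min(max(state + action, 0), 3)
    reward = 1.0 if next_state == 3 else 0.0
    return next_state, reward


def q_value(state, action, values, gamma):
    """One-step lookahead: immediate reward plus discounted next-state value."""
    next_state, reward = step(state, action)
    return reward + gamma * values[next_state]


def value_iteration(n_states=4, actions=(-1, 1), gamma=0.9, tol=1e-8):
    values = [0.0] * n_states
    while True:
        delta = 0.0
        for s in range(n_states):
            # Bellman optimality backup: take the best action's value.
            best = max(q_value(s, a, values, gamma) for a in actions)
            delta = max(delta, abs(best - values[s]))
            values[s] = best
        if delta < tol:
            break
    # Greedy policy for the non-terminal states.
    policy = [
        max(actions, key=lambda a: q_value(s, a, values, gamma))
        for s in range(n_states - 1)
    ]
    return values, policy


values, policy = value_iteration()
print([round(v, 2) for v in values])  # [0.81, 0.9, 1.0, 0.0]
print(policy)                         # [1, 1, 1], i.e. always move right
```

Each state's value is the reward for reaching the goal, discounted once per extra step needed to get there, so the greedy policy moves right from every non-terminal state.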
Chosen by students from the very best schools
Why people choose Codefinity for their careers

Kwizera Mugisha
The teaching methodology at Codefinity is excellent, and I particularly appreciate how it has prepared me to handle real-world coding problems. Currently, I am delving into Node.js and eagerly anticipate building full-stack projects that integrate all the knowledge I have gained.

Sherry Barnes-Fox
My first course was 4 hours, and I did it in a few days, "nugget-style." The instructions are very clear and easy to understand. There is even a hint to help you get the answer, and if you still cannot get the answer, then you can display the answer. I love the learning style that is used; it engages me.

Bill Wagner
I have really liked the browser-based lessons that allow me to code within the lesson. The RUN button allows me to test the code I write before submitting for a grade.

Stephanie Chan
As I went through the first course of the Python track, I liked the way the course was laid out (in easy and digestible modules) with little exercises at the end of each concept.

Daniel Chinea
I have gained a lot of practical and logical thinking skills, along with patience for myself and confidence in myself that I can learn programming.

Steve Bruening
The learning was progressive and made it easy to follow along and make progress. I could feel my skills increasing and building on each other as the course went along.
Recommended if you are interested in learning Python
Embrace the fascination of tech skills! Our AI assistant provides real-time feedback, personalized hints, and error explanations, so you can learn with confidence.
With workspaces, you can create and share projects directly on our platform. We have prepared templates for your convenience.
Take control of your career development and start your journey toward mastering the latest technology.
Real-world projects strengthen your portfolio and demonstrate practical skills that impress potential employers.

Full access to the catalog
A subscription unlocks this course and our entire catalog of projects and skills. Your subscription also includes: