Introduction to Reinforcement Learning
Kurs
90 Studenter redan inskrivna- How agents learn through trial and error using rewards and feedback.
- How to model environments with Markov decision processes and solve basic decision problems.
- The role of exploration in learning, through the lens of multi-armed bandits.
- Different learning strategies: dynamic programming, Monte Carlo methods, and temporal difference learning.
Utrusta ditt företag med banbrytande Data och AI kompetens.
Dela det på sociala medier och vid din prestationsutvärdering
Det finns 5 moduler i denna kurs
Reinforcement Learning (RL) is a powerful branch of machine learning focused on training intelligent agents through interaction with their environment. In this course, you'll learn how agents gradually discover effective behaviors through trial and error. Beginning with core concepts like Markov decision processes and multi-armed bandits, you'll work your way through dynamic programming, Monte Carlo methods, and temporal difference learning.- What is RL?Förhandsgranska
- RL vs Other Learning ParadigmsFörhandsgranska
- Markov Decision ProcessFörhandsgranska
- Episodes and ReturnsFörhandsgranska
- Model, Policy, and ValuesFörhandsgranska
- Exploration vs ExploitationFörhandsgranska
- Gymnasium BasicsFörhandsgranska
- Challenge: Setting Up an EnvironmentFörhandsgranska
- What is Dynamic Programming?Förhandsgranska
- Bellman EquationsFörhandsgranska
- Optimality ConditionsFörhandsgranska
- Policy EvaluationFörhandsgranska
- Policy ImprovementFörhandsgranska
- Generalized Policy IterationFörhandsgranska
- Policy IterationFörhandsgranska
- Value IterationFörhandsgranska
- Challenge: Dynamic ProgrammingFörhandsgranska
- What are Monte Carlo Methods?Förhandsgranska
- Value Function EstimationFörhandsgranska
- Monte Carlo ControlFörhandsgranska
- Exploration ApproachesFörhandsgranska
- On-Policy Monte Carlo ControlFörhandsgranska
- Off-Policy Monte Carlo ControlFörhandsgranska
- Incremental ImplementationsFörhandsgranska
- Challenge: Monte Carlo MethodsFörhandsgranska
Valt av studenter från de allra bästa skolorna
Varför folk väljer Codefinity för sin karriär

Kwizera Mugisha
The teaching methodology at Codefinity is excellent, and I particularly appreciate how it has prepared me to handle real-world coding problems. Currently, I am delving into Node.js and eagerly anticipate building full-stack projects that integrate all the knowledge I have gained.

Sherry Barnes-Fox
My first course was 4 hours, I did it in a few days, "nugget-style. The instructions are very clear and easy to understand. There is even a hint to help you get the answer, and if you still cannot get the answer, then you can display the answer. I love the learning style that is used, it engages me.

Bill Wagner
I have really liked the browser-based lessons that allow me to code within the lesson. The RUN button allows me to test the code I write before submitting for a grade.

Stephanie Chan
As I went through the first course of the Python track, I liked the way the course was lay out (in easy and digestible modules) with little exercises at the end of each concept.

Daniel Chinea
I have gained a lot of practical and logical thinking skills, along with patience for myself and confidence in myself that I can learn programming.

Steve Bruening
The learning was progressive and made it easy to follow along and make progress. I could feel my skills increasing and building on each other as the course went along.
Rekommenderas om du är intresserad av att lära digPython
Omfamna fascinationen för tekniska färdigheter! Vår AI-assistent ger feedback i realtid, personliga tips och felanalyser, vilket gör att du kan lära dig med självförtroende.
Med Arbetsytor kan du skapa och dela projekt direkt på vår plattform. Vi har förberett mallar för din bekvämlighet
Ta kontroll över din karriärutveckling och inled din resa mot att bemästra de senaste teknologierna
Verkliga projekt lyfter din portfölj och visar praktiska färdigheter för att imponera på potentiella arbetsgivare




Full tillgång till katalogen
Ett abonnemang öppnar denna kurs och hela vår katalog av projekt och färdigheter.Ditt abonnemang inkluderar även:
Vanliga frågor
Har du fler frågor?
Formulera din fråga här