fbpx
4.39 de 5
4.39
529 reseñas sobre Udemy

Reinforcement Learning beginner to master – AI in Python

Build Artificial Intelligence (AI) agents using Deep Reinforcement Learning and PyTorch: A2C, REINFORCE, DQN, etc.
Instructor:
Escape Velocity Labs
4.153 estudiantes matriculados
English [Auto]
Understand the Reinforcement Learning paradigm and the tasks that it's best suited to solve.
Understand the process of solving a cognitive task using Reinforcement Learning
Understand the different approaches to solving a task using Reinforcement Learning and choose the most fitting
Implement Reinforcement Learning algorithms completely from scratch
Fundamentally understand the learning process for each algorithm
Debug and extend the algorithms presented
Understand and implement new algorithms from research papers

This is the most complete Reinforcement Learning course on Udemy. In it you will learn the basics of Reinforcement Learning, one of the three paradigms of modern artificial intelligence. You will implement from scratch adaptive algorithms that solve control tasks based on experience. You will also learn to combine these algorithms with Deep Learning techniques and neural networks, giving rise to the branch known as Deep Reinforcement Learning.

This course will give you the foundation you need to be able to understand new algorithms as they emerge. It will also prepare you for the next courses in this series, in which we will go much deeper into different branches of Reinforcement Learning and look at some of the more advanced algorithms that exist.

The course is focused on developing practical skills. Therefore, after learning the most important concepts of each family of methods, we will implement one or more of their algorithms in jupyter notebooks, from scratch.

This course is divided into three parts and covers the following topics:

Part 1 (Tabular methods):

– Markov decision process

– Dynamic programming

– Monte Carlo methods

– Time difference methods (SARSA, Q-Learning)

– N-step bootstrapping

Part 2 (Continuous state spaces):

– State aggregation

– Tile Coding

Part 3 (Deep Reinforcement Learning):

– Deep SARSA

– Deep Q-Learning

– REINFORCE

– Advantage Actor-Critic / A2C (Advantage Actor-Critic / A2C method)

Welcome module

1
[IMPORTANT] English captions available for sections 1-4
2
Welcome

Advanced Reinforcement Learning in Python: from DQN to SAC

https://www.udemy.com/course/advanced-reinforcement/?referralCode=2C96ADF61C80DD7FD392


Advanced Reinforcement Learning in Python: cutting-edge DQNs

https://www.udemy.com/course/advanced-deep-qnetworks/?referralCode=7430E30376CCFEB8BEE9

3
Reinforcement Learning series
4
Course structure
5
Environment setup [Important]
6
Setup

Link to the code repository:

https://github.com/escape-velocity-labs/beginner_master_rl

7
Complete code

The Markov decision process (MDP)

1
Elements common to all control tasks
2
The Markov decision process (MDP)
3
Types of Markov decision process
4
Trajectory vs episode
5
Reward vs Return
6
Discount factor
7
Policy
8
State values v(s) and action values q(s,a)
9
Bellman equations
10
Solving a Markov decision process
11
Setup - MDP in code
12
MDP in code - Part 1
13
MDP in code - Part 2

Dynamic Programming

1
Introduction to Dynamic Programming
2
Value iteration
3
Setup - Value iteration
4
Coding - Value iteration 1
5
Coding - Value iteration 2
6
Coding - Value iteration 3
7
Coding - Value iteration 4
8
Coding - Value iteration 5
9
Policy iteration
10
Setup - Policy iteration
11
Coding - Policy iteration 1
12
Policy evaluation
13
Coding - Policy iteration 2
14
Policy Improvement
15
Coding - Policy iteration 3
16
Coding - Policy iteration 4
17
Policy iteration in practice
18
Generalized Policy Iteration (GPI)

Monte Carlo methods

1
Monte Carlo methods
2
Solving control tasks with Monte Carlo methods
3
On-policy Monte Carlo control
4
Setup - On-policy Monte Carlo control
5
Coding - On-policy Monte Carlo control 1
6
Coding - On-policy Monte Carlo control 2
7
Coding - On-policy Monte Carlo control 3
8
Setup - Constant alpha Monte Carlo
9
Coding - Constant alpha Monte Carlo
10
Off-policy Monte Carlo control
11
Setup - Off-policy Monte Carlo control
12
Coding - Off-policy Monte Carlo 1
13
Coding - Off-policy Monte Carlo 2
14
Coding - Off-policy Monte Carlo 3

Temporal difference methods

1
Temporal difference methods
2
Solving control tasks with temporal difference methods
3
Monte Carlo vs temporal difference methods
4
SARSA
5
Setup - SARSA
6
Coding - SARSA 1
7
Coding - SARSA 2
8
Q-Learning
9
Setup - Q-Learning
10
Coding - Q-Learning 1
11
Coding - Q-Learning 2
12
Advantages of temporal difference methods

N-step bootstrapping

1
N-step temporal difference methods
2
Where do n-step methods fit?
3
Effect of changing n
4
N-step SARSA
5
N-step SARSA in action
6
Setup - n-step SARSA
7
Coding - n-step SARSA

Continuous state spaces

1
Setup - Classic control tasks
2
Coding - Classic control tasks
3
Working with continuous state spaces
4
State aggregation
5
Setup - Continuous state spaces
6
Coding - State aggregation 1
7
Coding - State aggregation 2
8
Coding - State aggregation 3
9
Tile coding
10
Coding - Tile coding 1
11
Coding - Tile coding 2
12
Coding - Tile coding 3

Brief introduction to neural networks

1
Function approximators
2
Artificial Neural Networks
3
Artificial Neurons
4
How to represent a Neural Network
5
Stochastic Gradient Descent
6
Neural Network optimization

Deep SARSA

1
Deep SARSA
2
Neural Network optimization (Deep Q-Network)
4.4
4.4 de 5
Calificaciones 529

Calificación Detallada

5 estrellas
296
4 estrellas
162
3 estrellas
50
2 estrellas
6
1 estrellas
15
f8fedb16fddc14fed5818c122b002c5b
Garantía de devolución de dinero de 30 días

Incluye

11 horas de video a pedido
20 artículos
Acceso completo de por vida
Acceso en el móvil y en la televisión
Certificado de finalización
Reinforcement Learning beginner to master – AI in Python
Precio:
$69.99 $15
bubble_bg_popup.png

Descarga las Herramientas Gratis