Reinforcement Learning🔗

Reinforcement Learning section of the Algorithms in Machine Learning class at ISAE-Supaero

Syllabus🔗

This class covers an introduction to Reinforcement Learning (RL) in 18 hours, over 6 sessions. It aims to provide both a solid theoretical foundation and a quick learning curve towards current Deep RL algorithms. It starts with the fundamental notions underlying RL: Markov Decision Processes, model-based resolution approaches including Dynamic Programming, sample-based resolution of the Bellman equation. This leads to the identification of the three bottomline challenges in RL: function approximation, the exploration/exploitation trade-off and the search for optimality. This provides perspective to the following classes that introduce methods designed to tackle these challenges, including Deep RL methods. By the end of the class, students should be able to understand the literature on RL, implement key algorithms, and anticipate the difficulties of applying RL to various problems.

Class material🔗

The class is split into a series of notebooks that serve as lecture material, textbook and exercice book.

Open the notebooks in Binder:

Additional resources🔗

Great books available online:
Reinforcement Learning, an introduction
Algorithms for Reinforcement Learning
An introduction to Deep Reinforcement Learning

FAQ on installing Gym for Mac users

Class schedule🔗

Schedule
MDPs and their resolution	08h30 - 11h45	03/02/2021	RL intuitions, Markov Decision Processes, Dynamic Programming
Sample-based policy search	08h30 - 11h45	09/02/2021	Formulations of RL algorithms, Temporal Differences, Q-learning, the 3 bottlenecks of RL
Value function approximation	13h00 - 16h15	09/02/2021	Linear approximations, Deep Q-Networks
Policy gradients	09h00 - 12h15	15/02/2021	PG and Deep PG methods
MCTS	09h00 - 12h15	17/02/2021	Monte Carlo Tree Search
open	13h45 - 17h00	17/02/2021	open session on an RL challenge