• Your selection is empty.

    Register the diplomas, courses or lessons of your choice.

Reinforcement learning

  • School / Prep

    ENSEIRB-MATMECA

Internal code

EI9IS320

Description

This course is an introduction to a branch of machine learning called reinforcement learning (RL). In this course, we'll look at the main models used in RL: multi-armed bandits, Markov decision processes, and their multi-agent and partial observation extensions, both in the dynamic framework and in the function approximation framework (by neural networks in particular). We will study the most important algorithms: value iteration, strategy iteration, Q-learning, DQN (Deep Q-learning). They will be implemented in Python.

Read more

Teaching hours

  • CIIntegrated Courses21h

Syllabus

This course is an introduction to a branch of machine learning called reinforcement learning (RL). In this course, we'll look at the main models used in RL: multi-armed bandits, Markov decision processes, and their multi-agent and partial observation extensions, both in the dynamic framework and in the function approximation framework (by neural networks in particular). We will study the most important algorithms: value iteration, strategy iteration, Q-learning, DQN (Deep Q-learning). They will be implemented in Python.

Read more

Assessment of knowledge

Initial assessment / Main session - Tests

Type of assessmentType of testDuration (in minutes)Number of testsTest coefficientEliminatory mark in the testRemarks
Integral Continuous ControlContinuous control1