Course Description

Sequential decision-making is one of the major topics in machine learning. From experience, the task is to decide the sequence of actions to perform in an uncertain environment in order to achieve some goals that may not necessarily seem beneficial in near future but are optimal for getting better long term reward. Reinforcement learning (RL) is a paradigm that proposes a formal framework to this problem. The aim of the course will be to familiarize the students with the basic concepts as well as with the state-of-the-art research literature in deep reinforcement learning. After completion the students will be able to (a) structure a reinforcement learning problem, (b) understand and apply basic RL algorithms for simple sequential decision making problems in uncertain conditions. (c) evaluate the performance of the solution (d) interpret state-of-the-art RL research and communicate their results.

Instructor

Class Time and Location

Thursday (3:00pm-4:55pm) and Friday (3:00pm-3:55pm)
Class Room 1, Takshashila Building, Second Floor

Office Hours and Email

Abir Das (abir@cse.iitkgp.ac.in)
Buridi Aditya (buridiaditya@gmail.com)
Vishal Gupta (ervishal@iitkgp.ac.in )

Office: Takshashila Building (Second Floor)
Hours: Friday 4:15pm-5:15pm (Just after class)

Have questions?

We welcome your questions about the course including lectures, assignments, projects, and logistics on Piazza. Email the TA or instructor about questions that specifically pertain to you as an individual.