Modules / Lectures


New Assignments
Module NameDownload
Week_01_Assignment_01Week_01_Assignment_01
Week_02_Assignment_02Week_02_Assignment_02
Week_03_Assignment_03Week_03_Assignment_03
Week_04_Assignment_04Week_04_Assignment_04
Week_05_Assignment_05Week_05_Assignment_05
Week_06_Assignment_06Week_06_Assignment_06
Week_07_Assignment_07Week_07_Assignment_07
Week_08_Assignment_08Week_08_Assignment_08
Week_09_Assignment_09Week_09_Assignment_09
Week_10_Assignment_10Week_10_Assignment_10
Week_11_Assignment_11Week_11_Assignment_11
Week_12_Assignment_12Week_12_Assignment_12


Sl.No Chapter Name MP4 Download
1Tutorial 1 - Probability Basics 1Download
2Tutorial 1-Probability basics2Download
3Tutorial 2-Linear algebra-1Download
4Tutorial 2-Linear algebra-2Download
5Introduction to RLDownload
6RL Framework and applicationsDownload
7Introduction to Immediate RLDownload
8Bandit OptimalitiesDownload
9Value function based methodsDownload
10UCB 1Download
11Concentration BoundsDownload
12UCB 1 TheoremDownload
13PAC BoundsDownload
14Median EliminationDownload
15Thompson SamplingDownload
16Policy SearchDownload
17REINFORCEDownload
18Contextual BanditsDownload
19Full RL IntroductionDownload
20Returns, Value Functions and MDPsDownload
21MDP ModellingDownload
22Bellman EquationDownload
23Bellman Optimality EquationDownload
24Cauchy Sequence and Green's EquationDownload
25Banach Fixed Point TheoremDownload
26Convergence ProofDownload
27Lpi ConvergenceDownload
28Value IterationDownload
29Policy IterationDownload
30Dynamic ProgrammingDownload
31Monte CarloDownload
32Control in Monte CarloDownload
33Off Policy MCDownload
34UCTDownload
35TD(0)Download
36TD(0) ControlDownload
37Q-LearningDownload
38AfterstateDownload
39Eligibility TracesDownload
40Backward View of Eligibility TracesDownload
41Eligibility Trace ControlDownload
42Thompson Sampling RecapDownload
43Function ApproximationDownload
44Linear ParameterizationDownload
45State Aggregation MethodsDownload
46Function Approximation and Eligibility TracesDownload
47LSTD and LSTDQDownload
48LSPI and Fitted QDownload
49DQN and Fitted Q-IterationDownload
50Policy Gradient ApproachDownload
51Actor Critic and REINFORCEDownload
52REINFORCE (cont'd)Download
53Policy Gradient with Function ApproximationDownload
54Hierarchical Reinforcement LearningDownload
55Types of OptimalityDownload
56Semi Markov Decision ProcessesDownload
57OptionsDownload
58Learning with OptionsDownload
59Hierarchical Abstract MachinesDownload
60MAXQDownload
61MAXQ Value Function DecompositionDownload
62Option DiscoveryDownload
63POMDP IntroductionDownload
64Solving POMDPDownload
65Live SessionDownload

Sl.No Chapter Name English
1Tutorial 1 - Probability Basics 1Download
To be verified
2Tutorial 1-Probability basics2Download
To be verified
3Tutorial 2-Linear algebra-1Download
To be verified
4Tutorial 2-Linear algebra-2Download
To be verified
5Introduction to RLDownload
To be verified
6RL Framework and applicationsDownload
To be verified
7Introduction to Immediate RLDownload
To be verified
8Bandit OptimalitiesDownload
To be verified
9Value function based methodsDownload
To be verified
10UCB 1Download
To be verified
11Concentration BoundsDownload
To be verified
12UCB 1 TheoremDownload
To be verified
13PAC BoundsDownload
To be verified
14Median EliminationDownload
To be verified
15Thompson SamplingDownload
To be verified
16Policy SearchDownload
To be verified
17REINFORCEDownload
To be verified
18Contextual BanditsDownload
To be verified
19Full RL IntroductionDownload
To be verified
20Returns, Value Functions and MDPsDownload
To be verified
21MDP ModellingDownload
To be verified
22Bellman EquationDownload
To be verified
23Bellman Optimality EquationDownload
To be verified
24Cauchy Sequence and Green's EquationDownload
To be verified
25Banach Fixed Point TheoremDownload
To be verified
26Convergence ProofDownload
To be verified
27Lpi ConvergenceDownload
To be verified
28Value IterationDownload
To be verified
29Policy IterationDownload
To be verified
30Dynamic ProgrammingDownload
To be verified
31Monte CarloDownload
To be verified
32Control in Monte CarloDownload
To be verified
33Off Policy MCDownload
To be verified
34UCTDownload
To be verified
35TD(0)Download
To be verified
36TD(0) ControlDownload
To be verified
37Q-LearningDownload
To be verified
38AfterstateDownload
To be verified
39Eligibility TracesDownload
To be verified
40Backward View of Eligibility TracesDownload
To be verified
41Eligibility Trace ControlDownload
To be verified
42Thompson Sampling RecapDownload
To be verified
43Function ApproximationDownload
To be verified
44Linear ParameterizationDownload
To be verified
45State Aggregation MethodsDownload
To be verified
46Function Approximation and Eligibility TracesDownload
To be verified
47LSTD and LSTDQDownload
To be verified
48LSPI and Fitted QDownload
To be verified
49DQN and Fitted Q-IterationDownload
To be verified
50Policy Gradient ApproachDownload
To be verified
51Actor Critic and REINFORCEDownload
To be verified
52REINFORCE (cont'd)Download
To be verified
53Policy Gradient with Function ApproximationDownload
To be verified
54Hierarchical Reinforcement LearningDownload
To be verified
55Types of OptimalityDownload
To be verified
56Semi Markov Decision ProcessesDownload
To be verified
57OptionsDownload
To be verified
58Learning with OptionsDownload
To be verified
59Hierarchical Abstract MachinesDownload
To be verified
60MAXQDownload
To be verified
61MAXQ Value Function DecompositionDownload
To be verified
62Option DiscoveryDownload
To be verified
63POMDP IntroductionDownload
To be verified
64Solving POMDPDownload
To be verified
65Live SessionPDF unavailable


Sl.No Language Book link
1EnglishNot Available
2BengaliNot Available
3GujaratiNot Available
4HindiNot Available
5KannadaNot Available
6MalayalamNot Available
7MarathiNot Available
8TamilNot Available
9TeluguNot Available