Date | Topic | Slides | Readings | Assignment |
---|---|---|---|---|
Jan 6 | Class Intro | Slides | Optional: Python Notes (Alan Kuntz) | Optional: Python Tutorial (Berkeley AI Class) |
Jan 8 | Behavioral Cloning | Slides | Optional: Behavioral Cloning from Observation, DAgger, ThriftyDAgger | Behavior Cloning in PyTorch (due Friday Jan 17) |
Jan 13 | Intro to Advanced Behavior Cloning | Slides | Choose one and submit reading report before class: Implicit Behavioral Cloning, Action Chunking Transformer, Diffusion Policy | |
Jan 15 | More Advanced Behavior Cloning | Slides | ||
Jan 22 | Multi-Armed Bandits and Evaluative Feedback | Slides | Sutton and Barto 2.1-2.5 | Multi-Armed Bandits (due Fri Jan 31) |
Jan 27 | More Bandits | Slides | ||
Jan 29 | Intro to Markov Decision Processes | Slides | ||
Feb 3 | Solving MDPs | |||
Feb 5 | Value-Based RL | |||
Feb 10 | Policy-Based RL | |||
Feb 12 | AlphaGo and AlphaZero | |||
Feb 19 | Advanced Deep RL | |||
Feb 24 | More Advanced Deep RL | |||
Feb 26 | Multi-Agent RL | |||
Mar 3 | RL from Human Feedback (RLHF) | |||
Mar 5 | More RL from Human Feedback (RLHF) |
Here you can find supplementary materials, links, etc.
PyTorch Tutorials