01 Basic Concepts

A grid-world example


  • How do we define a “good” path? It avoids forbidden cells, unnecessary detours, and collisions with the boundary.

Summary

Using a grid-world example, we learn the key concepts:

  • state
  • action
  • state transition, state transition probability
  • reward, reward probability
  • trajectory, episode, return, discounted return
  • Markov Decision Process
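To make these concepts concrete, here is a minimal sketch of a grid world in Python. Everything specific in it is an assumption made for illustration, not something fixed by these notes: the 3x3 layout, the positions of the forbidden and target cells, the reward values (-1 for hitting the boundary or entering a forbidden cell, +1 for reaching the target, 0 otherwise), and the names `step`, `rollout`, and `discounted_return`. Transitions are deterministic here, which is the special case where each state transition probability is 0 or 1.

```python
# Hypothetical 3x3 grid world. Layout, reward values, and function names
# are illustrative assumptions, not taken from the notes.
N = 3                      # grid is N x N; a state is a (row, col) cell
FORBIDDEN = {(1, 1)}       # assumed forbidden cell
TARGET = (2, 2)            # assumed target cell

# Actions, each expressed as a (row, col) displacement.
ACTIONS = {
    "up": (-1, 0),
    "down": (1, 0),
    "left": (0, -1),
    "right": (0, 1),
    "stay": (0, 0),
}


def step(state, action):
    """Deterministic state transition: return (next_state, reward)."""
    d_row, d_col = ACTIONS[action]
    row, col = state[0] + d_row, state[1] + d_col
    if not (0 <= row < N and 0 <= col < N):
        return state, -1.0          # hit the boundary: stay put, penalty
    next_state = (row, col)
    if next_state in FORBIDDEN:
        return next_state, -1.0     # entering a forbidden cell is penalized
    if next_state == TARGET:
        return next_state, 1.0      # reaching the target is rewarded
    return next_state, 0.0


def rollout(start, actions):
    """Follow a fixed action sequence; return the trajectory and its rewards."""
    state, trajectory, rewards = start, [start], []
    for action in actions:
        state, reward = step(state, action)
        trajectory.append(state)
        rewards.append(reward)
    return trajectory, rewards


def discounted_return(rewards, gamma):
    """Compute r_1 + gamma * r_2 + gamma^2 * r_3 + ... by backward accumulation."""
    total = 0.0
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total


# A path that avoids the forbidden cell versus one that passes through it.
safe_traj, safe_rewards = rollout((0, 0), ["right", "right", "down", "down"])
risky_traj, risky_rewards = rollout((0, 0), ["down", "right", "down", "right"])
print(safe_traj, discounted_return(safe_rewards, 0.5))
print(risky_traj, discounted_return(risky_rewards, 0.5))
```

Both action sequences reach the target in four steps, but the one that crosses the forbidden cell collects a penalty along the way and ends with a lower discounted return. Under these assumed rewards, that difference is one way the return can express which trajectory is "good".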

Later

These are fundamental concepts that are widely used in later chapters.

Next: 02 Bellman Equation