State transition
Taking an action , the agent move from one state to another.
- if we take action , then
For forbidden area, at state , if take action , maybe 2 case here:
- case 1: accessible but with penalty,
- case 2: inaccessible,
Consider the first case later, which is more general and challenging.
We can use tabular representation to describe state transition
Use class
dict
in python maybe convenient to describe it, but this can only represent determinisitic cases.
- Intuition: At state , if we choose action , the next state is
- Math:
- the case is determinisitic, the state transition could be stochastic
- also we can use
dict
to describe it in python