Optimal policy
The state value could be used to evaluate if a policy is good or not: if
then is better than
Definition
: A policy is optimal if for all and for any other policy
questions:
- optimal policy exists?
- optimal policy unique?
- policy stochastic or deterministic?
- how to obtain optimal policy? to solve this, see next⇒BOE Bellman Optimality Equation