Optimal policy

The state value could be used to evaluate if a policy is good or not: if

then is better than

Definition

: A policy is optimal if for all and for any other policy

questions:

  • optimal policy exists?
  • optimal policy unique?
  • policy stochastic or deterministic?
  • how to obtain optimal policy? to solve this, see nextBOE Bellman Optimality Equation