Proof of the fixed point equation
Assume you have , prove the iterative form is apporx to the fix point :
Define the error:
Such that:
Because we have , and we know that
Hence we have:
1 min read
Assume you have vπ=rπ+γPπvπ, prove the iterative form is apporx to the fix point vπ:
vk+1=rπ+γPπvkDefine the error:
δk⇒vk=vk−vπ=δk+vπSuch that:
vk+1⇒δk+1+vπ=rπ+γPπvk=rπ+γPπ(δk+vπ)Because we have vπ=rπ+γPπvπ, and we know that γ<1,[Pπ]ij<1
δk+1⇒k→∞limδk=γPπδk=γkPπkδ0=0Hence we have:
k→∞limvk=k→∞lim(δk+vπ)=vπ