Markov decision process and dynamic programming

66