Markov decision process and dynamic programming

42