SMU

Markov reward process + actions

Reward

Policy