Vojtěch Tóth

❯

❯

Symbolic machine learning

❯

Markov decision process

Markov decision process

Feb 16, 20261 min read

Markov reward process + actions

P (s^{'} ∣ s, a) = P (X_{t + 1} = s^{'} ∣ X_{t} = s, A_{t} = a)

Reward

R (s, a) = E [R_{t} ∣ X_{t} = s, A_{t} = a]

Policy

Graph View

Backlinks

Reinforcement learning

Created with Quartz v4.5.2 © 2026

GitHub
Discord Community