...

Reinforcement Learning Algorithms: Expected SARSA

In this post, we'll extend our toolset for Reinforcement Learning by considering a new temporal difference (TD) method called Expected SARSA. In...