Miguel Morales · 2020 · Link
A hands-on route through RL ideas: value functions, policy learning, and experiments you can run.