RL gym from Sutton

Small reinforcement-learning environments and experiments grounded in Sutton-style examples.

Status: active · GitHub · Demo

What it is

A Sutton-and-Barto-aligned reinforcement-learning playground with tabular SARSA/Q-learning, gridworld and Windy Gridworld environments, value and policy overlays, return/length/TD-error plots, greedy rollout playback, and optional Gymnasium experiments.

GitHub