ClassicΒΆ Standard control benchmarks adapted for population-scale parallelism. Environment Description CartPole Inverted pendulum balancing Pendulum Swing-up control