Skip to content

Latest commit

 

History

History

readme.md

Pendulum

Both state and action space are continuous.

Data Collection

Skip if you want to use the given expert data.

  • Install stable-baselines3
    pip install stable-baselines3[extra]

Train Regression policy

run pytorch training on pendulum in colab

Open In Colab

Train gaussian policy

  • predict mean and variance
  • Negative log likelihood loss

bc_pendulum_torch_gaussian.ipynb