Most reinforcement learning methods for the linear quadratic regulator (LQR) assume that an initial stabilizing controller is given. This notebook gives an example of how to compute a stabilizing LQR gain using a variant of policy iteration.
AndyLamperski/stabilizing-policy-iteration
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|