Policy-based Control
\[\newcommand{\vect}[1]{\boldsymbol{#1}}
\newcommand{\dvect}[1]{\dot{\boldsymbol{#1}}}
\newcommand{\ddvect}[1]{\ddot{\boldsymbol{#1}}}
\newcommand{\mat}[1]{\boldsymbol{#1}}\]
Policy based controllers do not aim to follow a singular trajectory. Instead, they define a funtion(the policy) \(\pi\) over the entire state space and return a control signal according to this policy:
\[\vect{u} = \pi(\vect{x})\]
the policy can be either analytically defined by reasoning about the system (e.g. about the system’s energy) or it can be learned with reinforcement learning.