Blacksburg – May 2023
Jan Heiland & Peter Benner & Steffen Werner (MPI Magdeburg)
\[\dot x = f(x) + Bu\]
Control of an inverted pendulum
Stabilization of a laminar flow
A general approach would include
\[\begin{align} \dot x -Bu & = f(x) \\ & \approx [A_0+\rho_1(x)A_1+ \dotsm + \rho_r(x) A_r]\, x \end{align}\]
The linear parameter varying (LPV) representation/approximation \[ \dot x \approx \bigl [\Sigma \,\rho_i(x)A_i \bigr]\, x + Bu \] for nonlinear controller comes with
and extensive theory on
Spoiler:
In this talk, we will consider LPV series expansions of control laws.
For linear parameter-varying systems \[ \dot x = A(\rho(x))\,x + Bu \] there exist established methods that provide control laws based one
A major issue: require solutions of coupled LMI systems.
Consider the optimal regulator control problem
\[ \int_0^\infty \|y\|^2 + \alpha \|u\|^2\, \mathsf{d}s \to \min_{(y, u)} \] subject to \[ \dot x = A(\rho(x))\,x+Bu, \quad y=Cx. \]
Theorem (Beeler, Tran, and Banks 2000)
If there exists \(\Pi\) as a function of \(x\) such that \[ \begin{aligned} & \dot{\Pi}(x)+\bigl[\frac{\partial(A(\rho(x)))}{\partial x}\bigr]^T \Pi(x)\\ & \quad+\Pi(x) A(\rho(x))+A^T(\rho(x)) \Pi(x)-\frac{1}{\alpha} \Pi(x) BB^T \Pi(x)=-C^TC . \end{aligned} \]
Then \[u=-\frac{1}{\alpha}B^T\Pi(x)\,x\] is an optimal feedback for the control problem.
In Praxis, parts of the HJB are discarded and we use \(\Pi(x)\) that solely solves the state-dependent Riccati equation (SDRE) \[ \Pi(x) A(\rho(x))+A^T(\rho(x)) \Pi(x)-\frac{1}{\alpha} \Pi(x) BB^T\Pi(x)=-C^TC, \] and the SDRE feedback \[ u=-\frac{1}{\alpha}B^T\Pi(x)\,x. \]
Although the SDRE is an approximation already,
the repeated solve of the Riccati equation is not feasible.
However, for affine LPV systems, a series expansion
enables an efficient approximation at runtime.
We note that \(\Pi\) depends on \(x\) through \(A(\rho(x))\).
Thus, we can consider \(\Pi\) as a function in \(\rho\) and its corresponding multivariate Taylor expansion up to order \(K\) \[\begin{equation} \label{eq:taylor-expansion-P} \Pi (\rho) \approx \Pi (0) + \sum_{1\leq |\beta| \leq K} \rho^{(\beta)}P_{\beta}, \end{equation}\] where
Theorem
If \(A(\rho)\) is affine, i.e. \(A(\rho) = A_0 + \sum_{k=1}^r \rho_k A_k\).
Then the coefficients of the first order Taylor approximation \[ \Pi (\rho) \approx \Pi(0) + \sum_{|\beta| = 1} \rho^{(\beta)}P_{\beta} =: P_0 + \sum_{k=1}^r \rho_k L_k. \] are the solutions to
and, for \(k=1,\dotsc,r\),
Proof
Insert the Taylor expansion of \(\Pi\) and the LPV representation of \(A\) into the SDRE and match the coefficients.
Corollary
The corresponding nonlinear feedback is realized as \[ u = -\frac{1}{\alpha}B^T[P_0 + \sum_{k=1}^r \rho_k(x) L_k]\,x. \]
Cp., e.g., (Beeler, Tran, and Banks 2000) and (Alla, Kalise, and Simoncini 2023).
A representation/approximation of the nonlinear system via \[ \dot x = [A_0 + \sum_{k=1}^r \rho_k(x) A_k]\, x + Bu \] enables the nonlinear feedback design through truncated expansions of the SDRE.
A general procedure
If \(f(0)=0\) and under mild conditions, the flow \(f\) can be factorized \[ f( x) = [A(x)]\,x \] with some \(A\colon \mathbb R^{n} \to \mathbb R^{n\times n}\).
For the factorization \(f(x)=A(x)\,x\), one can say that
… like in the advective term in the Navier-Stokes equations: \[ (v\cdot \nabla)v = \mathcal A_s(v)\,v \] with \(s\in[0,1]\) and the linear operator \(\mathcal A_s(v)\) defined via \[\mathcal A_s(v)\,w := s\,(v\cdot \nabla)w + (1-s)\, (w\cdot \nabla)v.\]
Now, we have an state-dependent coefficient representation
\[ f(x) = A(x)\,x.\]
Trivially, this is an LPV representation \[ \dot x = A(\rho(x))\, x + Bu \] with \(\rho(x) = x\).
Take any model order reduction scheme that compresses (via \(\mathcal P\)) the state and lifts it back (via \(\mathcal L\)) so that \[ \tilde x = \mathcal L(\hat x) = \mathcal L (\mathcal P(x)) \approx x \]
If \(x\mapsto A(x)\) itself is affine linear
and \(\mathcal L\) is linear,
then \[ \dot x \approx A(\mathcal L \rho(x))\,x + Bu = [A_0 + \sum_{i=1}^r \rho_i(x) A_i]\, x + Bu \] is affine with
Generally, a nonlinear \(f\) can be factorized as \(f(x) = A(x)\,x\).
Model order reduction provides a low dimensional LPV representation \(A(x)\,x\approx A(\mathcal \rho(x))\,x\).
The needed affine-linearity in \(\rho\) follows from system’s structure (or from another layer of approximation (see, e.g, (Koelewijn and Tóth 2020)).
The Navier-Stokes equations
\[ \dot v + (v\cdot \nabla) v- \frac{1}{\mathsf{Re}}\Delta v + \nabla p= f, \]
\[ \nabla \cdot v = 0. \]
Control Problem:
Simulation model:
\(\dot x = Ax + N(x,x) + Bu, \quad y = Cx\)
Nonlinear controller design for \[ \dot x = f(x) + Bu \] by LPV approximations and truncated SDRE expansions.
Compute an affine LPV approximative model with \[f(x)\approx A_0x + \sum_{k=1}^r \rho_k(x)A_kx.\]
Solve one Riccati and \(r\) Lyapunov equations for \(P_0\) and the \(L_k\)s.
Close the loop with \(u = -\frac{1}{\alpha}B^T[P_0x + \sum_{k=1}^r \rho_k(x) L_kx ].\)
We use POD coordinates with the matrix \(V\in \mathbb R^{n\times r}\) of POD modes \(v_k\)
\(\rho(x) = V^T x\),
\(\tilde x = V\rho(x)=\sum_{k=1}^r\rho_i(x)v_k.\)
Then: \[N(x,x)\approx N(\tilde x, x) = N(\sum_{k=1}^r\rho_i(x)v_k, x) = \sum_{k=1}^r\rho_i(x) N(v_k, x) \] which is readily realized as \[ [\sum_{k=1}^r\rho_i(x) A_k]\,x.\]
This requires solutions of large-scale (\(n=50'000\)) matrix equations
We use state-of-the-art low-rank ADI iterations (ask Steffen for details).
Comparison of feedback designs
LQR
– plain LQR controllerxSDRE-r
– truncated (at r
) SDRE
feedbackWe check the performance with respect to two parameters
\(\alpha\) … the regularization parameter that penalizes the control
\(t_{\mathsf c} > 0\) … time before the controller is activated
…
The parameter \(t_c\) describes the domain of attraction.
For r=0
the xSDRE-r
feedback recovers
the LQR
feedback.
Norm plot of the feedback signals.
LQR
fails to stabilizer
means better performancer=10
Less regularization
LQR
failsxSDRE
can achieve stabilityr
Measurable and reliable improvements with respect to \(\alpha\)
no measurable performance gain with respect to \(t_{\mathsf c}\)
still much space for improvement
r
?\[f(x) \quad \to\quad A(x)\,x\quad \to\quad \tilde A(\rho(x))\,x\quad \to\quad [A_0 + \sum_{k=1}^r\rho_k(x)A_k]\,x\]
Proof of concept for nonlinear controller design with POD and truncated SDRE (Heiland and Werner 2023).
General and performant but still heuristic approach.
Detailed roadmap for developing the LPV (systems) theory is available.
PhD student wanted!
Thank You!