My question is about the infinite-horizon linear-quadratic regulator (LQR) problem. Consider the following linear and time-invariant system $$ x_{k+1} = Ax_k + Bu_k $$ Given the initial state vector $x_0 \in \mathbb R^n$, our objective is to solve the following optimization problem $$ \inf_{f_0,f_1,\dots} \left[\lim_{N \to \infty} \sum_{k=0}^{N-1} x_k^TQx_k + u_k^TRu_k\right] \tag{1} $$ where $(f_n)_{n \in \mathbb N}$ is the sequence of policies such that $u_k = f_k(x_k)$. In the notes and books that I've read, I noticed that the following optimization problem is solved instead: $$ \lim_{N \to \infty} \left[\inf_{f_0,\dots,f_{N-1}} \sum_{k=0}^{N-1} x_k^TQx_k + u_k^TRu_k\right] \tag{2} $$ where the $\inf$ and $\lim$ are interchanged. More specifically, I've found that authors tend to take the limit of the discrete-time finite horizon Riccati difference equation, which suggests that they implicitly performed the interchange shown in $(2)$. However, I'm not sure why this interchange is valid, even when we assume that $(A,B)$ is stabilizable (we assume that $y_k = x_k$ for simplicity, such that $(A,C)$ is observable). That is, why is the solution of the problem in $(1)$ equivalent to the solution of the problem in $(2)$? I'd appreciate any guidance on this.
2026-03-27 21:37:23.1774647443
Are the $\inf$ and $\lim$ interchangeable in the infinite-horizon discrete-time LQR problem?
101 Views Asked by Bumbble Comm https://math.techqa.club/user/bumbble-comm/detail At
1
There are 1 best solutions below
Related Questions in SEQUENCES-AND-SERIES
- How to show that $k < m_1+2$?
- Justify an approximation of $\sum_{n=1}^\infty G_n/\binom{\frac{n}{2}+\frac{1}{2}}{\frac{n}{2}}$, where $G_n$ denotes the Gregory coefficients
- Negative Countdown
- Calculating the radius of convergence for $\sum _{n=1}^{\infty}\frac{\left(\sqrt{ n^2+n}-\sqrt{n^2+1}\right)^n}{n^2}z^n$
- Show that the sequence is bounded below 3
- A particular exercise on convergence of recursive sequence
- Proving whether function-series $f_n(x) = \frac{(-1)^nx}n$
- Powers of a simple matrix and Catalan numbers
- Convergence of a rational sequence to a irrational limit
- studying the convergence of a series:
Related Questions in OPTIMIZATION
- Optimization - If the sum of objective functions are similar, will sum of argmax's be similar
- optimization with strict inequality of variables
- Gradient of Cost Function To Find Matrix Factorization
- Calculation of distance of a point from a curve
- Find all local maxima and minima of $x^2+y^2$ subject to the constraint $x^2+2y=6$. Does $x^2+y^2$ have a global max/min on the same constraint?
- What does it mean to dualize a constraint in the context of Lagrangian relaxation?
- Modified conjugate gradient method to minimise quadratic functional restricted to positive solutions
- Building the model for a Linear Programming Problem
- Maximize the function
- Transform LMI problem into different SDP form
Related Questions in CONTROL-THEORY
- MIT rule VS Lyapunov design - Adaptive Control
- Question on designing a state observer for discrete time system
- Do I really need quadratic programming to do a Model Predictive Controller?
- Understanding Definition of Switching Sequence
- understanding set of controllable state for switched system
- understanding solution of state equation
- Derive Anti Resonance Frequency from Transfer Function
- Laplace Transforms, show the relationship between the 2 expressions
- Laplace transform of a one-sided full-wave rectified...
- Controlled Markov process - proper notation and set up
Related Questions in OPTIMAL-CONTROL
- Do I really need quadratic programming to do a Model Predictive Controller?
- Transforming linear dynamical system to reduce magnitude of eigen values
- Hamiltonian minimization
- An approximate definition of optimal state trajectory of a discrete time system
- Reference request: Symmetric Groups and linear control systems
- Does the Pontryagrin maximum principle in sequential order result in same minimum?
- I can't get my Recursive Least Square algorithm work - What have I miss?
- Will LQR act like MPC in reality?
- Find which gain the process will be unstable?
- How do I find the maximum gain limit for a delayed system?
Trending Questions
- Induction on the number of equations
- How to convince a math teacher of this simple and obvious fact?
- Find $E[XY|Y+Z=1 ]$
- Refuting the Anti-Cantor Cranks
- What are imaginary numbers?
- Determine the adjoint of $\tilde Q(x)$ for $\tilde Q(x)u:=(Qu)(x)$ where $Q:U→L^2(Ω,ℝ^d$ is a Hilbert-Schmidt operator and $U$ is a Hilbert space
- Why does this innovative method of subtraction from a third grader always work?
- How do we know that the number $1$ is not equal to the number $-1$?
- What are the Implications of having VΩ as a model for a theory?
- Defining a Galois Field based on primitive element versus polynomial?
- Can't find the relationship between two columns of numbers. Please Help
- Is computer science a branch of mathematics?
- Is there a bijection of $\mathbb{R}^n$ with itself such that the forward map is connected but the inverse is not?
- Identification of a quadrilateral as a trapezoid, rectangle, or square
- Generator of inertia group in function field extension
Popular # Hahtags
second-order-logic
numerical-methods
puzzle
logic
probability
number-theory
winding-number
real-analysis
integration
calculus
complex-analysis
sequences-and-series
proof-writing
set-theory
functions
homotopy-theory
elementary-number-theory
ordinary-differential-equations
circles
derivatives
game-theory
definite-integrals
elementary-set-theory
limits
multivariable-calculus
geometry
algebraic-number-theory
proof-verification
partial-derivative
algebra-precalculus
Popular Questions
- What is the integral of 1/x?
- How many squares actually ARE in this picture? Is this a trick question with no right answer?
- Is a matrix multiplied with its transpose something special?
- What is the difference between independent and mutually exclusive events?
- Visually stunning math concepts which are easy to explain
- taylor series of $\ln(1+x)$?
- How to tell if a set of vectors spans a space?
- Calculus question taking derivative to find horizontal tangent line
- How to determine if a function is one-to-one?
- Determine if vectors are linearly independent
- What does it mean to have a determinant equal to zero?
- Is this Batman equation for real?
- How to find perpendicular vector to another vector?
- How to find mean and median from histogram
- How many sides does a circle have?
Consider the following optimal control problem
$$ J_T(x_0)=\min_{u}\sum_{i=0}^{T-1}x_i^TQx_i+u_i^TRu_i $$ where $x_{k+1}=Ax_k+Bu_k$. From dynamic programming, we have that
$$ J_T(x_0)=x_0^TP_T(0)x_0 $$ where $P_T(k)$ solves the Difference Riccati Equation (DRE)
$$ P_T(k-1)=\mathcal{R}(P_T(k)),\ P_T(T)=0. $$
In that case, the optimal control law is given by $u_k=K_kx_k$ where $K_k=-(R+B^TP_T(k+1)B)^{-1}B^TP_T(k+1)A$.
On the other, we can define the cost
$$ J_\infty(x_0)=\min_{u}\sum_{i=0}^{ \infty}x_i^TQx_i+u_i^TRu_i. $$
and we know that this cost is equal to
$$ J_\infty(x_0)=x_0^TP_\infty x_0 $$ where $P_\infty$ is the stabilizing solution of the Algebraic Riccati Equation (ARE)
$$ P_\infty=\mathcal{R}(P_\infty). $$
By stabilizing solution, it is meant here that $A+BK$ is Schur stable where $K=-(R+B^TP_\infty B)^{-1}B^TP_\infty A$.
The following still contains gap and unclear points which I will fix when I understand how to do it correctly. In the meantime that can just serve as an indicative answer. We show that $P_N(0)\to P_\infty$ as $T\to\infty$.
First of all, it is important to note that $P_{T+1}(0)\succeq P_T(0)$ because the integrand is nonnegative. Assumming now that the pair $(A,B)$ is stabilizable, there exists a $K$ such that $A+BK$ is Schur stable.
This means that we have $$ J_T(x_0)\le\min_{u}\sum_{i=0}^{T-1}x_i^T(Q+K^TRK)x_i=x_0^T\left(\sum_{i=0}^{T-1}((A+BK)^i)^T(Q+K^TRK)A+BK)^i\right) $$ and we have that the sum is converging as $T\to\infty$ to, say, $Q$. Therefore, this implies that $P_T(0)\preceq Q$. As a result, the sequence $\{P_T(0)\}_T$ is nondcreasing and upper bounded, which means that it is converging to a limit $P$, which is assumed to be positive definite.
Since it is the solution of a sequence, then it must be converging to a stationary solution of the DRE. In fact, since it is positive definite, it is the unique positive definite solution to the ARE and we have that $P=P_\infty$ (this needs to be better supported).