Consider $$V(x)=\inf_{u \in \mathcal{U}}\int_{0}^{+\infty} e^{-\lambda t} f^0(y_x(t), u(t)) d t$$ which is the value function of an optimal control problem with $y_x(t)$ solution of the state equation \begin{equation}\label{state_eq_finite_dim} \left\{\begin{array}{l} y^{\prime}(t)=f(y(t), u(t)), t>0 \\ y(0)=x \in \mathbb{R}^n \end{array}\right. \end{equation} Then can I say that at an intuitive level $V$ should satisfy the following HJB \begin{equation*} \lambda v(x)-H(x, \nabla v(x))=0 \quad \text { in } \mathbb{R}^{n} \end{equation*} where \begin{equation} H(x, p)=\inf _{u \in U}\{f(x, u) \cdot p+f^0(x,u )\} \end{equation} By Bellman's optimality principle, we know that for every $t>0$ it holds that: \begin{equation} V(x)=\inf _{u \in \mathcal{U}}\left\{\int_{0}^{t} f^0\left(y_{x}(s), u(s)\right) e^{-\lambda s} d s+V\left(y_{x}(t)\right) e^{-\lambda t}\right\} \end{equation} since the lhs of Bellman's optimality principle is independent of $t$ while the rhs depends on $t$ and then we can differentiate formally the rhs and equating it to zero ?(in this way you get the HJB)
2026-02-23 05:57:27.1771826247
Dynamic programming and Bellman optimality principle
239 Views Asked by Bumbble Comm https://math.techqa.club/user/bumbble-comm/detail At
1
There are 1 best solutions below
Related Questions in OPTIMIZATION
- Optimization - If the sum of objective functions are similar, will sum of argmax's be similar
- optimization with strict inequality of variables
- Gradient of Cost Function To Find Matrix Factorization
- Calculation of distance of a point from a curve
- Find all local maxima and minima of $x^2+y^2$ subject to the constraint $x^2+2y=6$. Does $x^2+y^2$ have a global max/min on the same constraint?
- What does it mean to dualize a constraint in the context of Lagrangian relaxation?
- Modified conjugate gradient method to minimise quadratic functional restricted to positive solutions
- Building the model for a Linear Programming Problem
- Maximize the function
- Transform LMI problem into different SDP form
Related Questions in CONTROL-THEORY
- MIT rule VS Lyapunov design - Adaptive Control
- Question on designing a state observer for discrete time system
- Do I really need quadratic programming to do a Model Predictive Controller?
- Understanding Definition of Switching Sequence
- understanding set of controllable state for switched system
- understanding solution of state equation
- Derive Anti Resonance Frequency from Transfer Function
- Laplace Transforms, show the relationship between the 2 expressions
- Laplace transform of a one-sided full-wave rectified...
- Controlled Markov process - proper notation and set up
Related Questions in OPTIMAL-CONTROL
- Do I really need quadratic programming to do a Model Predictive Controller?
- Transforming linear dynamical system to reduce magnitude of eigen values
- Hamiltonian minimization
- An approximate definition of optimal state trajectory of a discrete time system
- Reference request: Symmetric Groups and linear control systems
- Does the Pontryagrin maximum principle in sequential order result in same minimum?
- I can't get my Recursive Least Square algorithm work - What have I miss?
- Will LQR act like MPC in reality?
- Find which gain the process will be unstable?
- How do I find the maximum gain limit for a delayed system?
Related Questions in DYNAMIC-PROGRAMMING
- Dynamic programming for Knapsack problem
- DP algorithm for covering the distance between two points with a set of intervals
- Solution of an HJB equation in continuous time
- correctness for minimizing average completition time for scheduling problem with release times
- Zero-sum differential game
- An enclosing polygon with minimum area
- Divide set into two subsets of equal sum and maximum this sum
- Stochastic Dynamic Programming: Deriving the Steady-State for a Lottery
- How would you prove that a dynamic programming problem is solvable by a greedy algorithm?
- How to find minimal distances route for a trip of $t$ days, given distances for each stop?
Related Questions in VISCOSITY-SOLUTIONS
- Why is this a viscosity super solution?
- 1D viscous flow upwards against gravity
- Comparison principle for linear second order viscosity solutions
- Determine solutions of the Jacobi-Hamilton problem $u_{t}+|u_x|^{2}=0$
- Estimate the large-time behavior of the unique viscosity solution
- Viscous Fluids at a Slope (Navier-Stokes)
- Viscosity solution for Hamilton-Jacobi equations and local extrema
- Estimating a Lebesgue integral and Taylor's formula
- Uniform convergence in $\sup$ and $\inf$ convolutions
- Touching points are dense (viscosity theory)
Trending Questions
- Induction on the number of equations
- How to convince a math teacher of this simple and obvious fact?
- Find $E[XY|Y+Z=1 ]$
- Refuting the Anti-Cantor Cranks
- What are imaginary numbers?
- Determine the adjoint of $\tilde Q(x)$ for $\tilde Q(x)u:=(Qu)(x)$ where $Q:U→L^2(Ω,ℝ^d$ is a Hilbert-Schmidt operator and $U$ is a Hilbert space
- Why does this innovative method of subtraction from a third grader always work?
- How do we know that the number $1$ is not equal to the number $-1$?
- What are the Implications of having VΩ as a model for a theory?
- Defining a Galois Field based on primitive element versus polynomial?
- Can't find the relationship between two columns of numbers. Please Help
- Is computer science a branch of mathematics?
- Is there a bijection of $\mathbb{R}^n$ with itself such that the forward map is connected but the inverse is not?
- Identification of a quadrilateral as a trapezoid, rectangle, or square
- Generator of inertia group in function field extension
Popular # Hahtags
second-order-logic
numerical-methods
puzzle
logic
probability
number-theory
winding-number
real-analysis
integration
calculus
complex-analysis
sequences-and-series
proof-writing
set-theory
functions
homotopy-theory
elementary-number-theory
ordinary-differential-equations
circles
derivatives
game-theory
definite-integrals
elementary-set-theory
limits
multivariable-calculus
geometry
algebraic-number-theory
proof-verification
partial-derivative
algebra-precalculus
Popular Questions
- What is the integral of 1/x?
- How many squares actually ARE in this picture? Is this a trick question with no right answer?
- Is a matrix multiplied with its transpose something special?
- What is the difference between independent and mutually exclusive events?
- Visually stunning math concepts which are easy to explain
- taylor series of $\ln(1+x)$?
- How to tell if a set of vectors spans a space?
- Calculus question taking derivative to find horizontal tangent line
- How to determine if a function is one-to-one?
- Determine if vectors are linearly independent
- What does it mean to have a determinant equal to zero?
- Is this Batman equation for real?
- How to find perpendicular vector to another vector?
- How to find mean and median from histogram
- How many sides does a circle have?
Too long to be a comment:
According to the book you mentioned in page 12, the last expression you wrote $h(t)=\int_{0}^{t} f^0\left(y_{x}(s), u(s)\right) e^{-\lambda s} d s+V\left(y_{x}(t)\right) e^{-\lambda t}$ in general will depend on $t$. However, this quantity should be constant regardless of $t$ for the optimal trajectory due to the dynamic programming principle. This is the equivalent to say that if the fastest route from L.A. to Boston passes through Chicago, then it is also the sequence of the fastest route from LA to Chicago and the fastest from Chicago to Boston. And the same goes for any other midpoint in the optimal route from L.A. to Boston. In your case, this roughly means that the for optimal trajectories the functional remains constant regardless of the midpoint $t$: the first term in $h(t)$ i.e. $\int_{0}^{t} f^0\left(y_{x}(s), u(s)\right) e^{-\lambda s} d s$ which is the cost up to $t$ and the second term i.e. $V\left(y_{x}(t)\right) e^{-\lambda t}$ which is the cost to go, are both optimal for any $t$. Thus, $h'(t)=0$ (since its constant) for the optimal route. Then, the nexts steps from the reference you gave lead to the HJB. Is this the part you wanted to be clarified? Please let me now.