I am trying to determine the rate of convergence for $f(x,y) = 5x^2 + 5y^2 − xy − 11x + 11y$. Would anyone be able to provide guidance as to how I might go about doing this? Should I select my own initial $x_k$ and constant $a$? What is a general procedure?
2026-04-08 00:21:31.1775607691
Rate of Convergence for Gradient Descent (Example)
4.7k Views Asked by Bumbble Comm https://math.techqa.club/user/bumbble-comm/detail At
1
There are 1 best solutions below
Related Questions in ALGORITHMS
- Least Absolute Deviation (LAD) Line Fitting / Regression
- Do these special substring sets form a matroid?
- Modified conjugate gradient method to minimise quadratic functional restricted to positive solutions
- Correct way to prove Big O statement
- Product of sums of all subsets mod $k$?
- (logn)^(logn) = n^(log10+logn). WHY?
- Clarificaiton on barycentric coordinates
- Minimum number of moves to make all elements of the sequence zero.
- Translation of the work of Gauss where the fast Fourier transform algorithm first appeared
- sources about SVD complexity
Related Questions in RECURRENCE-RELATIONS
- Recurrence Relation for Towers of Hanoi
- Solve recurrence equation: $a_{n}=(n-1)(a_{n-1}+a_{n-2})$
- General way to solve linear recursive questions
- Approximate x+1 without addition and logarithms
- Recurrence relation of the series
- first order inhomogeneous linear difference equation general solution
- Guess formula for sequence in FriCAS
- Solve the following recurrence relation: $a_{n}=10a_{n-2}$
- Find closed form for $a_n=2\frac{n-1}{n}a_{n-1}-2\frac{n-2}{n}a_{n-2}$ for all $n \ge 3$
- Young Tableaux generating function
Related Questions in NONLINEAR-OPTIMIZATION
- Prove that Newton's Method is invariant under invertible linear transformations
- set points in 2D interval with optimality condition
- Finding a mixture of 1st and 0'th order Markov models that is closest to an empirical distribution
- Sufficient condition for strict minimality in infinite-dimensional spaces
- Weak convergence under linear operators
- Solving special (simple?) system of polynomial equations (only up to second degree)
- Smallest distance to point where objective function value meets a given threshold
- KKT Condition and Global Optimal
- What is the purpose of an oracle in optimization?
- Prove that any Nonlinear program can be written in the form...
Trending Questions
- Induction on the number of equations
- How to convince a math teacher of this simple and obvious fact?
- Find $E[XY|Y+Z=1 ]$
- Refuting the Anti-Cantor Cranks
- What are imaginary numbers?
- Determine the adjoint of $\tilde Q(x)$ for $\tilde Q(x)u:=(Qu)(x)$ where $Q:U→L^2(Ω,ℝ^d$ is a Hilbert-Schmidt operator and $U$ is a Hilbert space
- Why does this innovative method of subtraction from a third grader always work?
- How do we know that the number $1$ is not equal to the number $-1$?
- What are the Implications of having VΩ as a model for a theory?
- Defining a Galois Field based on primitive element versus polynomial?
- Can't find the relationship between two columns of numbers. Please Help
- Is computer science a branch of mathematics?
- Is there a bijection of $\mathbb{R}^n$ with itself such that the forward map is connected but the inverse is not?
- Identification of a quadrilateral as a trapezoid, rectangle, or square
- Generator of inertia group in function field extension
Popular # Hahtags
second-order-logic
numerical-methods
puzzle
logic
probability
number-theory
winding-number
real-analysis
integration
calculus
complex-analysis
sequences-and-series
proof-writing
set-theory
functions
homotopy-theory
elementary-number-theory
ordinary-differential-equations
circles
derivatives
game-theory
definite-integrals
elementary-set-theory
limits
multivariable-calculus
geometry
algebraic-number-theory
proof-verification
partial-derivative
algebra-precalculus
Popular Questions
- What is the integral of 1/x?
- How many squares actually ARE in this picture? Is this a trick question with no right answer?
- Is a matrix multiplied with its transpose something special?
- What is the difference between independent and mutually exclusive events?
- Visually stunning math concepts which are easy to explain
- taylor series of $\ln(1+x)$?
- How to tell if a set of vectors spans a space?
- Calculus question taking derivative to find horizontal tangent line
- How to determine if a function is one-to-one?
- Determine if vectors are linearly independent
- What does it mean to have a determinant equal to zero?
- Is this Batman equation for real?
- How to find perpendicular vector to another vector?
- How to find mean and median from histogram
- How many sides does a circle have?
I am not sure which method you are using, but you can see the different approaches in these Lecture Notes - Section 4.3. Having said that, I will provide a solution using the Convergence Theory - Section 3 and you can get whatever mileage out of it in case you are using one of the other approaches.
We are given:
$$\tag 1 f(x,y) = 5x^2 + 5y^2 − xy − 11x + 11y,$$
and asked to find the Rate of Convergence using the Gradient Descent method.
We can write $f(x, y)$ as a quadratic function in order to use the referenced Convergence Theory as:
$v = \begin{pmatrix} x \\ y \end{pmatrix}, ~A = \begin{pmatrix} 10 & -1 \\ -1 & 10 \end{pmatrix}, ~b = \begin{pmatrix} 11 \\ -11 \end{pmatrix}$
We can now write:
$\tag 2 \displaystyle f(v) = \frac{1}{2} v^T \cdot A \cdot v - v^t \cdot b = (\frac{1}{2}) \begin{pmatrix} x & y \end{pmatrix} \cdot \begin{pmatrix} 10 & -1 \\ -1 & 10 \end{pmatrix} \cdot \begin{pmatrix} x \\ y \end{pmatrix} - \begin{pmatrix} x & y \end{pmatrix} \cdot \begin{pmatrix} 11 \\ -11 \end{pmatrix}$
Note, multiply out $(2)$ and verify it produces $f(x, y)$.
The matrix $A$ produces two eigenvalues $\lambda_1 = 9$ and $\lambda_2 = 11.$
Since $A$ is normal, the condition number is given by:
$K = \displaystyle \frac{\lambda_max}{\lambda_min} = \frac{11}{9}.$
We can now find the global minimizer, $v^*$, such that $Av* = b$.
Solving this, we arrive at $v^* = \begin{pmatrix} 1 \\ -1 \end{pmatrix}.$
We can now calculate the minimum as $f(v^*) = -11$ and verify the same result using the original function, $f(x, y) = f(1, -1) = -11.$
The convergence of Steepest Descent is governed by the condition number of the matrix and defined by the constant:
$$\displaystyle \rho = \frac{K-1}{K+1} = \frac{\frac{11}{9} - 1}{\frac{11}{9}+1} = \frac{1}{10} \text{and} ||e^{(k)}||_A \le \rho^k||e^{(0)}||_A$$
So, we have (from theory):
$$\displaystyle 0.5 ||e^{(k)}||^2_A \le 0.5 \rho^{2k}||e^{(0)}||^2_A = 0.5(\frac{1}{100})^k ||e^{(0)}||^2_A = 5.5 \cdot 10^{-2k}$$
This tells us we need six to seven steps to get $\displaystyle 0.5 ||e^{(k)}||^2_A \approx 10^{-11}.$
Now, you should do your iterations and see that that is the number of steps to converge to the solution above.
Additionally, you should compare the convergence to the exact result and make sure you get what the condition number was telling us.