Suppose I want to find the global maximum of a differentiable function $f : \mathbb{R}^n \rightarrow \mathbb{R}$ with multiple local maxima, where $n$ is on the order of $100$. I have access to both $f$ and its gradients $\nabla f$. What are the state-of-the-art techniques for solving this problem? Gradient descent alone does not suffice for my problem because it gets stuck in local maxima.
2026-03-25 22:02:48.1774476168
State-of-the-art techniques for maximizing a differentiable function with multiple local maxima
149 Views Asked by Bumbble Comm https://math.techqa.club/user/bumbble-comm/detail At
1
There are 1 best solutions below
Related Questions in OPTIMIZATION
- Optimization - If the sum of objective functions are similar, will sum of argmax's be similar
- optimization with strict inequality of variables
- Gradient of Cost Function To Find Matrix Factorization
- Calculation of distance of a point from a curve
- Find all local maxima and minima of $x^2+y^2$ subject to the constraint $x^2+2y=6$. Does $x^2+y^2$ have a global max/min on the same constraint?
- What does it mean to dualize a constraint in the context of Lagrangian relaxation?
- Modified conjugate gradient method to minimise quadratic functional restricted to positive solutions
- Building the model for a Linear Programming Problem
- Maximize the function
- Transform LMI problem into different SDP form
Related Questions in MAXIMA-MINIMA
- optimization with strict inequality of variables
- Minimum value of a complex expression involving cube root of a unity
- Calculation of distance of a point from a curve
- Find all local maxima and minima of $x^2+y^2$ subject to the constraint $x^2+2y=6$. Does $x^2+y^2$ have a global max/min on the same constraint?
- Solving discrete recursion equations with min in the equation
- Trouble finding local extrema of a two variable function
- Why do I need boundedness for a a closed subset of $\mathbb{R}$ to have a maximum?
- Find the extreme points of the function $g(x):=(x^4-2x^2+2)^{1/2}, x∈[-0.5,2]$
- Maximizing triangle area problem
- Find the maximum volume of a cylinder
Related Questions in GRADIENT-DESCENT
- Gradient of Cost Function To Find Matrix Factorization
- Can someone explain the calculus within this gradient descent function?
- Established results on the convergence rate of iterates for Accelerated Gradient Descent?
- Sensitivity (gradient) of function solved using RK4
- Concerning the sequence of gradients in Nesterov's Accelerated Descent
- Gradient descent proof: justify $\left(\dfrac{\kappa - 1}{\kappa + 1}\right)^2 \leq \exp(-\dfrac{4t}{\kappa+1})$
- If the gradient of the logistic loss is never zero, does that mean the minimum can never be achieved?
- How does one show that the likelihood solution for logistic regression has a magnitude of infinity for separable data (Bishop exercise 4.14)?
- How to determinate that a constrained inequality system is not empty?
- How to show that the gradient descent for unconstrained optimization can be represented as the argmin of a quadratic?
Related Questions in NON-CONVEX-OPTIMIZATION
- Find largest possible value of $x^2+y^2$ given that $x^2+y^2=2x-2y+2$
- Interesting Global Minimum of a Non-convex Function
- Minimize $x^T A y$, subject to $ x^Ty\geq 0$, where $A=\Phi^T\Phi$ is symmtric and semi-positive definite.
- How should I proceed to solve the below mentioned non-convex optimisation problem?
- Optimization of the sum of a convex and a non-convex function?
- Solution of a semidefinite program with rank at most $k$
- Trace maximization with semi-orthogonal constraint
- Literature on optimizing linear objectives over non-convex sets
- Convex Hull of the Union of Subgradients
- Find matrices $X$ such that the diagonal of $X^H X$ is sparse
Related Questions in GRADIENT-FLOWS
- understanding the gradient of a curve through derivative
- How can I solve the gradient here?
- Why is the gradient vector zero inside a circular level curve
- Does the gradient point to the direction of greatest increase or decrease or the direction of greatest increase?
- $H^{-1}$ norm and $H^{-1}$ gradient flow
- Is a level set more than one level curve?
- Does normalized gradient flow preserve level sets?
- How do you know what the units of force are measured in when calculating work with a line integral?
- Finding a metric to write a vector field as a gradient of a given function
- Solving for policy gradient in reinforcement learning
Trending Questions
- Induction on the number of equations
- How to convince a math teacher of this simple and obvious fact?
- Find $E[XY|Y+Z=1 ]$
- Refuting the Anti-Cantor Cranks
- What are imaginary numbers?
- Determine the adjoint of $\tilde Q(x)$ for $\tilde Q(x)u:=(Qu)(x)$ where $Q:U→L^2(Ω,ℝ^d$ is a Hilbert-Schmidt operator and $U$ is a Hilbert space
- Why does this innovative method of subtraction from a third grader always work?
- How do we know that the number $1$ is not equal to the number $-1$?
- What are the Implications of having VΩ as a model for a theory?
- Defining a Galois Field based on primitive element versus polynomial?
- Can't find the relationship between two columns of numbers. Please Help
- Is computer science a branch of mathematics?
- Is there a bijection of $\mathbb{R}^n$ with itself such that the forward map is connected but the inverse is not?
- Identification of a quadrilateral as a trapezoid, rectangle, or square
- Generator of inertia group in function field extension
Popular # Hahtags
second-order-logic
numerical-methods
puzzle
logic
probability
number-theory
winding-number
real-analysis
integration
calculus
complex-analysis
sequences-and-series
proof-writing
set-theory
functions
homotopy-theory
elementary-number-theory
ordinary-differential-equations
circles
derivatives
game-theory
definite-integrals
elementary-set-theory
limits
multivariable-calculus
geometry
algebraic-number-theory
proof-verification
partial-derivative
algebra-precalculus
Popular Questions
- What is the integral of 1/x?
- How many squares actually ARE in this picture? Is this a trick question with no right answer?
- Is a matrix multiplied with its transpose something special?
- What is the difference between independent and mutually exclusive events?
- Visually stunning math concepts which are easy to explain
- taylor series of $\ln(1+x)$?
- How to tell if a set of vectors spans a space?
- Calculus question taking derivative to find horizontal tangent line
- How to determine if a function is one-to-one?
- Determine if vectors are linearly independent
- What does it mean to have a determinant equal to zero?
- Is this Batman equation for real?
- How to find perpendicular vector to another vector?
- How to find mean and median from histogram
- How many sides does a circle have?
this problem is likely unsolvable, sorry :)
Let's say you are looking for a global maximum in a box of length 1 and dimension $m=100$. Further, let's even assume that you know the value of that maximum $F_{\max}$, you just don't know where it is. Let's also assume that the absolute values of all derivatives $|\nabla F|$ are upper-bounded by some number $D$. Whenever you sample one point from the function you get information about whether the maximum might be located within a certain patch localized around the sample point. The exact size and shape of the patch would depend on the value of the function, value of the gradient and $D$. Thing is, you don't get any information on what happens outside that patch. So, in order to guarantee finding the maximum, you must have sufficient patches to cover the entire volume of the cube. The number of patches you need grows exponentially with dimension, so even if you need 2 patches per dimension, you still need $2^{100} \approx 10^{30}$ patches, which is not computable on a regular hardware. Further concerns:
At least in astronomy, I have seen people use random search (e.g. MCMC) to get the probability distribution of different values occuring in the domain, and then use some prior knowledge to extrapolate the probability distribution to make an educated guess of what the maximum could be. No surprise, depending on prior assumptions and luck, they sometimes are wrong by a few orders of magnitude, but it is not their fault :D