Why isn't this function affine?

633 Views Asked by At

I'm studying convex optimization using Convex Optimization (Boyd & Vandenberghe) and had a question from an example used in Chapter 4.2: Convex Optimization.

The specific example is as follows:

$$\begin{array}{ll} \text{minimize} & f_0(x) = x_1^2 + x_2^2\\ \text{subject to} & f_1(x) = \frac{x_1}{1+x_2^2} \le 0\\ & h_1(x) = (x_1 + x_2)^2 = 0\end{array}$$

The textbook states that this problem is not a convex optimization problem because the equality constraint $h_1(x)$ is not affine.

I'm aware that affine functions are linear functions composed with a translation, but when I plotted out the graph for $h_1(x)$ I get a line or plane, which are both affine sets. This led to my confusion as to why the function isn't affine if it's graph is an affine set.

Why isn't it an affine function? Is my understanding of what an affine function is itself wrong?

3

There are 3 best solutions below

1
On

The function $h_{1}(x)$ is not an affine function because it can't be written in the form $h_{1}(x)=a^{T}x+b$. Or, if you let $x=(1, 1)$ and $y=(2,2)$, you'll see that $(h_{1}(x)+h_{1}(y))/2=10$, while $h_{1}((x+y)/2)=9$. It's true that these points don't satisfy $h_{1}(x)=0$ and $h_{1}(y)=0$, but that doesn't make any difference in whether $h_{1}()$ is itself an affine function.

The definition of "convex optimization problem" being used here requires that the functions $h_{i}(x)$ in the constraints $h_{i}(x)=0$ be affine. That's not the same thing as requiring that the set of $x$ such that $h_{1}(x)=0$ is an affine set.

Under this definition of a convex optimization problem, it's possible (as this example illustrates) that the feasible region of an optimization problem is convex even though the problem is not a convex optimization problem under the definition.

Why use this definition of "convex optimization problem"? One reason is that under this definition we can assume that the $g_{i}(x)$ functions are convex and the $h_{i}(x)$ functions are affine. You'll see these properties used from time to time in proofs later in the book.

0
On

$$h_1 (\mathrm x) = \left( 1_2^\top \mathrm x \right)^2 = \mathrm x^\top 1_2 1_2^\top \mathrm x$$

is a positive semidefinite quadratic form that vanishes at the line defined by $1_2^\top \mathrm x = 0$. However, function $h_1$ is quadratic and, thus, non-affine.

0
On

The constraint is affine. It is because the constraint is equivalent to the affine constraint $x_1+x_2 = 0$. However, it is written in a form which is not affine. Hence the confusion.