Why is there an "absolute value" and a norm in the Schwarz Inequality?

4.8k Views Asked by At

This really bothers me, and I'm not sure if it's just that I'm not understanding it correctly. For the moment, assume we are working in a vector space $V$ over $\mathbb{R}^n$. Let $x,y \in V$. We have an inner product $\langle \cdot , \cdot\rangle\colon V \to \mathbb{R}$, which in this case is just the dot product.

In so many presentations of the Schwarz Inequality, the inequality is given as:

$\big\rvert\langle x,y\rangle\big\lvert \leq \big\rvert\big\rvert{x}\big\lvert\big\lvert \;\big\rvert\big\rvert{y}\big\lvert\big\lvert$

Why do we use both the absolute value and the norm? The absolute value is a norm, so why don't we just write

$\big\rvert\big\rvert \langle x,y\rangle\big\lvert\big\lvert^2 \leq \big\rvert\big\rvert{x}\big\lvert\big\lvert \;\big\rvert\big\rvert{y}\big\lvert\big\lvert$

I'm not sure how clear this question is, but I guess my gripe is with the fact that we present the inequality with two different types of norms.

I guess, I find it easier to conceptually understand the following notation:

$\langle x,y \rangle^2 \leq \langle x, x \rangle \langle y, y \rangle$

This is actually really bothering me at the moment, and I can't figure out why the "absolute value" is such a popular way to present the inequality.

2

There are 2 best solutions below

1
On

I guess my gripe is with the fact that we present the inequality with two different types of norms.

That's because they are two different norms, on two different spaces! Moreover, we might not even want to think of the absolute value as being a norm for this purpose.

When you declare a normed space $(V, \lVert{-}\rVert)$, you're assigning the symbol $\lVert{-}\rVert$ as your notation for the norm on that space. If $V \ne \mathbb{R}$, say $V = \mathbb{R}^3$ or whatever, then $\lVert{-}\rVert$ refers to the norm on $\mathbb{R}^3$ and not on $\mathbb{R}$.

However, the notation $|{-}|$ unambiguously means one thing: the absolute value.

0
On

The norm operates on the vector space that contain $x$ and $y$. The absolute value operates on the scalar. Moreover, the absolute value has a very precise meaning in this case: removal of the sign of real numbers. However, the norm (in this case derived from the dot product) can be based on any of the infinitely many dot products in existence. It's really a very specific metric that can be constructed on vector spaces with a dot products (=square root of the dot product with itself). It's true that the absolute value is a norm... but not any norm is a dot product.

I too prefer the last expression. It's clearer and doesn't introduce square roots, which are indeed a very specific construct. What if the vectors were constructed over the field of integers? In that case, the square root doesn't necessarily exist, but a square is still well defined.