Why is the change-to-polar-coordinate method valid in computing limits?

1.9k Views Asked by At

For example, when we compute $\displaystyle\lim_{(x,y)\to(0,0)}\frac{x^2y}{x^2+y^2}$, we can use the change to polar coordinates method below:

Let $\begin{cases}x=r\cos\theta\\y=r\sin\theta \end{cases}$, then $\displaystyle\lim_{(x,y)\to(0,0)}\frac{x^2y}{x^2+y^2}=\lim_{r\to0}\frac{r^3\cos^3\theta\sin\theta}{r^2}(=\lim_{r\to0}r\cos^2\theta\sin\theta=0)$.

However, why is this method valid? What is the actual reason behind this substitution? Is this related to the limit law of composition functions of continuous function? But what are those two functions that applying behind here?

I think it makes use of this change-of-coordinate continuous and one-to-one function $\Psi$:

$$\begin{alignat*}{3}\Psi:\ &\mathbb{R}^+\times[0,2\pi)&&\to\mathbb{R}^2\\ &(r,\theta)&&\mapsto(r\cos\theta,r\sin\theta)\end{alignat*}$$

But I can't figure out how to "chain" this function with others and get the eager result.

5

There are 5 best solutions below

4
On BEST ANSWER

Note: this answer stems from the (now removed) comments left under my answer to your other question about polar coordinates. As requested, I have edited to add the proofs.

Before proceeding, let us define (possibly non-standard notation) $$\lim_{(r,\phi)\to\{0\}\times\mathbb R}g(r,\phi)=L\tag{$\diamond$}$$ to mean that for each $\epsilon>0$, there is an open neighborhood $U$ of $\{0\}\times\mathbb R$ in $[0,\infty)\times\mathbb R$ such that $(r,\phi)\in U$ and $r\neq0$ implies $|g(r,\phi)-L|<\epsilon.$

In the case $g(r,\phi)=f(r\cos\phi,r\sin\phi)$, which is $2\pi$-periodic in the second argument, this can be simplified, by requiring instead that for each $\epsilon>0$, there is a $\delta>0$ such that $0<r<\delta$ implies $|g(r,\phi)-L|<\epsilon$. (Intuitively, in this case we do not have to worry about $\delta$ getting smaller and smaller as $\phi\to\pm\infty$.) This can be done e.g. by using the tube lemma from elementary topology.

We may now state:

Proposition. The following are equivalent for a function $f:\mathbb R^2\setminus\{(0,0)\}\to\mathbb R$:

  1. We have $$\lim_{(x,y)\to(0,0)}f(x,y)=L.$$
  2. The function $\tilde f:\mathbb R^2\to\mathbb R$, defined by $$\tilde f(x,y)=\begin{cases}f(x,y);&(x,y)\neq(0,0),\\L;&(x,y)=(0,0),\end{cases}$$ is continuous at $(0,0)$ and $f(0,0)=L$.
  3. For all $\phi_0\in\mathbb R$ we have $$\lim_{(r,\phi)\to(0,\phi_0)}f(r\cos\phi,r\sin\phi)=L.\tag{$\star$}$$
  4. We have $$\lim_{(r,\phi)\to\{0\}\times\mathbb R}f(r\cos\phi,r\sin\phi)=L.$$
  5. The function $g:[0,\infty)\times\mathbb R\to\mathbb R$, defined by $$g(r,\phi)=\begin{cases}f(r\cos\phi,r\sin\phi);&r>0,\\L;&r=0,\end{cases}$$ is continuous at $(0,\phi_0)$ for each $\phi_0\in\mathbb R$.

Proof. The equivalence of 1 and 2 is standard and follows directly from the definitions.

To see that 2 implies 5, let $p:[0,\infty)\times\mathbb R\to\mathbb R^2$ be defined by $p(r,\phi)=(r\cos\phi,r\sin\phi).$ This is obviously continuous, so if $\tilde f$ is continous, $g=\tilde f\circ p$ must be continuous as well.

Next, we show that 5 implies 3. To see this, let $\epsilon>0$ and $\phi_0\in\mathbb R$. By $5$, there is a $\delta>0$ such that $\|(r,\phi)-(0,\phi_0)\|<\delta$ implies $|g(r,\phi)-g(0,\phi_0)|<\epsilon$. In particular, if $r>0$, this means that $\|(r,\phi)-(0,\phi_0)\|<\delta$ implies $|f(r\cos\phi,r\sin\phi)-L|<\epsilon$, which is precisely 3.

Next, we show that 3 implies 4. Let $\epsilon>0$. For each $\phi_0\in\mathbb R$ we have a $\delta(\phi_0)>0$ such that $\|(r,\phi)-(0,\phi_0)\|<\delta(\phi_0)$ implies $|f(r\cos\phi,r\sin\phi)-L|<\epsilon$. Let $$U(\phi_0)=\{(r,\phi)\in[0,\infty)\times\mathbb R\mid \|(r,\phi)-(0,\phi_0)\|<\delta(\phi_0)\}.$$ Then $U=\bigcup_{\phi_0\in\mathbb R}U(\phi_0)$ is an open neighborhood of $\{0\}\times\mathbb R$ in $[0,\infty)\times\mathbb R$ and for each $r\neq 0$ the condition $(r,\phi)\in U$ implies $\|f(r\cos\phi,r\sin\phi)- L\|<\epsilon$, because every such $(r,\phi)$ is an element of some $U(\phi_0)$ and therefore satisfies $\|(r,\phi)-(0,\phi_0)\|<\delta(\phi_0)$.

Finally, we show that 4 implies 1, thus concluding the circle of implications and establishing the desired equivalence. To do this, let $\epsilon>0$. By 4 and the fact that $f(r\cos\phi,r\sin\phi)$ is a $2\pi$-periodic function of $\phi$, there is a $\delta>0$ such that $0<r<\delta$ implies $|f(r\cos\phi,r\sin\phi)-L|<\epsilon$. Therefore, if $0<\|(x,y)\|<\delta$, we have $|f(x,y)-L|<\epsilon$, because $(x,y)=(r\cos\phi,r\sin\phi)$ for some $r\in(0,\delta)$ and $\phi\in\mathbb R$. This concludes the proof. $\qquad\square$

Remark. Note that the limit $(\star)$ is the limit of a two-variable function. Requiring that we have the single variable limit $$\lim_{r\to0}f(r\cos\phi_0,r\sin\phi_0)=L\tag{$\ast$}$$ for each $\phi_0\in\mathbb R$ is not (!) equivalent. In fact, this latter condition is not sufficient for the limit to exist as shown by the standard counterexample $f(x,y)=\frac{x^2y}{x^4+y^2}.$) The point is that the limit $(\star)$ takes into account points $(r,\phi)$ with different values of $\phi$ whereas in $(\ast)$ the value $\phi=\phi_0$ is fixed.

Added: here are the details regarding the part with the tube lemma, but I will do it without actually using the tube lemma, which would allow us to skip a few steps. (In fact, what follows basically includes a proof of the tube lemma in this special case.)

The definition of $(\diamond)$ mentions a neighborhood $U$ of $\{0\}\times\mathbb R$ in $[0,\infty)\times\mathbb R$. We wish to show that, if $g(r,\phi)$ is $2\pi$-periodic in $\phi$, we can always take a neighborhood of the form $V=[0,\delta)\times\mathbb R$ for some $\delta>0$, instead of this possibly more complicated neighborhood $U$.

Assume we have a neighborhood $U$ of $\{0\}\times\mathbb R$, as in the definition of $(\diamond)$, not necessarily of the form $[0,\delta)\times\mathbb R$. Using $U$, we are going to construct a neighborhood $V=[0,\delta)\times\mathbb R$ satisfying the same properties. Note that since $U$ is a neighborhood of every point $(0,\phi_0)\in\{0\}\times[0,2\pi]$, it contains a ball $$B((0,\phi_0),\eta(\phi_0))=\{(r,\phi)\in[0,\infty)\times\mathbb R\mid \|(r,\phi)-(0,\phi_0)\|<\eta(\phi_0)\}$$ of radius $\eta(\phi_0)>0$ around every such point. This ball contains a set of the form $$V(\phi_0)=[0,\delta(\phi_0))\times(\phi_0-\delta(\phi_0),\phi_0+\delta(\phi_0))$$ which still contains $(0,\phi_0)$. The sets $V(\phi_0)$ form a cover of the interval $\{0\}\times[0,2\pi]$, which is compact, so there is a finite subcover $V(\phi_1),V(\phi_2),\ldots, V(\phi_n)$, where $\phi_1,\phi_2,\ldots,\phi_n\in[0,2\pi]$. Now take $\delta=\min\{\delta(\phi_1),\delta(\phi_2),\ldots,\delta(\phi_n)\}$. Observe that $$[0,\delta)\times[0,2\pi]\subseteq V(\phi_1)\cup V(\phi_2)\cup\ldots\cup V(\phi_n)\subseteq U,$$ so $(r,\phi)\in[0,\delta)\times [0,2\pi]$ implies $|g(r,\phi)-L|<\epsilon$. But since $g$ is $2\pi$-periodic in $\phi$, this means that $(r,\phi)\in[0,\delta)\times\mathbb R$ also implies $|g(r,\phi)-L|<\epsilon$. Therefore $V=[0,\delta)\times\mathbb R$ has the desired properties.

1
On

A layman explanation is that the limit exists if and only if you approach the same value no matter which path you take to approach the origin.

A particle on any path which approaches the origin necessarily needs to eventually be constrained in a circle of arbitrarily small radius $r$ centred at the origin. So essentially, with polar coordinates you're "considering" all such possible paths, since you're taking the limit as $r\rightarrow 0$.

2
On

The right view is not to consider a (or all) "path(s)" leading to $(0,0)$, but the values of the function "near" $(0,0)$ : saying that $\lim_{(x,y)\to(0,0)} f(x,y)=0$ is equivalent to saying $$(\forall\epsilon>0)(\exists\alpha>0) ((||(x,y)-(0,0)||<\alpha)\Longrightarrow (\left|f(x,y)-f(0,0)\right|<\epsilon))$$ for a certain norm $||.||$ of $\mathbb R^2$.

Using polar coordinates is just using the norm $$||(x,y)||_2=\sqrt{x^2+y^2}$$ And the right way to write it is : $$\left|f(x,y)-0\right| = \left|r\cos^2\theta\sin\theta\right|\le r$$ which permits you to use $\alpha=\epsilon$ in the definition (I mean, don't use $\lim_{(x,y)\to(0,0)}$ before you "know" the limit exists).

11
On

$\newcommand{\Reals}{\mathbf{R}}\newcommand{\eps}{\varepsilon}$Let $f$ be a function defined in some deleted neighborhood $U$ of the origin in $\Reals^{2}$, and let $L$ be a real number. The following are equivalent:

  1. $\displaystyle \lim_{(x, y) \to (0, 0)} f(x, y) = L$.

  2. For every $\eps > 0$, there exists a $\delta > 0$ such that if $(x, y) \in U$ and $0 < \sqrt{x^{2} + y^{2}} < \delta$, then $|f(x, y) - L| < \eps$.

  3. For every $\eps > 0$, there exists a $\delta > 0$ such that if $0 < r < \delta$ and $(r\cos\theta, r\sin\theta) \in U$ for all real $\theta$, then $|f(r\cos\theta, r\sin\theta) - L| < \eps$.

  4. $\displaystyle \lim_{r \to 0} f(r\cos\theta, r\sin\theta) = L$.

Statements 2. and 3. are equivalent for the natural geometric reason that if $(x, y) = (r\cos\theta, r\sin\theta)$, then $\sqrt{x^{2} + y^{2}} = \sqrt{r^{2}} = |r|$, and without loss of generality (e.g., replacing $\theta$ by $\theta + \pi$ if necessary) we may assume $0 < r$.

Edit: Condition 4 is (in my experience) defined by Condition 3. It would be more honest to write something like "$\displaystyle\lim_{r \to 0} f(r\cos\theta, r\sin\theta) = L$ uniformly in $\theta$"; otherwise, this use of limit notation is technically anomalous, since the limit isn't taken at one point, but in a tube neighborhood of the $\theta$-axis. (Of course, the $\theta$-axis maps to one point, the origin, under polar coordinates.)

0
On

This might help you: Try to figure out what exactly is the difference between the statements: $$\lim_{r\to0}f(r\cos\theta_0,r\sin\theta_0)=L,\,\forall \theta_0\\ \text{and}\\~\\ \lim_{(r,\theta)\to(0,\theta_0)}f(r\cos\theta,r\sin\theta)=L,\,\forall\theta_0.$$ If you know the difference between the two, then you also know why the first one is pretty much useless, and why the second one is the equivalent to 3. in Andrew D. Whang's answer (and therefore usefull).

As a hint: A drawing is always useful. Your domain looks like this:


The domain

The domain $\big\{(r,\theta)\mid (r,\theta)\in[0,\infty)\times[0,2\pi)\big\}$. The $\color{blue}{\text{blue}}$ line is the line $r=0$.


We want to exeamine the behaviour of $f(r\cos\theta,r\sin\theta)$ as the points $(r,\theta)$ get close to the blue line $r=0$, along any path.