What are the Laws of Rational Exponents?

17k Views Asked by At

On Math SE, I've seen several questions which relate to the following. By abusing the laws of exponents for rational exponents, one can come up with any number of apparent paradoxes, in which a number seems to be shown as equal to its opposite (negative). Possibly the most concise example:

$-1 = (-1)^1 = (-1)^\frac{2}{2} = (-1)^{2 \cdot \frac{1}{2}} = ((-1)^2)^\frac{1}{2} = (1)^\frac{1}{2} = \sqrt{1} = 1$

Of the seven equalities in this statement, I'm embarrassed to say that I'm not totally sure which one is incorrect. Restricting the discussion to real numbers and rational exponents, we can look at some college algebra/precalculus books and find definitions like the following (here, Ratti & McWaters, Precalculus: a right triangle approach, section P.6):

Ratti's definition of rational exponents Ratti's properties of rational exponents

The thing that looks the most suspect in my example above is the 4th equality, $(-1)^{2 \cdot \frac{1}{2}} = ((-1)^2)^\frac{1}{2}$, which seems to violate the spirit of Ratti's definition of rational exponents ("no common factors")... but technically, that translation from rational exponent to radical expression was not used at this point. Rather, we're still only manipulating rational exponents, which seems fully compliant with Ratti's 2nd property: $(a^r)^s = a^{rs}$, where indeed "all of the expressions used are defined". The rational-exponent-to-radical-expression switch (via the rational exponent definition) doesn't actually happen until the 6th equality, $(1)^\frac{1}{2} = \sqrt{1}$, and that seems to undeniably be a true statement. So I'm a bit stumped at exactly where the falsehood lies.

We can find effectively identical definitions in other books. For example, in Sullivan's College Algebra, his definition is (sec. R.8): "If $a$ is a real number and $m$ and $n$ are integers containing no common factors, with $n \ge 2$, then: $a^\frac{m}{n} = \sqrt[n]{a^m} = (\sqrt[n]{a})^m$, provided that $\sqrt[n]{a}$ exists"; and he briefly states that "the Laws of Exponents hold for rational exponents", but all examples are restricted to positive variables only. OpenStax College Algebra does the same (sec. 1.3): "In these cases, the exponent must be a fraction in lowest terms... All of the properties of exponents that we learned for integer exponents also hold for rational exponents."

So what exactly are the restrictions on the Laws of Exponents in the real-number context, with rational exponents? As one example, is there a reason missing from the texts above why $(-1)^{2 \cdot \frac{1}{2}} = ((-1)^2)^\frac{1}{2}$ is a false statement, or is it one of the other equalities that fails?


Edit: Some literature that discusses this issue:

  • Goel, Sudhir K., and Michael S. Robillard. "The Equation: $-2 = (-8)^\frac{1}{3} = (-8)^\frac{2}{6} = [(-8)^2]^\frac{1}{6} = 2$." Educational Studies in Mathematics 33.3 (1997): 319-320.

  • Tirosh, Dina, and Ruhama Even. "To define or not to define: The case of $(-8)^\frac{1}{3}$." Educational Studies in Mathematics 33.3 (1997): 321-330.

  • Choi, Younggi, and Jonghoon Do. "Equality Involved in 0.999... and $(-8)^\frac{1}{3}$" For the Learning of Mathematics 25.3 (2005): 13-36.

  • Woo, Jeongho, and Jaehoon Yim. "Revisiting 0.999... and $(-8)^\frac{1}{3}$ in School Mathematics from the Perspective of the Algebraic Permanence Principle." For the Learning of Mathematics 28.2 (2008): 11-16.

  • Gómez, Bernardo, and Carmen Buhlea. "The ambiguity of the sign √." Proceedings of the Sixth Congress of the European Society for Research in Mathematics Education. 2009.

  • Gómez, Bernardo. "Historical conflicts and subtleties with the √ sign in textbooks." 6th European Summer University on the History and Epistemology in Mathematics Education. HPM: Vienna University of Technology, Vienna, Austria (2010).

10

There are 10 best solutions below

4
On

The issue is that $a^{\frac{1}{n}}$ is multivalued. You could arguably simplify the first calculation into $1 = \sqrt{1} = -1$. Taking different branch cuts is how the "paradox" arises.

Essentially, in the context of the reals (or even the complex numbers) $\sqrt{a}$ is one name for two functions, say $\sqrt[+]{a^2} = a$ and $\sqrt[-]{a^2} = -a$. All the laws are fine as long as you remain consistent with your choice. (Alternatively, by moving to a Riemann surface you don't have to make and track a choice... well, you have to decide when and how you are going to embed your reals into the Riemann surface, but once you do, no more choices.)

Whenever square roots entered the picture — you can say at $-1 = (-1)^{\frac{2}{2}}$ or at $((-1)^2)^{\frac{1}{2}}$ — it explicitly chose, going from left-to-right, the non-standard choice of $a^{\frac{1}{2}} = \sqrt[-]{a}$. If it chose the standard choice which it uses later on then, $-1 = -(-1)^{\frac{2}{2}} = -((-1)^2)^{\frac{1}{2}}$ and everything would work out. If it was consistent with the choice of $\sqrt[-]{}$ then $\sqrt{1} = \sqrt[-]{1} = -1$ would also have led to a correct result.

Moving my comment to the answer, a crucial source of confusion is that the definition of $a^{\frac{m}{n}}$ is not a well-defined function of the rationals in that it doesn't respect equality of the rationals. This is witnessed by the need for $\frac{m}{n}$ to be in lowest terms, and, relevant here, the fact that $1 = \frac{n}{n}$ does not imply $a^1 = a^{\frac{n}{n}}$. In fact, the ill-definedness of the provided definition of $a^\frac{m}{n}$ is entirely reduced to the question of what $a^\frac{n}{n}$ is.

So to put it in terms of rules: all the rules are valid, what's invalid is cancelling common factors in a "rational" exponent because the exponents aren't actually rational numbers.

22
On

You have put your finger precisely on the statement that is incorrect.

There are two competing conventions with regard to rational exponents.

The first convention is to define the symbol $a^x$ for $a > 0$ only. The symbol $\sqrt[n]{a}$ is defined for negative values of $a$ so long as $n$ is odd, but according to this convention, one wouldn't write $a^{1/n}$, for instance.

In defining $a^{p/q}$ to be $(\sqrt[q]{a})^p$, the author you quoted chose the fraction $p/q$ to be in lowest form so that the definition would be unambiguous. For example, $a^{10/15}$ is defined to be $(\sqrt[3]{a})^2$. However, it is preferable to define $a^{p/q}$ to be $(\sqrt[q]{a})^p$ in all cases and to prove that this definition is independent of the particular representation chosen for $p/q$; this is what more rigorous books tend to do. That is, you prove that if $p/q = r/s$, then $(\sqrt[q]{a})^p = (\sqrt[s]{a})^r$. There is no mention of lowest form.

The competing convention is to also allow $a^x$ to be defined for all $a \ne 0$ and all rational numbers $x = p/q$ that have at least one representation with an odd denominator. You then prove that $(\sqrt[q]{a})^p$ is independent of the particular representation $p/q$ chosen, so long as the denominator is odd. Thus you can write $a^{3/5} = (\sqrt[5]{a})^3 = (\sqrt[15]{a})^{9} = a^{9/15}$. All of that is fine. However, you cannot write $a^{6/10} = (\sqrt[10]{a})^6$, or even $a^{6/10} = \sqrt[10]{a^6}$. The number $a^{6/10}$ is well-defined, but to write down its definition, you must first select a fraction equivalent to $6/10$ that has an odd denominator, which could be $3/5$ or $9/15$ or something else. For $a^{1/2}$, this can't be done at all, so $a^{1/2}$ is undefined for $a < 0$.

The rules for exponents break down if you start allowing $a < 0$ and exponents that can't be written with an odd denominator. For example, the rule $a^{xy} = (a^x)^y$ is valid, but only so long as $x$ and $y$ are both rational numbers that can be written with an odd denominator. This is not the case if you write $a^1 = (a^2)^{1/2}$, despite the fact that both sides of the equation are defined since $a^2 > 0$.

Edit Reading the paper by Tirosh and Even, I was surprised to learn this matter has drawn serious attention from math educators.

A long time ago, I assumed that, apart from complex extensions, $a^x$ for non-integer $x$ should be defined only for $a > 0$. I reasoned that it made no sense to have a function $(-2)^x$ defined only for rational numbers $x$ with odd denominator. I objected strenuously to notations like $(-8)^{1/3}$.

But that was before I taught a calculus class, which is when I realized why some textbook authors are so happy to define $a^x$ for $a < 0$, following the second convention. The reason is that the formula $\frac{d}{dx}(x^r) = rx^{r-1}$ is perfectly valid for $x < 0$ and $r$ with odd denominator.

17
On

Edit: I'm now disavowing this answer, but keeping it here to maintain the comments. Consider my newer answer elsewhere for, I think, a better treatment.


I would currently argue that it's the 2nd equality in the example, $(-1)^1 = (-1)^\frac{2}{2}$, that contains the initial error. It's illegitimate to write $(-1)^\frac{2}{2}$ because that's an undefined expression, although the exact reason depends on which of two common definitions of $a^\frac{m}{n}$ is in use in a given textbook:

  • Some books at the college-algebra level define rational exponents for any real number $a$, but only for rational exponents written in lowest terms (i.e., $m$ and $n$ with no common factors). In this case, $(-1)^\frac{2}{2}$ in invalid because of the common factor in the exponent.

  • Other books, including those at the real analysis level, define rational exponents only for nonnegative real $a$ (i.e., $a \ge 0$), but permit any $m$ and $n$ regardless of common factors. In this case, $(-1)^\frac{2}{2}$ is undefined because of the negative base.

In summary: for $a > 0$, any $(-a)^\frac{n}{n}$ is a meaningless piece of writing. The way that this expression violates the definition of a rational exponent depends on the exact definition in use.

13
On

$-1 = (-1)^1 = (-1)^\frac{2}{2} = (-1)^{2 \cdot \frac{1}{2}} = ((-1)^2)^\frac{1}{2} = (1)^\frac{1}{2} = \sqrt{1} = 1$

The thing that looks the most suspect in my example above is the 4th equality, $(-1)^{2 \cdot \frac{1}{2}} = ((-1)^2)^\frac{1}{2}$, which seems to violate the spirit of Ratti's definition of rational exponents ("no common factors")... but technically, that translation from rational exponent to radical expression was not used as this point.

The 4th equality is indeed suspect, but not for the reason you suggest. It is an application the 2nd property of rational exponents that you list above:

If $r$ and $s$ are rational numbers and $a$ is a real number, then we have: $$(a^r)^s = a^{r\cdot s}$$

provided that all expressions used are defined.

More formally and less ambiguous would be:

$$\forall r,s \in \mathbb{Q}\colon \forall a \in \mathbb{R}\colon [a^r\in \mathbb{R} \land a^s\in \mathbb{R} \implies (a^r)^s=a^{r\cdot s}]$$

This statement makes it clear that we cannot infer $((-1)^2)^\frac{1}{2}=(-1)^{2 \times \frac{1}{2}}$ as in the "paradox" because $(-1)^\frac{1}{2} \notin \mathbb R$, i.e. because $(-1)^\frac{1}{2}$ is not defined.

That both restrictions are necessary can be seen from the fact that we must have $a^{r\cdot s}=a^{s\cdot r}=(a^s)^r=(a^r)^s$. If we had $a^s \notin \mathbb{R}$, we could not make this substitution.

With this in mind, we could restate the rule as follows:

$$\forall r,s \in \mathbb{Q}\colon \forall a \in \mathbb{R}\colon [a^r\in \mathbb{R} \land a^s\in \mathbb{R} \implies a^{r\cdot s}=(a^r)^s=(a^s)^r]$$


Though it has nothing to do with resolving the paradox, we may also need to define $x^\frac{1}{n}$ as follows:

$\forall x,y\in \mathbb{R}\colon\forall n\in \mathbb{N}\colon [Odd(n)\lor Even(n) \land n\neq 0 \land y\geq 0\implies [x^\frac{1}{n} =y\iff x=y^n ]]$

Using this rule, we could infer that $4^\frac{1}{2}=2$, but not $4^\frac{1}{2}=-2$.


BTW, as far as $\frac{m}{n}$ having to be in lowest terms, the definition given seems a bit sloppy. It cannot be, for example, that $4^\frac{2}{4}$ is undefined when $4^\frac{2}{4}= 4^\frac{1}{2}$ by substitution of $\frac{2}{4}=\frac{1}{2}$. I really don't think this notion can be source of the paradox.

2
On

$(a^r)^s=a^{rs}$ can indeed be false for $a<0$, as shown by your example.

You can "rescue" this rule by stating instead "$(a^r)^s=a^{rs}=(a^s)^r$, provided all three expressions are defined". (As the product is commutative, you cannot really distinguish $r$ and $s$.)

0
On

I think that the fourth equality introduces the error, since $a^{rs}=(a^r)^s$ may not hold if considering complex numbers. For example, \begin{align} -1=e^{i\pi}=e^{2i\pi \cdot \frac{1}{2}} \neq (e^{2i\pi})^{\frac{1}{2}}=1. \end{align}

3
On

No continuous definition of $a^r$ can be made for all real $a$ and $r$; and likewise, the familiar properties of exponents cannot be extended consistently to all real bases and powers. As a result, there are a number of competing definitions for $a^r$ for non-integer values $r$, depending on how much the author wishes to extend these properties, and in what direction. Here are some things that we can positively say for an identity like $(a^r)^s = a^{rs}$:

  • It is true for all natural numbers $r$ and $s$, and all real numbers $a$. [1]
  • It is true for all integers $r$ and $s$, and all nonzero reals $a$.
  • It is true for all real $r$ and $s$, and all positive reals $a$.

Note that the more permissive we are with $r$ and $s$, the more restrictions we must place on $a$. Some authors do further extend the real-valued definition of $a^r$ (and hence related properties) to negative real $a$'s and non-integer rationals $r$ (while others do not); but this is a fairly fragile definition, in that to be well-defined it requires that $r = m/n$ be written with an odd value for $n$ (books in this vein usually specify that it be in lowest terms). Among the greatest problems with such an approach is that a real-valued “principal $n$th root” will give contradictory results to the complex-valued “principal $n$th root” for negative bases. For example, if a real-valued definition is given, then $(-8)^{1/3} = -2$; but by the standard complex-valued definition, $(-8)^{1/3} = 1 + \sqrt{3}i$. This seems to create some confusion when discussing the issue across different contexts. Arguably it would be best to refrain from that very limited extension in reals, so as to not conflict with the more general complex-valued definition. (See the cited articles in the question above for some published debates on the wisdom of using such a real-valued definition for negative bases and non-integer exponents.)

Regarding the example in the question, most everyone agrees that $(-1)^{2 \cdot \frac{1}{2}} \ne ((-1)^2)^\frac{1}{2}$, if both sides are simplified in the standard order of operations; and this highlights the fact that the identity $(a^r)^s$ = $a^{rs}$ is not true unrestrictedly. Exactly what restrictions need to be honored depend on the definitions in use in a particular textbook. For Ratti, we might rescue the presentation by interpreting the clause “provided that all of the expressions used are defined” in the broad sense of every expression inside the box (not just the one identity being used), and since $a^s$ appears in other places in the box, and $(-1)^\frac{1}{2}$ is certainly undefined in real numbers, then the assertion $((-1)^2)^\frac{1}{2} = (-1)^{2 \cdot \frac{1}{2}}$ (the 4th equality) would thereby be proscribed.

[1]: And more generally for $a$ an element of any ring.

4
On

While I agree with everything in the answer by David, I'll give a different answer here just to put a different emphasis.

The fundamental error is to put the rule $(a^r)^s=a^{rs}$ in the box governed by the condition provided that all the expressions used are defined. That is not the right kind of condition for this rule, it requires specific limitations to the values of $a,r,s$. In this particular context ($a\neq0$ real and $\def\Q{\Bbb Q}r,s\in\Q$), the condition should be:

either $a>0$ or both $r$ and $s$ lie in the valuation ring $\def\Z{\Bbb Z}\Z_{(2)}$, the subring of $\Q$ of numbers that can be represented with an odd denominator.

Note that this condition ensures that both expressions are defined and that they are equal. Note also that these conditions are identical to those under which the powers $a^r$ and $a^s$ are both defined. However, neither of the expressions in the rule involves$~a^s$, so the conditions are not implied by "all expressions used in the rule are defined".


I am not a partisan of defining (certain) non-integer rational powers of negative numbers at all; it is of very little use, and if one wants to study the function $x\mapsto\sqrt[3]{x^2}$ on all of $\def\R{\Bbb R}\R$, there is not much against having to write just that, or $x\mapsto|x|^{2/3}$, rather than $x^{2/3}$. But if one does choose to go that way, I would suggest restating the definition as follows:

For $a\in\R_{\neq0}$ and $r\in\Q$, the power $a^r$ is defined provided that either $a>0$ or $r\in\Z_{(2)}$ (or both); in the former case one has $a^{m/n}=\sqrt[n]{a^m}=(\sqrt[n]a)^m$ for any fraction $m/n$ representing $r$, while in the latter case one has the same identities for any fraction $m/n$ representing $r$ in which $n$ is odd.

Given that the latter case has $r\in\Z_{(2)}$, restricting to odd $n$ there is quite natural (and it is necessary).


There exists some other contexts in which one might want to state the validity of $(a^r)^s=a^{rs}$ with the proviso that all occurring expressions are defined. I can think of the following two cases:

  • Exponents $r,s\in\Z$, and for instance $\def\C{\Bbb C}a\in\C$ unrestricted (it could even be something more general, like a square matrix). Here the rule basically derives from $a^{x+y}=a^xa^y$ (with the same proviso), and some considerations about how negative exponents combine. The proviso would serve to bar negative powers of $0$, and could be replaced by the explicit condition: $r,s\in\Bbb N$, or $a$ invertible.
  • Real $a\geq0$ and real exponents $r,s$. Here the proviso is needed for the same reason as in the previous point, to avoid negative powers of $0$; with $a>0$ the rule is valid unconditionally.

But the second point hints at a generalization where again the condition "all occurring expressions are defined" is insufficient. For real $a>0$, there is no difficulty in defining $a^r$ for all $r\in\C$. However (as I've mentioned in this answer to another question), the rule $(a^r)^s=a^{rs}$ is only valid with the restriction that $\def\R{\Bbb R}r\in\R$; this is strictly stronger than the condition $a^r\in\R$ ensuring that $(a^r)^s$ is defined, but which does not make the rule valid. The validity of the rule with the given restriction is easy to prove, see here.

0
On

Well, Algebra meets Calculus and they disagree on some points, literally (unless well guided).

Algebra says, "I have a polynomial $x^n=a$ with $n$ different complex roots. ($a\neq 0$, $n$ integer). And for positive real numbers I can have a function $\sqrt[n]a$ that is positive and solves the equation"

Convenience says: "Oh, so I can write $\sqrt[n]a=a^\frac 1 n$, for real positive $a$.

Student/teacher says: "Oh, it's true for some $n$ and negative $a$, too, so I'll write things like $\sqrt[3]{-8}$ because we all know what is meant. And this is the part where confusion leaks in.

On the other part, Calculus says, "I have a function $e^z$ that behaves like an algebraic polynomial for some $z$ ($z=n\ln x$). And I want it to be holomorphic (https://en.wikipedia.org/w/index.php?title=Holomorphic_function&oldid=699948452)".

The crucial point that no-one tells you is that, in calculus, $e^z:=\exp(z)$ is seen as the well-defined function $\sum_{k=0}^{\infty}\frac{z^n}{n!}$ rather than some dubious exponentiation.

For the function and any integer $k$, $1=e^{2i\pi k}$, and fixing any $k$, you have a branch for the exponentiation such that for any $a=e^\lambda= e^{\lambda+2i\pi k}$ you get a well-defined $a^z=\sum_{k=0}^{\infty}\frac{(z\cdot(\lambda +2i\pi k))^n}{n!}$.

Still, exponentiation is only unique for a fixed branch.

0
On

enter image description here

Graphically the results look very different. Reinforcing the differences.