Problematic Proof? (II)

Algebra Level 5

It is well known that z n = 1 z^{n}=1 (where n is an integer) has n n solutions evenly distributed around the unit circle. These are known as the n th n^\text{th} roots of unity.

What about when n n is not an integer? What if n n is irrational? Here is a proof that z r = 1 z^{r}=1 has an infinite number of solutions when r r is irrational. There is also a conclusion as to what that means. Can you spot a flaw?

Step:

  1. 1 = e 2 k i π 1=e^{2ki\pi} for any integer k k so z r = e 2 k i π z^{r}=e^{2ki\pi}
  2. z = e 2 k i π / r z=e^{2ki\pi/r}
  3. This argument will never be identical to a previous one as k 1 / r k 2 / r mod 1 k_{1}/r\neq k_{2}/r\text{ mod 1} for k 1 k 2 k_{1}\neq k_{2}
  4. Since the number z z could have an infinite number of distinct arguments there must be an infinite number of numbers that satisfy the equation.

Conclusion:

Every number of magnitude 1 is a solution for z z to the equation z r = 1 z^{r}=1 , r r is irrational.

Step 1 is flawed Step 4 is flawed Steps 1 to 4 are correct, but the conclution is wrong. The proof and conclusion are correct Step 2 is flawed Step 3 is flawed

This section requires Javascript.
You are seeing this because something didn't load right. We suggest you, (a) try refreshing the page, (b) enabling javascript if it is disabled on your browser and, finally, (c) loading the non-javascript version of this page . We're sorry about the hassle.

2 solutions

Mark Hennings
Mar 8, 2017

The attempted proof is incorrectly defining non-integer powers of complex numbers. The non-integer power of a complex number z z cannot be calculated for all z C z \in \mathbb{C} at a time. We define z u = e u log z z^u \; = \; e^{u \log z} and we try log z = log z + i A r g z z 0 \log z = \log|z| + i\mathrm{Arg}\,z \hspace{2cm} z \neq 0 The problem lies with the argument, since it is not uniquely defined on the whole complex plane. We need to make a conventional choice, such as demanding that π < A r g z π -\pi < \mathrm{Arg}\,z \le \pi for all z z . Other choices could be possible. It is important that a choice be made since, otherwise, the power z u z^u is multivalued. The numbers e i u θ e^{iu\theta} and e i u θ + 2 π i u e^{iu\theta + 2\pi iu} are distinct (so long as u u is not an integer), and these are equally good candidates for the u u th power of e i θ e^{i\theta} . A convention about the argument is necessary so that we know which of the (in fact infinitely many) candidate values for z u z^u to choose.

The most common convention is use what is called the principal branch of the argument . This causes us to remove the non-positive reals from the complex plane, and consider the set (called the cut plane ) C \ ( , 0 ] \mathbb{C} \backslash (-\infty,0] . Every z z in this set has a unique representation in the form z = r e i θ r > 0 , π < θ < π z \; = \; re^{i\theta} \hspace{2cm} r > 0\;,\; -\pi < \theta < \pi We can then define log z = ln r + i θ \log z \; = \; \ln r + i\theta and hence define z u z^u for any complex u u , but only for complex numbers in the cut plane .

It just so happens that this care is not necessary when u u is an integer.

Before we can talk about a function like z r z^r where r r is irrational, therefore, we need to choose an appropriate branch of the argument. Suppose for the sake of argument that we have chosen the principal branch of the argument. The problem then arises at S t e p 2 \boxed{\mathrm{Step\; 2}} . We can only go from Step 1 to Step 2 by saying that z = e 2 π i k r z = e^{\frac{2\pi ik}{r}} for such integers k k for which 2 k r < 1 \big|\tfrac{2k}{r}\big| < 1 , so that k < 1 2 r |k| < \tfrac12|r| .

To see why, suppose that k k was an integer with k 1 2 r |k| \ge \tfrac12|r| , and consider the complex number z = e 2 π i k r z = e^{\frac{2\pi ik}{r}} . To calculate z r z^r , we would have to find an integer m m such that 1 2 < k r m < 1 2 -\tfrac12 < \tfrac{k}{r} - m < \tfrac12 , and regard z z in the form z = e 2 π i ( k r m ) z = e^{2\pi i(\frac{k}{r}-m)} with π < 2 π ( k r m ) < π -\pi < 2\pi(\tfrac{k}{r} - m) < \pi , so that we could then define z r z^r , but then z r = e 2 π i ( k m r ) = e 2 π i m r 1 z^r \; = \; e^{2\pi i(k- mr)} \; = \; e^{-2\pi imr} \neq 1

If we chose a different branch of the argument, we would get a different range of appropriate k k for any r r , but there would still only be a finite number of them.

I believe I have an alternate way of dealing with the problem that does not require defining a branch of a logarithm. (I should clarify first however that the nonpositive reals are not "removed" from the complex plane when defining the principal logarithm, as you claim. It is simply the case that it is discontinuous (and, hence, not analytic) along the branch cut. At least this is the case from what I've seen. You may want to correct that...) I claim that step 2 can be corrected by simply adding 'there exists an integer k such that'. Here is my argument. I am aware that there are some missing details, but anyone can feel free to fill them in if they like. Also, feel free to alert me if you think there is a mistake. And if anyone doesn't think r r can be multiplied directly by the exponent in the way in which I do it, then please provide a counterexample that proves my method is incorrect. Otherwise, I don't wish to prove it's valid... But I certainly would not frown upon you proving it valid yourself. I'm just simply aware that it produces no error in the result. Ok, to the fun... Since z r = 1 z^r = 1 , z z is a unit complex number. Hence, let t t be a real number such that z = e i t z=e^{it} . Considering all angles, x=A, coterminal to t gives all representations of z z in the form e i x e^{ix} . So, trivially, n Z z = e i ( t + 2 n π ) \bigvee_{n\in \mathbb{Z}} z = e^{i(t+2n\pi)} . Raising both sides to the r r power in each equation, and noting z r = 1 = e 2 π i k z^r=1=e^{2\pi i k} for any integer k k , I get n , k Z e i r ( t + 2 n π ) = e 2 π i k \bigvee_{n,k\in \mathbb{Z}} e^{ir(t+2n\pi)}=e^{2\pi i k} , which implies n , m , k Z i r ( t + 2 n π ) + 2 π i m = 2 π i k \bigvee_{n,m,k\in \mathbb{Z}} ir(t+2n\pi)+2\pi i m=2\pi i k . But since I can subtract 2 π i m 2\pi i m on both sides to obtain 2 π i ( k m ) 2\pi i (k-m) on the RHS, and k - m is an integer, this is equivalent to (also dividing by r r ) n , k Z i ( t + 2 n π ) = 2 π i k r \bigvee_{n,k\in \mathbb{Z}} i(t+2n\pi)=\frac{2\pi i k}{r} . Exponentiating both sides leads to z = e 2 π i k r z=e^{\frac{2\pi i k}{r}} for some k. Cheers? (I realize the 2 n π 2n\pi was not necessary but I included it in there anyway.)

James Wilson - 3 years, 8 months ago

Log in to reply

You are wrong about the cut plane. Yes you can define π A r g z π -\pi \le \mathrm{Arg}\,z \le \pi and handle all nonzero complex numbers, but you end up with a function which is not differentiable (you cannot differentiate it on the nonpositive reals). The standard aim with these complex functions is to have analytic (differentiable) ones, so that we can apply the big guns of complex number theory, which means that they have to be defined on open (connected) domains. Thus the cut plane is as I have said.

You are saying that a (nonzero) complex number is the collection of all possible arguments that could be used. In another notation, you are identifying a complex number with the set { ( r , θ + 2 n π ) : n Z } \{ (r,\theta + 2n\pi) \,:\, n \in \mathbb{Z}\} OK, this is like sticking a pin though the Riemann surface, and identifying all points that are "vertically" above each other.

The problem comes when taking powers. You can integer powers, saying that { ( r , θ + 2 n π ) : n Z } m = { ( r m , m θ + 2 n π ) : n Z } \{ (r,\theta + 2n\pi) \,:\, n \in \mathbb{Z}\}^m \; = \; \{ (r^m,m\theta + 2n\pi) \,:\, n \in \mathbb{Z}\} but that does not work for noninteger, let alone complex, powers. Since { ( r , θ + 2 n π ) : n Z } a n d { ( r , θ + 2 π + 2 n π ) : n Z } \{ (r,\theta + 2n\pi) \,:\, n \in \mathbb{Z}\} \hspace{1cm} \mathrm{and} \hspace{1cm} \{ (r,\theta + 2\pi + 2n\pi) \,:\, n \in \mathbb{Z}\} are the same complex numbers (in your sense), their "square roots" { ( r , θ + 2 n π ) : n Z } 1 2 = { ( r , 1 2 θ + 2 n π ) : n Z } { ( r , θ + 2 π + 2 n π ) : n Z } 1 2 = { ( r , 1 2 θ + π + 2 n π ) : n Z } \begin{aligned} \{ (r,\theta + 2n\pi) \,:\, n \in \mathbb{Z}\}^{\frac12} & = \; \{ (\sqrt{r},\tfrac12\theta + 2n\pi) \,:\, n \in \mathbb{Z}\} \\ \{ (r,\theta + 2\pi + 2n\pi) \,:\, n \in \mathbb{Z}\}^{\frac12} & = \; \{ (\sqrt{r},\tfrac12\theta + \pi + 2n\pi) \,:\, n \in \mathbb{Z}\} \end{aligned} should be the same, which they are not.

Mark Hennings - 3 years, 8 months ago

Log in to reply

What you are implying is that there is no principal logarithm of negative real numbers? I have never seen this to be the case. Using π < A r g ( z ) < = π -\pi<Arg(z)<=\pi as is used in Fundamentals of Complex Analysis by E. B. Saff and A. D. Snider, the logarithm of a negative real number r is L o g r + i π Log|r| + i\pi .

James Wilson - 3 years, 8 months ago

Log in to reply

@James Wilson If you look at that book, you will note that it discusses the cut plane D D^\star , just as I have defined it, and points out the analyticity there. How to define the argument on the negative reals (should it be π \pi or π -\pi ?) is a matter of convention, and therefore of dubious value.

You can try to define multi-valued functions, but generally end up having to make a specific choice to get anywhere useful (like calculate integrals). As I have shown above, taking anything but integer powers of multivalued functions is fraught with problems.

Mark Hennings - 3 years, 8 months ago

Log in to reply

@Mark Hennings Uh, I'm just quoting you here. Maybe I misunderstood? "and hence define z u z^u for any complex u u , but only for complex numbers in the cut plane." According to my understanding, z u z^u can be defined for the entire plane. I do not see any reason why this could not be so.

James Wilson - 3 years, 8 months ago

Log in to reply

@James Wilson It depends on whether you want to be able to define the logarithm and powers on the whole plane, but in a manner which is not analytic, or whether you want to be able to define analytic logarithms and powers. The power of complex analysis lies in the remarkable properties of analytic functions. Perhaps I did not make it clear in my original proof that I was looking for an analytic function.

Mark Hennings - 3 years, 8 months ago

Log in to reply

@Mark Hennings Ah ok. Context is key.

James Wilson - 3 years, 8 months ago

@Mark Hennings By the way, I essentially used the formula (1) on page 133 (if you have the book). (with t possibly replaced by Arg(z))

James Wilson - 3 years, 8 months ago

@Mark Hennings Gotcha on the "(should it be π \pi or π -\pi )" thing. I thought you were leaving the logarithm undefined for A r g ( z ) = π Arg(z)=\pi .

James Wilson - 3 years, 8 months ago

Log in to reply

@James Wilson I am. Once the nonpositive reals are removed, there is no nonuniqueness in the definition of the argument, and the resulting logarithm is analytic. My point was that extending this definition to include the nonpositive reals is arbitrary, and therefore not helpful.

Mark Hennings - 3 years, 8 months ago

Log in to reply

@Mark Hennings Oh ok. That's true. I know it's indeed not very helpful in this case, as we would never have to consider a negative real number.

James Wilson - 3 years, 8 months ago

I appreciate your patience in responding to me, but what you did with the 1/2 does not reflect the same thing I did. I multiplied r r by the entire exponent. In my case they would both be the same.

James Wilson - 3 years, 8 months ago

Log in to reply

@James Wilson But if you do that, you end up with a different type of "number", with { ( r , θ + 2 n π ) : n Z } 1 2 = { ( r , 1 2 θ + n π : n Z } \{(r,\theta + 2n\pi) : n \in \mathbb{Z}\}^{\frac12} \; = \; \{(\sqrt{r},\tfrac12\theta + n\pi : n \in \mathbb{Z}\} which is not a complex number as you want to represent it (you are adjusting the argument by multiples of π \pi and not 2 π 2\pi ). If anything, the last represents all the modulus/argument options for both z \sqrt{z} and z -\sqrt{z} . Are you now asking that these complex numbers should actually be families of as many complex numbers as you want at a time?

Mark Hennings - 3 years, 8 months ago

Log in to reply

@Mark Hennings I mean each do just represent a set of complex numbers. That doesn't seem like it should be out of the ordinary to me.

James Wilson - 3 years, 8 months ago

Log in to reply

@James Wilson You would need to formulate this notion of complex number axiomatically and in detail, and show that it worked. I am worried about the idea that complex numbers could be collections of complex numbers when it is convenient. We would therefore have 1 1 , 1 -1 and { 1 , 1 } \{1,-1\} as different complex numbers?

So you are defining operators on sets of complex numbers. Consider the square root. For f ( z ) = z 2 f(z) = z^2 , you are defining the square root of a set A A as the inverse image f 1 ( A ) f^{-1}(A) of A A . This formalism would be challenging to implement for all functions.

Mark Hennings - 3 years, 8 months ago

Log in to reply

@Mark Hennings Well, actually, not in the proof I gave. Like I said, it essentially uses formula (1) on page 133 of Saff, Snider (i.e. z r = e r L o g z + i a r g ( z ) z^r=e^{rLog|z|+iarg(z)} ), and the simple property that e i x = e i y i x = i y + 2 π i k e^{ix}=e^{iy}\Rightarrow ix = iy + 2\pi i k .

James Wilson - 3 years, 8 months ago

Log in to reply

@James Wilson I point you to the final 2 paragraphs of my original proof. If you want to be Saff & Snider, replace 1 2 < k r m < 1 2 -\tfrac12 < \tfrac{k}{r} - m < \tfrac12 with 1 2 < k r m 1 2 -\tfrac12 < \tfrac{k}{r} - m \le \tfrac12 . You still end up with problems.

Mark Hennings - 3 years, 8 months ago

Log in to reply

@Mark Hennings I can't seem to detect any problems. I can just let t = A r g ( z ) t=Arg(z) , and so z r = e r ( L o g ( 1 ) + i t + 2 π i n ) z^r=e^{r(Log(1)+it+2\pi i n)} from Saff and Snider, page 132 (I meant page 132 btw, not 133). Then, because z r = 1 = e 2 π i k z^r=1=e^{2\pi i k} , r ( i t + 2 π i n ) + 2 π i m = 2 π i k r(it+2\pi i n) + 2\pi i m =2\pi i k .

James Wilson - 3 years, 8 months ago

Log in to reply

@James Wilson I still don't have to define a branch of the logarithm because I can just use the multiple-valued logarithm...

James Wilson - 3 years, 8 months ago

@James Wilson I am just curious what you are saying is wrong with my technique. I'm obviously not seeing it.

James Wilson - 3 years, 8 months ago

Log in to reply

@James Wilson The problem is that numbers of the form e 2 π k i r e^{\frac{2\pi ki}{r}} could be r r th roots of unity, but for different definitions of the r r th root. For example, if the definition of argument is principal, then e 2 π k i r e^{\frac{2\pi ki}{r}} will be a viable r r th root of unity provided that 1 2 r < k 1 2 r -\tfrac12r < k \le \tfrac12r . This means that there only finitely many r r th roots of unity for this definition. If you elected for a definition of argument that said that 29 π < A r g z 31 π 29\pi < \mathrm{Arg}\,z \le 31\pi for all z z , a different collection of values of k k would step up to the plate.

It is true that an infinite number of complex numbers can be r r th roots of unity, since there are infinitely many choices for the r r th root function. That does not mean that infinitely many complex numbers are r r th roots of unity.

Mark Hennings - 3 years, 8 months ago

Log in to reply

@Mark Hennings Ok. Well, I'm using the division symbol to mean division. So that 2 π k i r \frac{2\pi k i}{r} is just a number in the domain of the function e z e^z .

James Wilson - 3 years, 8 months ago

Log in to reply

@James Wilson I presume we were looking for all the solutions to z r = 1 z^r=1 . And we can raise e to any complex number power. If 2 π k i r \frac{2\pi k i}{r} is not a principle angle, it still has no effect on what the value of e 2 π k i r e^{\frac{2\pi k i}{r}} is. There is no need to ensure it is a principle angle. I don't just like to argue for fun by the way. I just like to be honest with my thought processes.

James Wilson - 3 years, 8 months ago

Log in to reply

@James Wilson I will have one last go, and then stop. To be able to define z r z^r uniquely for a complex number z z , we have to decide on a specific convention for the argument. A function is something that takes elements of its domain and assigns each to a uniquely defined value. To define z r z^r uniquely in this way we need to define the argument of z z exactly (such as requiring it to lie in the interval ( π , π ] (-\pi,\pi] ). This means that you do need to ensure that 2 π k r \tfrac{2\pi k}{r} is a principal angle (if we have adopted that convention), since otherwise we cannot apply the function as we have defined it.

A great deal of confusion is caused by the term multi-valued function, as this exchange proves. By definition, a function cannot take multiple values. When we say that the logarithm is multivalued, we really mean that there are many possible different definitions, all of which have slightly different consequences. However, it is not possible, as we interpret complex numbers, for all the definitions to be in play at the same time.

There is a theory of multivalued functions, which are properly interpreted as functions mapping each point in a domain A A to some subset of a set B B . You can, in suitable circumstances, assign a topology to the collection 2 B 2^B of subsets of B B , and so discuss the idea of continuous multifunctions - in other words, continuous functions from A A to 2 B 2^B . This is not what is going on here.

Mark Hennings - 3 years, 8 months ago

Log in to reply

@Mark Hennings Ok, so, I gather what you're saying is that me saying "raising both sides to the r r power in each equation" was wrong. Instead I should've said "multiplying each exponent by r r ." Indeed that was the exact part with the hole I was referring to in my original post. It's valid, but I can't provide proper justification.

James Wilson - 3 years, 8 months ago

Log in to reply

@James Wilson Perhaps it wasn't clear what I was talking about

James Wilson - 3 years, 8 months ago

Log in to reply

@James Wilson There's also one other hole, but that one should be able to be easily filled...

James Wilson - 3 years, 8 months ago

Log in to reply

@James Wilson I'll try and work on filling these two main gaps, and see where I get. Then I'll rewrite it.

James Wilson - 3 years, 8 months ago

@James Wilson My approach was designed not to use any multi-valued functions by the way. Also, if z r z^r is taken to be multi-valued, then I'm going to assume that, e.g., in the equation z r = x z^r=x it means y = x y=x for some y y in the set of values of z r z^r . If not, we could say the error was in step 1...

James Wilson - 3 years, 8 months ago

Log in to reply

@James Wilson Maybe the question should be clear on how it's defining z r z^r : your way or the other way

James Wilson - 3 years, 8 months ago

You really love math. I can tell.

James Wilson - 3 years, 8 months ago

Log in to reply

Yes I do. I have my life studying, learning and teaching it, and it is still exciting.

Mark Hennings - 3 years, 8 months ago

Me too. But I don't put as much effort into it as I should.

James Wilson - 3 years, 8 months ago

The proof is absolutely correct: there are an infinite number of solutions. However this does not mean that every number with magnitude 1 is a solution. There are literally more numbers on that circle (uncountably infinitely many) than there are natural numbers (countably infinitely many). This does mean that a number arbitrarily close to another number (magnitude one) is a solution, but not necessarily the number itself.

As an explicit example take z 1 / 2 = 1 z^{1/\sqrt{2}}=1 . -1 is not a solution to this problem as there is no value of k such that 2 2 k 2\sqrt{2}k =1 mod 2

It is not entirely clear what "proof" and "conclusion" refers to. IE Is the conclusion part of the proof?

Can you clean up the phrasing of the problem?

Calvin Lin Staff - 4 years, 3 months ago

But as i^4=1; by applying fourth root on both sides we get i=1 . Which is wrong .. so I conclude that step 2 is wrong

Sudhamsh Suraj - 4 years, 3 months ago

Log in to reply

Sorry I understood it is actually e^2inπ

Sudhamsh Suraj - 4 years, 3 months ago

Can you please explain in some detail?I can't understand the part

"However this does not mean that every number with magnitude 1 is a solution. There are literally more numbers on that circle (uncountably infinitely many) than there are natural numbers (countably infinitely many). This does mean that a number arbitrarily close to another number (magnitude one) is a solution, but not necessarily the number itself."

Thanks!

Harsh Shrivastava - 4 years, 3 months ago

Log in to reply

Are you familiar with the idea that some infinities are bigger than others? (proof here is you're interested https://en.wikipedia.org/wiki/Cantor's diagonal argument) Anyway there are more numbers around the circle because the argument can vary continuously over all irrational numbers (between -pi and pi at least). This type of infinity is larger than the infinity that refers to counting numbers. For this reason you could pick a value on the circle that cannot be achieved even though the argument can take an infinite number of values.

I'm probably not explaining it very well but there is a lot of information available on the topic

William Whitehouse - 4 years, 3 months ago

0 pending reports

×

Problem Loading...

Note Loading...

Set Loading...