try ai
Popular Science
Edit
Share
Feedback
  • De Moivre's Theorem

De Moivre's Theorem

SciencePediaSciencePedia
Key Takeaways
  • De Moivre's Theorem provides a simple formula for raising a complex number to a power: raise its magnitude to the power and multiply its argument by the power.
  • The theorem fundamentally connects algebra with geometry, re-framing complex exponentiation as a process of rotation and scaling in the complex plane.
  • It is a powerful tool for deriving complex trigonometric identities and finding the n-th roots of complex numbers, transforming difficult problems into manageable algebra.
  • Its principles extend to other mathematical areas, providing a model for understanding matrix powers in linear algebra and 3D rotations via quaternions.

Introduction

Calculating high powers of complex numbers, such as (1+i3)12(1+i\sqrt{3})^{12}(1+i3​)12, using repeated algebraic multiplication is a daunting and error-prone task. This computational challenge highlights a gap in basic algebraic tools and hints at a more elegant, underlying structure. This article unveils that structure by introducing De Moivre's Theorem, a powerful formula that transforms complex exponentiation from a laborious chore into a simple, intuitive process. By exploring the theorem, you will gain a deeper understanding of the geometric nature of complex numbers and their surprising connections across various scientific disciplines.

The first section, "Principles and Mechanisms," will guide you through the shift from Cartesian to polar coordinates, revealing how complex multiplication is fundamentally an act of rotation and scaling. This geometric insight leads directly to the formulation of De Moivre's Theorem and demonstrates its power in solving complex calculations with ease. The subsequent section, "Applications and Interdisciplinary Connections," explores the theorem's far-reaching impact, showing how it serves as a master key for deriving trigonometric identities, solving polynomial equations, and even provides a conceptual blueprint for understanding matrix powers and 3D rotations in fields like engineering and computer graphics.

Principles and Mechanisms

Imagine trying to calculate (1+i3)12(1+i\sqrt{3})^{12}(1+i3​)12. If your only tool is standard algebra, you're in for a long afternoon. You'd have to multiply (1+i3)(1+i\sqrt{3})(1+i3​) by itself, then the result by (1+i3)(1+i\sqrt{3})(1+i3​) again, and so on, eleven times in total. Each step would involve the FOIL method, collecting real and imaginary parts, and hoping you don't make a small error that snowballs into a giant one. There must be a better way. And indeed, there is. The journey to this better way reveals a stunningly beautiful geometric truth at the heart of complex numbers.

The Geometric Heart of Complex Numbers

Our first move is to change our point of view. Instead of thinking of a complex number z=a+biz = a+biz=a+bi as a pair of coordinates, let's visualize it as an arrow—a vector—in the ​​complex plane​​, starting from the origin and pointing to the coordinate (a,b)(a, b)(a,b). Like any arrow, it has two defining features: its length and its direction.

The length of this arrow, which we call the ​​magnitude​​ or ​​modulus​​ and denote by ∣z∣|z|∣z∣ or rrr, is found using the Pythagorean theorem: r=∣z∣=a2+b2r = |z| = \sqrt{a^2 + b^2}r=∣z∣=a2+b2​. It tells us how "far" the number is from the origin.

The direction of the arrow is the angle it makes with the positive real axis, measured counter-clockwise. We call this angle the ​​argument​​, denoted by θ\thetaθ.

This pair of numbers, (r,θ)(r, \theta)(r,θ), is the ​​polar form​​ of the complex number. It's a different way of addressing the same point in the plane. Instead of saying "go aaa units horizontally and bbb units vertically," we say "face in the direction θ\thetaθ and walk rrr units straight ahead." Using trigonometry, we can see that a=rcos⁡θa = r\cos\thetaa=rcosθ and b=rsin⁡θb = r\sin\thetab=rsinθ. This gives us the fundamental connection:

z=a+ib=rcos⁡θ+i(rsin⁡θ)=r(cos⁡θ+isin⁡θ)z = a + ib = r\cos\theta + i(r\sin\theta) = r(\cos\theta + i\sin\theta)z=a+ib=rcosθ+i(rsinθ)=r(cosθ+isinθ)

This shift from Cartesian (a,b)(a,b)(a,b) to polar (r,θ)(r,\theta)(r,θ) coordinates is the key that unlocks everything.

The Secret Rule: Multiply Lengths, Add Angles

Now, let's see what happens when we multiply two complex numbers, z1=r1(cos⁡θ1+isin⁡θ1)z_1 = r_1(\cos\theta_1 + i\sin\theta_1)z1​=r1​(cosθ1​+isinθ1​) and z2=r2(cos⁡θ2+isin⁡θ2)z_2 = r_2(\cos\theta_2 + i\sin\theta_2)z2​=r2​(cosθ2​+isinθ2​). The algebra might look a bit messy at first, but a wonderful pattern emerges.

z1z2=[r1(cos⁡θ1+isin⁡θ1)]⋅[r2(cos⁡θ2+isin⁡θ2)]=r1r2[(cos⁡θ1cos⁡θ2−sin⁡θ1sin⁡θ2)+i(sin⁡θ1cos⁡θ2+cos⁡θ1sin⁡θ2)]\begin{align} z_1 z_2 & = [r_1(\cos\theta_1 + i\sin\theta_1)] \cdot [r_2(\cos\theta_2 + i\sin\theta_2)] \\ & = r_1 r_2 [(\cos\theta_1 \cos\theta_2 - \sin\theta_1 \sin\theta_2) + i(\sin\theta_1 \cos\theta_2 + \cos\theta_1 \sin\theta_2)] \end{align}z1​z2​​=[r1​(cosθ1​+isinθ1​)]⋅[r2​(cosθ2​+isinθ2​)]=r1​r2​[(cosθ1​cosθ2​−sinθ1​sinθ2​)+i(sinθ1​cosθ2​+cosθ1​sinθ2​)]​​

If you remember your trigonometric angle-sum identities, you'll recognize the expressions in the parentheses immediately. They are none other than cos⁡(θ1+θ2)\cos(\theta_1 + \theta_2)cos(θ1​+θ2​) and sin⁡(θ1+θ2)\sin(\theta_1 + \theta_2)sin(θ1​+θ2​)! So, the result simplifies beautifully:

z1z2=r1r2[cos⁡(θ1+θ2)+isin⁡(θ1+θ2)]z_1 z_2 = r_1 r_2 [\cos(\theta_1 + \theta_2) + i\sin(\theta_1 + \theta_2)]z1​z2​=r1​r2​[cos(θ1​+θ2​)+isin(θ1​+θ2​)]

This is a remarkable result. To multiply two complex numbers, we simply ​​multiply their magnitudes and add their arguments​​. Multiplication in the complex plane isn't just a jumble of algebra; it's a geometric operation of scaling and rotation.

This rule is made even more elegant by what is arguably the most beautiful equation in all of mathematics, ​​Euler's formula​​:

eiθ=cos⁡θ+isin⁡θe^{i\theta} = \cos\theta + i\sin\thetaeiθ=cosθ+isinθ

Using this, we can write any complex number as z=reiθz = r e^{i\theta}z=reiθ. Now look what happens to our multiplication rule:

z1z2=(r1eiθ1)(r2eiθ2)=(r1r2)ei(θ1+θ2)z_1 z_2 = (r_1 e^{i\theta_1}) (r_2 e^{i\theta_2}) = (r_1 r_2) e^{i(\theta_1 + \theta_2)}z1​z2​=(r1​eiθ1​)(r2​eiθ2​)=(r1​r2​)ei(θ1​+θ2​)

The familiar rule for exponents perfectly captures the geometric action of multiplying magnitudes and adding angles. Euler's formula provides the natural language for describing rotation.

De Moivre's Theorem: The Power of Rotation

What is raising a number to a power, if not multiplying it by itself repeatedly? If we want to compute znz^nzn, where nnn is an integer, we are just performing the multiplication operation nnn times. Applying our new rule is simple:

  • The new magnitude will be r×r×⋯×rr \times r \times \dots \times rr×r×⋯×r (nnn times), which is rnr^nrn.
  • The new argument will be θ+θ+⋯+θ\theta + \theta + \dots + \thetaθ+θ+⋯+θ (nnn times), which is nθn\thetanθ.

And so we arrive at the celebrated formula named after Abraham de Moivre:

zn=[r(cos⁡θ+isin⁡θ)]n=rn[cos⁡(nθ)+isin⁡(nθ)]z^n = [r(\cos\theta + i\sin\theta)]^n = r^n[\cos(n\theta) + i\sin(n\theta)]zn=[r(cosθ+isinθ)]n=rn[cos(nθ)+isin(nθ)]

Or, in the even more compact language of Euler's formula: (reiθ)n=rneinθ(re^{i\theta})^n = r^n e^{in\theta}(reiθ)n=rneinθ. This is ​​De Moivre's Theorem​​. It turns the laborious task of repeated multiplication into a simple act of arithmetic.

Taming Monstrous Calculations

Let's return to our original problem. How can we calculate (3−i)12(\sqrt{3}-i)^{12}(3​−i)12? First, we convert α=3−i\alpha = \sqrt{3}-iα=3​−i to polar form. The magnitude is ∣α∣=(3)2+(−1)2=3+1=2|\alpha| = \sqrt{(\sqrt{3})^2 + (-1)^2} = \sqrt{3+1} = 2∣α∣=(3​)2+(−1)2​=3+1​=2. The argument θ\thetaθ satisfies cos⁡θ=3/2\cos\theta = \sqrt{3}/2cosθ=3​/2 and sin⁡θ=−1/2\sin\theta = -1/2sinθ=−1/2, so θ=−π/6\theta = -\pi/6θ=−π/6. Thus, α=2(cos⁡(−π/6)+isin⁡(−π/6))\alpha = 2(\cos(-\pi/6) + i\sin(-\pi/6))α=2(cos(−π/6)+isin(−π/6)).

Now, we apply De Moivre's theorem for n=12n=12n=12:

α12=212[cos⁡(12⋅−π6)+isin⁡(12⋅−π6)]\alpha^{12} = 2^{12} \left[ \cos\left(12 \cdot -\frac{\pi}{6}\right) + i\sin\left(12 \cdot -\frac{\pi}{6}\right) \right]α12=212[cos(12⋅−6π​)+isin(12⋅−6π​)]
α12=4096[cos⁡(−2π)+isin⁡(−2π)]=4096(1+i⋅0)=4096\alpha^{12} = 4096 [\cos(-2\pi) + i\sin(-2\pi)] = 4096(1 + i \cdot 0) = 4096α12=4096[cos(−2π)+isin(−2π)]=4096(1+i⋅0)=4096

What seemed like an impossible calculation becomes astonishingly simple. The complex number, after 12 steps of rotation and scaling, lands directly on the positive real axis. Similarly, a calculation like (1−i)10(1-i)^{10}(1−i)10 can be shown to result in −32i-32i−32i, a purely imaginary number. The final result depends critically on the total angle of rotation.

This principle is powerful in many applications, from signal processing to oscillator design. For example, one could determine the specific number of steps nnn for the state of an oscillator, Zn=(1+i3)nZ_n = (1+i\sqrt{3})^nZn​=(1+i3​)n, to produce a "reset pulse" by becoming a large, purely real number. This requires finding an nnn such that the angle nθn\thetanθ is a multiple of π\piπ, while the magnitude 2n2^n2n exceeds a certain threshold.

Seeing the Big Picture: The Dance of Rotations

De Moivre's theorem is more than just a computational shortcut; it's a tool for understanding. Suppose you need to know which quadrant the number Z25Z^{25}Z25 lies in, where Z=32−12iZ = \frac{\sqrt{3}}{2} - \frac{1}{2}iZ=23​​−21​i, but you don't need the exact value.

First, we find the polar form of ZZZ. The magnitude ∣Z∣|Z|∣Z∣ is (3/2)2+(−1/2)2=1\sqrt{(\sqrt{3}/2)^2 + (-1/2)^2} = 1(3​/2)2+(−1/2)2​=1. It lies on the unit circle. The argument is θ=−π/6\theta = -\pi/6θ=−π/6. So, Z=cos⁡(−π/6)+isin⁡(−π/6)Z = \cos(-\pi/6) + i\sin(-\pi/6)Z=cos(−π/6)+isin(−π/6).

According to De Moivre's theorem, Z25=cos⁡(−25π/6)+isin⁡(−25π/6)Z^{25} = \cos(-25\pi/6) + i\sin(-25\pi/6)Z25=cos(−25π/6)+isin(−25π/6). We don't need a calculator. We just need to find where this angle points. The angle −25π/6-25\pi/6−25π/6 is the same as −4π−π/6-4\pi - \pi/6−4π−π/6. Since −4π-4\pi−4π represents two full clockwise rotations, the final direction is identical to the original direction, −π/6-\pi/6−π/6. An angle of −π/6-\pi/6−π/6 points into the fourth quadrant (positive real part, negative imaginary part). We know the answer without ever computing the sines and cosines. The theorem allows us to track the geometric "dance" of the number as it's repeatedly multiplied.

Beyond Positive Powers

The logic of De Moivre's theorem extends seamlessly. Division is the inverse of multiplication, so it's natural to expect that dividing by a complex number means dividing the magnitudes and subtracting the arguments. This is exactly right. This means De Moivre's formula also holds for negative integers. Calculating an impedance parameter like Z−5Z^{-5}Z−5 in an electrical circuit, where Z=3+iZ=\sqrt{3}+iZ=3​+i, becomes a straightforward application of the formula with n=−5n=-5n=−5. The same elegant machinery applies whether we are projecting forward in time (positive powers) or backward (negative powers).

A Deeper Unity: Matrices and Waves

The true beauty of a great principle is in the unexpected connections it reveals. Consider the set of all 2×22 \times 22×2 matrices of the form (a−bba)\begin{pmatrix} a & -b \\ b & a \end{pmatrix}(ab​−ba​). If you add or multiply two such matrices, the result is another matrix of the exact same form. This structure is identical to the arithmetic of complex numbers a+bia+bia+bi. In fact, we can say that the complex number cos⁡ϕ+isin⁡ϕ\cos\phi + i\sin\phicosϕ+isinϕ is the rotation matrix R(ϕ)=(cos⁡ϕ−sin⁡ϕsin⁡ϕcos⁡ϕ)R(\phi) = \begin{pmatrix} \cos\phi & -\sin\phi \\ \sin\phi & \cos\phi \end{pmatrix}R(ϕ)=(cosϕsinϕ​−sinϕcosϕ​).

From this perspective, De Moivre's theorem is stating something that is almost geometrically obvious: applying a rotation of angle ϕ\phiϕ, nnn times in a row, is equivalent to a single rotation by an angle of nϕn\phinϕ. That is, [R(ϕ)]n=R(nϕ)[R(\phi)]^n = R(n\phi)[R(ϕ)]n=R(nϕ). The abstract algebraic statement (cos⁡ϕ+isin⁡ϕ)n=cos⁡(nϕ)+isin⁡(nϕ)(\cos\phi + i\sin\phi)^n = \cos(n\phi) + i\sin(n\phi)(cosϕ+isinϕ)n=cos(nϕ)+isin(nϕ) is a direct reflection of a concrete geometric action in linear algebra.

This connection doesn't stop there. The expression cos⁡(kθ)\cos(k\theta)cos(kθ) appears everywhere in the study of waves, vibrations, and signals. De Moivre's theorem gives us a powerful way to handle it. A complex number on the unit circle is z=eiθz = e^{i\theta}z=eiθ. Its inverse is z−1=e−iθz^{-1} = e^{-i\theta}z−1=e−iθ. Let's add them:

z+z−1=eiθ+e−iθ=(cos⁡θ+isin⁡θ)+(cos⁡θ−isin⁡θ)=2cos⁡θz + z^{-1} = e^{i\theta} + e^{-i\theta} = (\cos\theta + i\sin\theta) + (\cos\theta - i\sin\theta) = 2\cos\thetaz+z−1=eiθ+e−iθ=(cosθ+isinθ)+(cosθ−isinθ)=2cosθ

And by De Moivre's theorem, this generalizes immediately to:

zk+z−k=2cos⁡(kθ)z^k + z^{-k} = 2\cos(k\theta)zk+z−k=2cos(kθ)

This little identity is a gateway between complex analysis and Fourier analysis. It allows us to turn messy trigonometric sums into elegant geometric series in the complex plane, a technique crucial for understanding periodic structures like crystals or complex waves.

From a shortcut for tedious multiplication to a unifying principle connecting algebra, geometry, and wave mechanics, De Moivre's theorem is a perfect example of how a change in perspective can transform a problem, revealing the simple, beautiful structure that lies beneath the surface.

Applications and Interdisciplinary Connections

After our journey through the elegant mechanics of De Moivre's theorem, one might be tempted to file it away as a beautiful, but perhaps niche, mathematical curiosity. Nothing could be further from the truth. This theorem is not a museum piece; it is a master key, unlocking doors that connect seemingly disparate rooms in the grand house of science and engineering. It serves as a remarkable bridge between the algebra of powers, the geometry of rotations, and the periodic world of trigonometry. Let's explore some of these connections and see the theorem in action.

The Master Key to Trigonometry

At its most immediate, De Moivre's theorem is a powerhouse for manipulating trigonometric functions. Have you ever tried to derive the formula for cos⁡(5θ)\cos(5\theta)cos(5θ) using only the angle addition formulas like cos⁡(A+B)\cos(A+B)cos(A+B)? It's a torturous path, a thicket of algebra where a single misplaced sign spells disaster. De Moivre's theorem, however, turns this ordeal into an elegant stroll. By taking the real part of (cos⁡θ+isin⁡θ)5(\cos\theta + i\sin\theta)^5(cosθ+isinθ)5, a straightforward binomial expansion gives us cos⁡(5θ)\cos(5\theta)cos(5θ) as a neat polynomial in cos⁡θ\cos\thetacosθ and sin⁡θ\sin\thetasinθ. With the help of the identity sin⁡2θ=1−cos⁡2θ\sin^2\theta = 1 - \cos^2\thetasin2θ=1−cos2θ, we can express cos⁡(nθ)\cos(n\theta)cos(nθ) as a polynomial purely in terms of cos⁡θ\cos\thetacosθ. These resulting expressions are no mere academic exercises; they are the famous Chebyshev polynomials, fundamental tools in numerical analysis and approximation theory, used to find the "best" polynomial approximations of complex functions.

The same principle allows us to unravel identities for functions like sin⁡(nθ)\sin(n\theta)sin(nθ) or even tan⁡(nθ)\tan(n\theta)tan(nθ). This power to translate trigonometric statements about multiple angles into algebraic polynomials is profound. It allows us to turn a difficult trigonometric equation, like finding the angles θ\thetaθ where tan⁡(5θ)=1\tan(5\theta) = 1tan(5θ)=1, into a standard polynomial equation. Once in the polynomial world, we can bring in powerful tools like Vieta's formulas to analyze the properties of the solutions without ever needing to calculate the specific angles themselves.

The theorem works just as beautifully in reverse. In physics and engineering, we often encounter powers of trigonometric functions, like sin⁡5(θ)\sin^5(\theta)sin5(θ), which can be very difficult to integrate or analyze. For instance, the energy of a non-linear oscillator might depend on the fourth or fifth power of its displacement. De Moivre's theorem, via Euler's formula, allows us to perform a "linearization." We can express sin⁡θ=exp⁡(iθ)−exp⁡(−iθ)2i\sin\theta = \frac{\exp(i\theta) - \exp(-i\theta)}{2i}sinθ=2iexp(iθ)−exp(−iθ)​, take its power, and then group the terms to get a simple sum of sines or cosines of multiple angles. A complicated, high-power oscillation is thus revealed to be a simple superposition of pure, harmonic tones. This technique is a cornerstone of Fourier analysis, which is used everywhere from signal processing (decomposing a sound wave into its constituent frequencies) to quantum mechanics.

Solving the Unsolvable: From Roots to Infinite Sums

The theorem's true genius shines when it's used to solve problems that seem intractable in the real domain. Consider the simple-looking equation zn=1z^n = 1zn=1. In the real numbers, the answers are trivial: 111 if nnn is odd, and ±1\pm 1±1 if nnn is even. But in the complex plane, De Moivre's theorem reveals a stunningly beautiful and symmetric solution: nnn distinct roots, all lying on the unit circle, forming the vertices of a regular nnn-sided polygon. This deep insight extends to far more complex equations. Finding the roots of a polynomial like z8+z4+1=0z^8 + z^4 + 1 = 0z8+z4+1=0 becomes a manageable, two-step process: first solve for z4z^4z4, then find the roots of the results. Each step is a direct application of De Moivre's formula, turning a single problem into a cascade of simpler ones and revealing a beautiful constellation of solutions in the complex plane.

The method of embedding a real problem into a complex one is also a secret weapon for evaluating formidable-looking sums. Imagine being asked to calculate the sum Sn(θ)=∑k=1nk(nk)cos⁡(kθ)S_n(\theta) = \sum_{k=1}^{n} k \binom{n}{k} \cos(k\theta)Sn​(θ)=∑k=1n​k(kn​)cos(kθ). Attacking this directly is a nightmare. The trick is to realize that this sum is just the real part of a related complex sum, ∑k(nk)(exp⁡(iθ))k\sum k \binom{n}{k} (\exp(i\theta))^k∑k(kn​)(exp(iθ))k. This complex sum, however, can be recognized as a derivative of the simple binomial expansion ∑(nk)xk=(1+x)n\sum \binom{n}{k} x^k = (1+x)^n∑(kn​)xk=(1+x)n. By performing the calculus in the complex domain, we arrive at a compact, elegant expression. Taking its real part at the very end gives us the answer to our original, difficult problem. It feels like a magic trick, but it is a testament to the power of viewing a problem from a higher, complex-valued dimension.

Beyond the Plane: Matrices, Rotations, and New Algebras

The influence of De Moivre's theorem extends far beyond the complex plane itself, providing a conceptual blueprint for other areas of mathematics and physics.

In linear algebra, we often need to compute high powers of a matrix, MnM^nMn. This is computationally expensive, but if the matrix can be diagonalized, the problem simplifies to computing the powers of its eigenvalues. What if these eigenvalues are complex numbers? Suppose the eigenvalues of a matrix describing a 2D transformation are eiπ/4e^{i\pi/4}eiπ/4 and e−iπ/4e^{-i\pi/4}e−iπ/4. Then the nnn-th power of these eigenvalues is immediately given by De Moivre's formula as einπ/4e^{in\pi/4}einπ/4 and e−inπ/4e^{-in\pi/4}e−inπ/4. This allows us to find a simple, closed-form expression for quantities like the trace of MnM^nMn, directly connecting matrix exponentiation to trigonometric functions. The abstract algebraic operation of taking a matrix power is seen to correspond to a simple geometric rotation.

Perhaps the most stunning extension of De Moivre's formula is into the realm of three-dimensional rotations. In computer graphics, robotics, and aerospace navigation, we constantly need to describe and compose rotations in 3D space. While matrices can do this, they are cumbersome. A more elegant solution lies in a new number system called quaternions. A unit quaternion of the form q=cos⁡(θ2)+usin⁡(θ2)q = \cos(\frac{\theta}{2}) + \mathbf{u}\sin(\frac{\theta}{2})q=cos(2θ​)+usin(2θ​), where u\mathbf{u}u is a vector-like object whose square is −1-1−1, can represent a rotation by angle θ\thetaθ around the axis u\mathbf{u}u.

What happens if we perform this rotation nnn times? We must calculate qnq^nqn. The structure of quaternions wonderfully mimics that of complex numbers, and an analogue of De Moivre's theorem emerges: qn=cos⁡(nθ2)+usin⁡(nθ2)q^n = \cos(\frac{n\theta}{2}) + \mathbf{u}\sin(\frac{n\theta}{2})qn=cos(2nθ​)+usin(2nθ​). The fundamental idea—that taking a power corresponds to multiplying the angle—persists even in this more sophisticated algebraic setting. The simple rule discovered by De Moivre for rotations in a 2D plane provides the very pattern for describing composite rotations in 3D space, a principle used countless times a second in every modern video game engine and spacecraft guidance system.

From deriving trigonometric identities to solving polynomials, from summing series to powering matrices and guiding spaceships, the legacy of De Moivre's theorem is rich and vibrant. It is a prime example of how a single, elegant mathematical idea can echo through the centuries, providing clarity, power, and a deeper appreciation for the profound unity of the mathematical and physical worlds.