The Sum of Two Squares Theorem

SciencePedia

Key Takeaways

An integer can be written as a sum of two squares if and only if all its prime factors of the form $4k+3$ appear with an even exponent in its prime factorization.
The concept of Gaussian integers elegantly explains this theorem by reframing the problem as one of factorization in the complex plane.
The theorem has profound and unexpected applications in diverse fields such as the geometry of crystal lattices, the physics of wave vibrations, and the security of modern cryptography.
On average, the number of ways an integer can be written as a sum of two squares is not an integer but the fundamental mathematical constant π.

Introduction

What makes a number special? For mathematicians, a simple question like "Which numbers can be written as the sum of two squares?" can unlock a world of profound beauty and hidden connections. While numbers like 5 ( $2^2 + 1^2$ ) and 8 ( $2^2 + 2^2$ ) fit this description, others like 3 and 6 mysteriously do not. This simple observation poses a puzzle: is there a universal rule governing this property, or is it merely an arithmetic coincidence? The pursuit of this answer reveals not just a rule, but a stunning narrative that connects arithmetic, algebra, and geometry.

This article unravels the mystery behind the sum of two squares. It provides a complete answer to this centuries-old question and showcases how a single idea can ripple through mathematics, revealing its deep, underlying unity. In the following chapters, we will first delve into the "Principles and Mechanisms," uncovering the complete criterion through the filters of modular arithmetic and the elegant world of Gaussian integers. We will then explore the theorem's surprising reach in "Applications and Interdisciplinary Connections," discovering its impact on fields ranging from the geometry of crystal lattices and the physics of wave mechanics to the security of modern digital cryptography.

Principles and Mechanisms

The question of which numbers can be written as the sum of two squares seems, at first, like a simple arithmetic curiosity. You can check a few: $1 = 1^2 + 0^2$ , $2 = 1^2 + 1^2$ , $4 = 2^2 + 0^2$ , $5 = 2^2 + 1^2$ . But then you hit a snag: $3$ is impossible. So is $6$ . And $7$ . Then $8 = 2^2 + 2^2$ works. What's the pattern here? Is there a deep principle governing this seemingly random game? The answer, it turns out, is a resounding yes. It's a beautiful story that takes us from simple arithmetic to the geometric world of complex numbers and back again, revealing a surprising unity in mathematics.

A Simple Filter: The Modulo 4 Test

Let's start by trying to find a rule that tells us which numbers cannot be a sum of two squares. This is often easier than finding out which ones can. Imagine we are detectives looking for a common trait among the "impossible" numbers like 3, 7, 11, 15, and so on.

Consider any integer, let's call it $n$ . When you square it, what happens? If $n$ is even, we can write it as $n=2k$ . Its square is $n^2 = (2k)^2 = 4k^2$ . This number is a multiple of 4. In the language of modular arithmetic, we say $n^2 \equiv 0 \pmod{4}$ . If $n$ is odd, we can write it as $n=2k+1$ . Its square is $n^2 = (2k+1)^2 = 4k^2 + 4k + 1 = 4(k^2+k) + 1$ . This number always leaves a remainder of 1 when divided by 4. So, $n^2 \equiv 1 \pmod{4}$ .

This is a powerful observation! The square of any integer, when divided by 4, can only leave a remainder of 0 or 1. Never 2 or 3.

Now, what happens when we add two squares, $a^2 + b^2$ ? The possible remainders modulo 4 for this sum are simply the sums of the possible remainders for each square:

$0 + 0 = 0$ (if both $a$ and $b$ are even)
$0 + 1 = 1$ (if one is even, one is odd)
$1 + 1 = 2$ (if both $a$ and $b$ are odd)

Look at that list: 0, 1, 2. The remainder 3 is nowhere to be found! This gives us our first powerful rule: any integer that leaves a remainder of 3 when divided by 4 cannot be written as the sum of two squares. This immediately rules out 3, 7, 11, 15, 19, 23, and an infinite list of others. It’s a simple but remarkably effective filter.

The Heart of the Matter: Prime Numbers

This modular arithmetic test is a great start, but it's not the whole story. For instance, $6 \equiv 2 \pmod{4}$ and $21 \equiv 1 \pmod{4}$ , and neither of these can be written as a sum of two squares. To get a deeper understanding, we must turn our attention to the building blocks of all integers: the prime numbers.

Let's apply our filter to odd primes. If an odd prime $p$ can be written as $p = a^2 + b^2$ , we know it can't be of the form $4k+3$ . Since it's an odd prime, it also can't be of the form $4k$ or $4k+2$ . The only possibility left is that it must be of the form $4k+1$ . Indeed, if we look at $p=a^2+b^2$ , one of $a, b$ must be even and the other odd (otherwise their sum of squares would be even). This means $a^2+b^2 \equiv 0+1 \equiv 1 \pmod{4}$ .

This leads us to a momentous conjecture, first proposed by Pierre de Fermat in the 17th century:

An odd prime number $p$ can be written as the sum of two squares if and only if it leaves a remainder of 1 when divided by 4.

The "only if" part we just figured out. But the "if" part—that every prime of the form $4k+1$ can be written as a sum of two squares—is much harder to prove. Leonhard Euler was the first to provide a complete proof, and one of the most elegant methods uses a technique called "infinite descent." The logic is wonderfully clever: you assume there's a smallest prime of the form $4k+1$ that cannot be written as a sum of two squares. Then, through a series of ingenious steps, you show that this assumption implies the existence of an even smaller prime of the same form with the same property. This is a logical contradiction, like finding a "smallest positive number" that isn't the smallest. The only way out of the paradox is for the initial assumption to be false, meaning no such prime exists. This proves that every prime of the form $4k+1$ is a sum of two squares. And not just that, but the representation is unique (ignoring order and signs). For example, $37 = 1^2 + 6^2$ and there's no other way to do it with two positive integers.

A New World: The Gaussian Integers

The "if and only if" condition is beautiful, but it still feels a bit like magic. Why should the remainder when divided by 4 have anything to do with this? The most profound explanation comes from stepping outside our familiar one-dimensional number line and into a two-dimensional world: the complex plane.

Let's consider numbers of the form $a+bi$ , where $a$ and $b$ are integers and $i$ is the imaginary unit with $i^2 = -1$ . These are called the Gaussian integers, and they form a grid of points in the complex plane. Just as we can do arithmetic with regular integers (add, subtract, multiply), we can do the same with Gaussian integers.

The key insight is to look at the "size" of a Gaussian integer, which we call its norm. The norm of $z = a+bi$ is defined as $N(z) = a^2 + b^2$ . Does that look familiar? It's exactly the sum of two squares we've been hunting for! Geometrically, it's the square of the distance from the origin to the point $(a,b)$ in the plane.

Suddenly, our original question—"Can an integer $n$ be written as $a^2+b^2$ ?"—is transformed. It becomes: "Is the integer $n$ the norm of some Gaussian integer $a+bi$ ?".

This new perspective is incredibly powerful because it connects the sum of squares to factorization. Notice that $a^2+b^2 = (a+bi)(a-bi)$ . So, writing an integer prime $p$ as a sum of two squares, $p=a^2+b^2$ , is precisely the same thing as factoring $p$ into two Gaussian integers: $p = (a+bi)(a-bi)$ .

In our world of regular integers, a prime like 7 cannot be factored. It turns out that 7 also cannot be factored in the world of Gaussian integers—it remains a "Gaussian prime." But a prime like 5, which is $5=2^2+1^2$ , can be factored as $5 = (2+i)(2-i)$ . So, 5 is prime in our world but not in the Gaussian world.

The rule that Fermat discovered now has a beautiful explanation:

A prime $p \equiv 3 \pmod{4}$ (like 3, 7, 11) remains prime in the Gaussian integers. It cannot be factored, and thus it cannot be written as a sum of two squares.
A prime $p \equiv 1 \pmod{4}$ (like 5, 13, 17) is no longer prime in the Gaussian integers. It "splits" into a product of two Gaussian primes, $(a+bi)$ and $(a-bi)$ . This factorization directly gives us the sum-of-squares representation $p = a^2+b^2$ .
The prime 2 is a special case. It factors as $2 = (1+i)(1-i) = (1+i)(-i(1+i)) = -i(1+i)^2$ . It is the only prime that is a "squared" Gaussian prime, up to a unit factor.

The Magic of Multiplication and The Complete Recipe

Now that we understand primes, what about composite numbers like 6, 10, or 45? The answer comes from another beautiful identity, which looks magical at first glance but is perfectly natural in the world of Gaussian integers.

If you have two numbers that are sums of two squares, say $N_1 = a^2+b^2$ and $N_2 = c^2+d^2$ , their product $N_1 N_2$ is also a sum of two squares. This is guaranteed by the Brahmagupta–Fibonacci identity: $(a^2+b^2)(c^2+d^2) = (ac-bd)^2 + (ad+bc)^2$ Where does this come from? It's just a statement about the norms of Gaussian integers! Let $z_1 = a+bi$ and $z_2 = c+di$ . Then $N(z_1) = a^2+b^2$ and $N(z_2) = c^2+d^2$ . Their product is $z_1 z_2 = (ac-bd) + i(ad+bc)$ . The norm of this product is $N(z_1 z_2) = (ac-bd)^2 + (ad+bc)^2$ . Since the norm of a product is the product of the norms, $N(z_1 z_2) = N(z_1)N(z_2)$ , the identity is proven.

This property—that the set of sums of two squares is closed under multiplication—is the final piece of the puzzle. It allows us to build a complete rule for any positive integer $n$ . We just need to look at its prime factorization.

Factors of 2 are fine.
Factors of primes of the form $4k+1$ are fine.
What about factors of primes of the form $4k+3$ ? If we have one such factor, say $n=3$ , it's not a sum of two squares. If we have two, say $n=3 \times 3 = 9$ , then $9 = 3^2+0^2$ , so it works. If we have $n = 3 \times 7 = 21$ , it fails. But $n=3^2 \times 7 = 63$ also fails.

The final, complete criterion is this: An integer $n > 0$ can be written as a sum of two squares if and only if in its prime factorization, every prime of the form $4k+3$ appears with an even exponent. So $45 = 3^2 \times 5$ works because the prime 3 (which is $4k+3$ type) has an even exponent (2). And indeed, $45=6^2+3^2$ . But $42 = 2 \times 3 \times 7$ fails because both 3 and 7 have an odd exponent (1).

A View from the Mountaintop

From this detailed recipe, we can zoom out and ask bigger questions. How common are these numbers? It turns out they become progressively rarer as you go up the number line. The number of such integers up to a large number $r$ is roughly proportional to $\frac{r}{\sqrt{\ln r}}$ .

But perhaps the most astonishing result of all comes when we ask a different question: on average, how many ways are there to write a number as a sum of two squares? Remember, we count $(\pm a, \pm b)$ and $( \pm b, \pm a)$ as different ways. So $r_2(5) = 8$ because of $(\pm 1, \pm 2)$ and $(\pm 2, \pm 1)$ . The average value, taken over all integers, is not an integer or a simple fraction. It is $\pi$ .

$\lim_{x\to\infty} \frac{1}{x} \sum_{n=1}^{\lfloor x \rfloor} r_2(n) = \pi$

Why on earth would $\pi$ , the ratio of a circle's circumference to its diameter, appear here? The intuition is profoundly geometric. The sum $\sum_{n \le x} r_2(n)$ is just the total number of integer grid points $(a,b)$ inside or on a circle of radius $\sqrt{x}$ , since for each such point, $a^2+b^2 = n \le x$ . The number of these points is approximately the area of the circle, which is $\pi (\sqrt{x})^2 = \pi x$ . So the average number of points per integer $n$ up to $x$ is about $\frac{\pi x}{x} = \pi$ .

What began as a simple game of adding squares has led us to modular arithmetic, prime numbers, the beautiful geometry of Gaussian integers, and finally, to a deep and unexpected connection with the fundamental constant $\pi$ . It’s a perfect illustration of how, in mathematics, the simplest questions can often lead to the most profound and interconnected truths.

Applications and Interdisciplinary Connections

In the previous chapter, we embarked on a journey into the heart of number theory, uncovering the elegant rules that govern which integers can be expressed as the sum of two squares. We now have the key in our hands, a criterion based on prime factorization. But what doors does this key unlock? You might be tempted to think this is a niche curiosity, a charming but isolated piece of mathematical trivia. Nothing could be further from the truth.

The story of sums of two squares is a spectacular example of what makes mathematics so powerful and beautiful: its profound and often surprising unity. A simple question about whole numbers turns out to be a master key, revealing deep connections between seemingly disparate worlds—the geometry of grids, the physics of vibration, the statistical behavior of numbers, the abstract structures of modern algebra, and even the cutting-edge technology of digital security. Let's go on a tour and see where this idea appears, often in disguise.

The Geometry of Grids and Lattices

Perhaps the most intuitive place to find our theorem at work is in the world of geometry. Imagine a vast, two-dimensional sheet of graph paper, extending infinitely in all directions. The points with integer coordinates $(x,y)$ form what mathematicians call an integer lattice, denoted $\mathbb{Z}^2$ . This isn't just an abstraction; it's the fundamental model for the orderly arrangement of atoms in a two-dimensional crystal.

A natural question arises: if you pick two atoms in this crystal, what are the possible values for the square of the distance between them? If the atoms are at $(x_1, y_1)$ and $(x_2, y_2)$ , the squared distance is $d^2 = (x_2 - x_1)^2 + (y_2 - y_1)^2$ . Since the coordinates are all integers, their differences, let's call them $a = x_2 - x_1$ and $b = y_2 - y_1$ , are also integers. Suddenly, a question about physics and geometry—determining possible interaction potentials in a crystal—has transformed into our original question: which numbers can be written as a sum of two integer squares, $a^2+b^2$ ? Our theorem gives the complete answer. An integer like 2023, whose prime factorization is $7^1 \times 17^2$ , can never represent a squared distance on this lattice because the prime $7$ (which is of the form $4k+3$ ) appears with an odd exponent. In contrast, 2025, or $45^2$ , which is $3^4 \times 5^2$ , is a perfectly valid squared distance, as is 2050, or $2 \times 5^2 \times 41$ .

This connection becomes even more vivid when we consider circles. Imagine a ripple spreading from an atom at the origin $(0,0)$ . At any moment, the ripple forms a circle with equation $x^2 + y^2 = R$ . Will this circle hit any other atoms? It will if and only if $R$ is a number that can be written as a sum of two squares. For a circle like $x^2+y^2=7$ , the answer is no. For a circle like $x^2+y^2=5$ , the answer is yes, and it will pass through eight points: $(\pm 1, \pm 2)$ and $(\pm 2, \pm 1)$ .

What's more, the arithmetic of the number of representations, $r_2(R)$ , dictates the geometry of the points. For $R=1125 = 3^2 \times 5^3$ , our formulas tell us there are exactly 16 integer points on the circle. But these 16 points do not form a regular 16-gon. Instead, they form two distinct, concentric, irregular 8-sided polygons, a beautiful and subtle symmetry dictated entirely by the number-theoretic properties of 1125.

The theorem's geometric influence is not limited to distances. In a surprising twist, it also governs the possible areas of certain shapes. Consider a right-angled triangle whose three vertices all lie on the integer lattice, with the extra condition that its legs are not parallel to the axes. The area of such a triangle can always be written as $\mathcal{A} = \frac{N}{2}$ for some integer $N$ . One might guess $N$ could be any integer, but it cannot. The integer $N$ must have at least one prime factor that is not of the form $4k+3$ . This means an area like $\frac{3}{2}$ or $\frac{7}{2}$ is impossible for such a triangle! This remarkable constraint emerges because the calculation of the area inevitably involves an expression of the form $a^2+b^2$ .

The Music of a Donut

Let's switch fields from pure geometry to physics, specifically to the study of waves and vibrations. Imagine a drumhead. When you strike it, it vibrates at a set of fundamental frequencies, its "resonant modes." These frequencies are not random; they are determined by the eigenvalues of a physical operator called the Laplacian. For a rectangular drum, the calculation is straightforward. But what if we had a more exotic drum, say, one shaped like the surface of a donut, or a torus?

This is a classic problem in mathematical physics, with applications ranging from wave mechanics to quantum field theory. One can build a simple flat torus by taking a rectangular piece of paper and gluing its opposite edges—top to bottom, and left to right. The allowed frequencies of vibration on this surface correspond to the eigenvalues of the negative Laplacian operator, $-\Delta$ . And what are these eigenvalues? In a stunning reveal, they turn out to be precisely the integers that can be written as a sum of two squares, $\lambda = n^2 + m^2$ for integers $n$ and $m$ !

An integer like 3 or 7, which cannot be written as a sum of two squares, is a "forbidden" frequency on the torus. It cannot be a fundamental mode of vibration. An integer like 50, which is $1^2 + 7^2$ and $5^2+5^2$ , is an allowed frequency. But there's more. In physics, it's common for different vibrational patterns (eigenfunctions) to correspond to the same frequency (eigenvalue). This phenomenon is called degeneracy, and its measure is the eigenvalue's multiplicity. For the torus, the multiplicity of an eigenvalue $\lambda$ is exactly $r_2(\lambda)$ , the number of ways to write $\lambda$ as a sum of two squares. So, the eigenvalue $\lambda=50$ has a multiplicity of $r_2(50) = 12$ , because it can be formed from $(\pm 1, \pm 7)$ , $(\pm 7, \pm 1)$ , and $(\pm 5, \pm 5)$ . The question "How many ways can a number be written as a sum of two squares?" is the same as the physicist's question, "How many distinct modes of vibration share this frequency?"

An Unexpected Average: The Appearance of $\pi$

So, we know which numbers are sums of two squares, but are they common or rare? A quick glance shows they seem to thin out as we go to higher numbers. Landau's theorem on the distribution of these numbers confirms this: the number of sums of two squares up to a large number $x$ is proportional to $\frac{x}{\sqrt{\ln x}}$ . Since this grows more slowly than $x$ , their "density" among the integers approaches zero.

This might lead you to believe they are, on average, insignificant. Let's ask a different question. We know that some numbers, like 3, have zero representations. Others, like 5, have eight. Highly composite numbers can have many, many more. What is the average number of ways an integer can be written as a sum of two squares? If we sum up all the $r_2(k)$ values from $k=1$ to a large $N$ and then divide by $N$ , what does this value approach?

The intuition is geometric. The sum $\sum_{k=1}^N r_2(k)$ is simply the number of integer lattice points inside or on the circle $x^2+y^2=N$ . For large $N$ , the number of these points is well-approximated by the area of the circle, which is $\pi (\sqrt{N})^2 = \pi N$ . So the average number of points per integer value should be roughly $\pi N / N = \pi$ .

This simple, beautiful argument turns out to be exactly right. Advanced methods from analytic number theory, involving the study of so-called Epstein zeta functions, rigorously confirm this result. The average value of $r_2(k)$ is not some complicated fraction, but precisely $\pi$ . The fundamental constant of circles and spheres emerges from a discrete counting problem about integer squares. It's a breathtaking reminder that the continuous world of geometry and the discrete world of arithmetic are just two sides of the same coin.

Deep Structures in Algebra and Analysis

The sum of two squares theorem is not just a result; it's a window into deeper algebraic structures. The natural home for this theorem is the set of Gaussian integers, complex numbers of the form $a+bi$ where $a$ and $b$ are integers. In this world, the question " $n = a^2+b^2$ " becomes a question about factorization: can the integer $n$ be factored into $(a+bi)(a-bi)$ ? Our theorem is a translation of a fundamental fact: a prime number $p$ can be factored in the Gaussian integers if and only if it is 2 or congruent to $1 \pmod{4}$ .

This structural property echoes throughout more abstract mathematics. When we generalize from Gaussian integers to Gaussian rationals (numbers $a+bi$ where $a,b$ are rational), the theorem's criterion reappears, this time characterizing the image of the norm map—a fundamental homomorphism in abstract algebra. The theorem also provides the essential building blocks for the Dedekind zeta function of the Gaussian numbers, a powerful function in analytic number theory whose "Euler product" form depends directly on how primes factorize.

The theorem's influence even extends to complex analysis. Consider a function defined by a power series, $f(z) = \sum a_n z^n$ , where we set the coefficient $a_n=1$ if $n$ is a sum of two squares and $a_n=0$ otherwise. This function is perfectly well-defined inside the unit circle $|z| \lt 1$ . Can we extend it beyond this boundary? For many functions, we can. But not for this one. The specific, irregular-yet-structured pattern of which coefficients are zero and which are one—a pattern dictated by our theorem—creates an impenetrable barrier. The unit circle becomes a natural boundary, a fractal wall of singularities through which no analytic continuation is possible. The arithmetic nature of the exponents dictates the analytic fate of the function.

From Ancient Riddles to Modern Cryptography

When Pierre de Fermat studied these numbers in the 17th century, it was a pursuit of pure intellectual curiosity, a beautiful puzzle with no conceivable practical application. For centuries, it remained so. Who could have imagined that this very question would become relevant to the security of our digital age?

The connection comes through one of the most advanced areas of modern number theory: elliptic curves. These are curves defined by equations like $y^2 = x^3 + Ax + B$ . Over the past few decades, elliptic curves defined over finite fields have become a cornerstone of modern cryptography, the science of secure communication that protects everything from your credit card transactions to state secrets.

A crucial part of using these curves for cryptography is being able to count the number of points on them over a finite field $\mathbb{F}_p$ . For the specific, important elliptic curve $E: y^2 \equiv x^3 - x \pmod p$ , the number of points—and thus its security properties—depends critically on the nature of the prime $p$ . If $p \equiv 3 \pmod 4$ , the calculation is simple. But if $p \equiv 1 \pmod 4$ , the number of points is determined by how you write $p$ as a sum of two squares, $p = a^2+b^2$ . The "trace of Frobenius," a key parameter related to the point count, is given by $-2a$ (with a specific rule to fix the sign of $a$ ).

Think about this for a moment. A 17th-century riddle about integer sums is now an essential part of the mathematics used to secure 21st-century communications. It is a stunning testament to the enduring power and unforeseeable utility of fundamental mathematical exploration.

Our journey is complete. We began with a simple question and have seen its reflection in crystals, heard its echo in the music of a torus, found its average value to be $\pi$ , and discovered its signature in the foundations of modern cryptography. The sum of two squares theorem is far more than a theorem; it is a story about the interconnectedness of all of mathematics, a lesson in how the pursuit of pure, beautiful ideas can lead to the most unexpected and profound destinations.

The Sum of Two Squares Theorem

Introduction

Principles and Mechanisms

A Simple Filter: The Modulo 4 Test

The Heart of the Matter: Prime Numbers

A New World: The Gaussian Integers

The Magic of Multiplication and The Complete Recipe

A View from the Mountaintop

Applications and Interdisciplinary Connections

The Geometry of Grids and Lattices

The Music of a Donut

An Unexpected Average: The Appearance of π\piπ

Deep Structures in Algebra and Analysis

From Ancient Riddles to Modern Cryptography

The Sum of Two Squares Theorem

Introduction

Principles and Mechanisms

A Simple Filter: The Modulo 4 Test

The Heart of the Matter: Prime Numbers

A New World: The Gaussian Integers

The Magic of Multiplication and The Complete Recipe

A View from the Mountaintop

Applications and Interdisciplinary Connections

The Geometry of Grids and Lattices

The Music of a Donut

An Unexpected Average: The Appearance of π\piπ

Deep Structures in Algebra and Analysis

From Ancient Riddles to Modern Cryptography

An Unexpected Average: The Appearance of $\pi$

An Unexpected Average: The Appearance of $\pi$