
Weil Height

Key Takeaways
  • The Weil height measures the arithmetic complexity of an algebraic number, considering its defining polynomial or its properties across all absolute values.
  • Northcott's Finiteness Property is a crucial result establishing that only finitely many algebraic numbers exist below given bounds on height and degree.
  • The Néron-Tate canonical height is a refined, quadratic height function on elliptic curves essential for understanding their group structure and finding rational points.
  • The height function provides a unifying language in arithmetic geometry, connecting major theorems and conjectures like Faltings' theorem and the abc conjecture.

Introduction

In mathematics, how do we measure the "size" of a number? While absolute value tells us a number's distance from zero, it fails to capture its inherent arithmetic complexity. A number like 10001/10000 is close to 1 but feels more complex than the integer 2. This gap highlights the need for a more nuanced ruler in number theory. The Weil height is that ruler—a profound concept that quantifies the complexity of numbers, paving the way for deeper insights into their structure. This article provides a comprehensive overview of this fundamental tool. The first chapter, "Principles and Mechanisms," will construct the Weil height from the ground up, exploring its definitions for different number types, its core properties, and its refinement into the canonical height on geometric objects. Following this, the "Applications and Interdisciplinary Connections" chapter will demonstrate the height's power in action, showing how it serves as a critical tool in solving Diophantine equations, understanding algebraic structures, and framing some of the most important conjectures in modern arithmetic geometry.

Principles and Mechanisms

Suppose I ask you a simple question: how "big" is a number? Your first instinct might be to talk about its distance from zero on the number line—what mathematicians call its absolute value. The number 1,000 is "bigger" than 10. Simple enough. But in the world of numbers, especially the rational numbers (fractions), things are not always so straightforward. Is the number 10,001/10,000 big or small? Its value is very close to 1, so in that sense it's small. But look at the components! It's built from rather large integers. And what about −84/125? Its absolute value is less than one, but it seems somehow more "complicated" than, say, the integer 2.

This hints that we need a different notion of size, one that captures not just the magnitude of a number, but its arithmetic complexity. This is the beautiful idea behind the ​​Weil height​​. It’s a tool that allows us to measure the complexity of numbers in a way that is profound, elegant, and incredibly useful. It's the physicist's dream of a conserved quantity, but for the world of numbers.

What is the "Size" of a Rational Number?

Let's try to invent a measure of complexity for a rational number, say x = a/b, where we've canceled all common factors so the fraction is in lowest terms. A good measure should probably depend on the size of the numerator a and the denominator b. A natural first guess is to take the larger of the two (in absolute value). We call this the naive height, H(x) = max{|a|, |b|}.

This is a good start. For an integer n (which we write as n/1), the naive height is H(n) = |n|. The height of 1/n is H(1/n) = |n|. This is nice! The numbers n and 1/n are, in a sense, equally complex, and this height function captures that.

However, mathematicians have learned that it’s often much better to work with sums than products. Logarithms turn products into sums. So, we define the ​​absolute logarithmic Weil height​​ (or just ​​height​​) as the natural logarithm of the naive height:

h(x) = \log \max\{|a|, |b|\}

Let's get a feel for this. Using this definition, the height of the integer 2 (or 2/1) is h(2) = log max{|2|, |1|} = log 2. The height of its reciprocal, 1/2, is h(1/2) = log max{|1|, |2|} = log 2. They are the same, just as we hoped. What about 1/3? Its height is h(1/3) = log max{|1|, |3|} = log 3. The height of a number is always non-negative, since for any fraction a/b in lowest terms, max{|a|, |b|} is at least 1. It's only zero for special numbers like 0/1, 1/1, and −1/1.
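These calculations are easy to reproduce. Here is a minimal Python sketch (the helper name `weil_height` is ours) that computes the logarithmic height of a rational number; Python's `Fraction` keeps fractions in lowest terms automatically, so the definition translates directly:

```python
from fractions import Fraction
from math import log

def weil_height(x) -> float:
    """Logarithmic Weil height h(x) = log max{|a|, |b|} of a rational a/b."""
    x = Fraction(x)   # Fraction always stores a/b in lowest terms
    return log(max(abs(x.numerator), abs(x.denominator)))

print(weil_height(2))                 # log 2, about 0.693
print(weil_height(Fraction(1, 2)))    # log 2 again: 2 and 1/2 are equally complex
print(weil_height(Fraction(1, 3)))    # log 3, about 1.099
print(weil_height(-1))                # 0.0: heights are never negative
```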

A Universal Language: Heights from All Perspectives

How do we extend this idea from simple fractions to more complex numbers like √2 or the golden ratio φ = (1 + √5)/2? These are algebraic numbers—roots of polynomials with integer coefficients. It turns out there are two ways to think about their height, and the fact that these two very different-looking approaches give the exact same answer is one of the first signs that we're onto something deep.

Perspective 1: It's All in the Family

An algebraic number α is defined by its family—the other roots of its minimal polynomial. The minimal polynomial of √2 is x² − 2, whose roots are √2 and −√2. The "complexity" of √2 must surely be tied to this polynomial.

A general formula exists that formalizes this. For an algebraic number α of degree d with minimal polynomial a_d x^d + ⋯ + a_0 (with integer coefficients), its height is given by a beautiful formula that involves the leading coefficient a_d and all the roots α_1, …, α_d (the "Galois conjugates" of α):

h(\alpha) = \frac{1}{d} \left( \log|a_d| + \sum_{i=1}^{d} \log\max\{1, |\alpha_i|\} \right)

Let's test this on √2. Its minimal polynomial is x² − 2. Here, the degree is d = 2, the leading coefficient is a_2 = 1, and the roots are α_1 = √2 and α_2 = −√2. Plugging these in:

h(\sqrt{2}) = \frac{1}{2} \left( \log|1| + \log\max\{1, |\sqrt{2}|\} + \log\max\{1, |-\sqrt{2}|\} \right)

Since log(1) = 0 and |√2| = |−√2| > 1, this simplifies wonderfully:

h(\sqrt{2}) = \frac{1}{2} \left( 0 + \log\sqrt{2} + \log\sqrt{2} \right) = \log\sqrt{2} = \frac{1}{2}\log 2

This is remarkable! The height of √2 is exactly half the height of 2. It confirms our intuition that √2 should be arithmetically "simpler" than 2.
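The formula is easy to experiment with numerically. The sketch below (our own helper, using `numpy.roots` to approximate the Galois conjugates; it assumes the polynomial you pass in really is the minimal polynomial) reproduces the calculation above and also handles the golden ratio:

```python
import numpy as np

def height_from_min_poly(coeffs):
    """Logarithmic Weil height from an (assumed minimal) integer polynomial.

    coeffs lists a_d, ..., a_0, highest degree first, as numpy.roots expects.
    """
    d = len(coeffs) - 1
    roots = np.roots(coeffs)   # numerical approximations to the conjugates
    return (np.log(abs(coeffs[0]))
            + sum(np.log(max(1.0, abs(r))) for r in roots)) / d

print(height_from_min_poly([1, 0, -2]))    # x^2 - 2: (1/2) log 2, about 0.347
print(height_from_min_poly([1, -2]))       # x - 2:   log 2, about 0.693
print(height_from_min_poly([1, -1, -1]))   # x^2 - x - 1: (1/2) log φ, about 0.241
```

Only the root outside the unit circle contributes for each of these monic examples, which is why the golden ratio, like √2, comes out "simpler" than its conjugate data might suggest.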

Perspective 2: A Sum Over All Viewpoints

Now for a completely different approach. In physics, a global property of a system, like its total energy, is often an integral of a local energy density over all of space. Number theory has a similar idea. It has different ways of measuring "size", called ​​absolute values​​ or ​​places​​.

You are familiar with the "place at infinity", which corresponds to the usual absolute value |x|_∞. But for every prime number p, there is a p-adic absolute value, |x|_p, which essentially measures the power of p in the prime factorization of x. For example, |12|_2 = |2² · 3|_2 = (1/2)² = 1/4, because of the 2² factor. Similarly, |12|_3 = 1/3, and for any other prime, like p = 5, |12|_5 = 1.

These absolute values are linked by one of the most sublime and mysterious facts in all of mathematics: the ​​product formula​​. For any non-zero rational number xxx, if you multiply its size at every single place (all the p-adics and the one at infinity), the result is always exactly 1.

\prod_{v \in M_{\mathbb{Q}}} |x|_v = 1

where M_ℚ denotes the set of all places of ℚ. It's a kind of conservation law for numbers.

Using this, we can define the height as a sum of "local complexities" over all these places:

h(\alpha) = \sum_{v \in M_K} \frac{n_v}{[K:\mathbb{Q}]} \log\max\{1, |\alpha|_v\}

Here, the sum is over all places v of a number field K containing α, and the weights n_v/[K:ℚ] are the local degrees. This formula looks intimidating, but for a rational number it simplifies nicely. The local complexity log max{1, |x|_v} is non-zero only for the place at infinity and for the primes dividing the numerator or denominator of x. When you sum them up, you recover our original, simple definition.
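For rational numbers, all of this can be checked directly. The following sketch (helper names are ours) computes p-adic absolute values, verifies the product formula for −84/125, and confirms that the sum of local terms recovers log max{|a|, |b|}:

```python
from fractions import Fraction
from math import log

def padic_abs(x: Fraction, p: int) -> Fraction:
    """|x|_p = p**(-v), where v is the exponent of p in the nonzero rational x."""
    v, num, den = 0, abs(x.numerator), x.denominator
    while num % p == 0:
        num //= p
        v += 1
    while den % p == 0:
        den //= p
        v -= 1
    return Fraction(p) ** (-v)

def prime_factors(n: int):
    """Distinct prime factors of a nonzero integer, by trial division."""
    n, p, out = abs(n), 2, []
    while p * p <= n:
        if n % p == 0:
            out.append(p)
            while n % p == 0:
                n //= p
        p += 1
    if n > 1:
        out.append(n)
    return out

x = Fraction(-84, 125)
primes = prime_factors(x.numerator) + prime_factors(x.denominator)

# Product formula: |x|_infinity times all the |x|_p equals exactly 1.
product = abs(x)
for p in primes:
    product *= padic_abs(x, p)
assert product == 1

# Summing the local terms log max{1, |x|_v} recovers log max{|a|, |b|}.
h = log(max(1, abs(x))) + sum(log(max(1, padic_abs(x, p))) for p in primes)
print(h)   # log 125, matching max(|-84|, |125|) = 125
```

Only the place v = 5 contributes here, since |−84/125|_5 = 125 > 1 while the archimedean value and the other p-adic values are at most 1.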

For more complex numbers, this definition still works perfectly. It gives the exact same value as the definition from the minimal polynomial. This unity is a powerful sign that the Weil height is a natural and fundamental concept. The fact that the definition does not depend on which number field K we use to do the calculation is a direct consequence of the way places split in field extensions.

The Rules of the Game

A new concept is only as good as the rules it follows. The Weil height obeys a few simple and powerful laws that make it the perfect tool for exploring the number world.

  • ​​Bounded Complexity of Sums and Products:​​ The height of a sum or product is controlled by the heights of the constituents. Specifically:

    • h(αβ) ≤ h(α) + h(β)
    • h(α + β) ≤ h(α) + h(β) + log 2
  These triangle-like inequalities mean that when we combine numbers, their complexity doesn't explode uncontrollably.
  • ​​Northcott's Finiteness Property:​​ This is the crown jewel. For any given bounds on complexity (height) and degree, there are only a finite number of algebraic numbers that fit. In other words, the set

    \{\alpha \in \overline{\mathbb{Q}} : h(\alpha) \le B \text{ and } [\mathbb{Q}(\alpha):\mathbb{Q}] \le d\}

    is finite for any bound B and degree d. This is an astonishingly powerful statement. It says the algebraic numbers are not a continuous smear; they are discrete points, quantized by their complexity. It's like finding that notes in music aren't just any frequency, but are quantized into a scale.

  • Kronecker's Theorem: A direct consequence of Northcott's property is that a number has a height of exactly zero if and only if it is 0 or a root of unity (a number ζ such that ζⁿ = 1 for some positive integer n). These are, in a sense, the arithmetically simplest numbers in existence.
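In the simplest case, degree d = 1, Northcott's property says there are only finitely many rationals of bounded height, and we can simply list them. A small sketch (helper name ours):

```python
from fractions import Fraction
from math import exp, gcd, log

def rationals_of_height_at_most(B: float):
    """All rationals x with h(x) <= B, i.e. max(|a|, |b|) <= exp(B)."""
    N = int(round(exp(B)))
    found = {Fraction(0)}
    for a in range(-N, N + 1):
        for b in range(1, N + 1):
            if a != 0 and gcd(abs(a), b) == 1:   # only lowest-terms pairs
                found.add(Fraction(a, b))
    return found

pts = rationals_of_height_at_most(log(3))
print(sorted(pts))   # 0, ±1/3, ±1/2, ±2/3, ±1, ±3/2, ±2, ±3
```

With the bound h ≤ log 3 the "pond" contains exactly 15 rationals; raising the bound enlarges the set, but it always stays finite.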

From Numbers to Geometry: The Canonical Height

The story of height becomes truly dynamic when we shift our focus from individual numbers to geometric objects defined by numbers, like the set of rational solutions to an equation y² = x³ + Ax + B. This is an elliptic curve, and its set of rational points forms a group—we can "add" two points on the curve to get a third.

The most natural way to define the height of a point P = (x, y) on the curve is to just take the height of its x-coordinate: we call this the naive height, h_x(P) = h(x(P)). This is a great starting point, but it has a subtle flaw. If we take a point P and "double" it on the curve to get [2]P, we'd hope for a simple relationship between their heights. The map that takes x(P) to x([2]P) is a rational function of degree 4, so we might expect h_x([2]P) = 4 h_x(P). But it's not exact! Instead, for any integer m, we find:

h_x([m]P) = m^2 h_x(P) + O(1)

That little O(1) term represents a bounded error. It's like a small amount of friction in an otherwise perfect system. It ruins what would have been a beautiful quadratic relationship. A physicist would not stand for this! There must be a way to find the "true" conserved quantity.

And there is. The brilliant insight of Néron and Tate was to "average out" this error by taking a limit. We define the ​​Néron-Tate canonical height​​ as:

\hat{h}(P) = \lim_{m \to \infty} \frac{1}{4^m} h_x([2^m]P)

This limiting process kills the pesky error term, leaving behind a "perfect" height function with magnificent properties:

  1. It is truly quadratic: ĥ([n]P) = n² ĥ(P) for any integer n. The friction is gone!
  2. It satisfies the parallelogram law: ĥ(P+Q) + ĥ(P−Q) = 2ĥ(P) + 2ĥ(Q). This makes it a true quadratic form on the group of points.
  3. It detects torsion: ĥ(P) = 0 if and only if P is a torsion point (a point such that [n]P = O for some n ≥ 1); these are the group-theoretic analogues of roots of unity.

This canonical height is no longer just a measure of coordinate complexity; it's a deep geometric invariant of the point itself. In fact, this whole procedure is a special case of a more general story in arithmetic dynamics, where one studies the iteration of a map f on a geometric space. The Néron-Tate height corresponds to the multiplication map P ↦ [m]P on an elliptic curve, but the same limit construction gives a canonical height for a vast class of dynamical systems.
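The limit converges fast enough to watch numerically. The sketch below (our own code, for illustration only; a serious implementation would use local height machinery) follows the rational point P = (3, 5) on the curve y² = x³ − 2, repeatedly applying the standard x-coordinate duplication formula and printing the approximations 4⁻ᵐ h_x([2ᵐ]P):

```python
from fractions import Fraction
from math import log

# Curve y^2 = x^3 + A*x + B with A = 0, B = -2, and the point P = (3, 5).
A, B = 0, -2

def double_x(x: Fraction) -> Fraction:
    """x-coordinate of [2]Q in terms of x = x(Q), via the duplication formula."""
    return (x**4 - 2*A*x**2 - 8*B*x + A*A) / (4 * (x**3 + A*x + B))

def naive_height(x: Fraction) -> float:
    return log(max(abs(x.numerator), abs(x.denominator)))

x, vals = Fraction(3), []
for m in range(7):
    vals.append(naive_height(x) / 4**m)   # m-th approximation to h-hat(P)
    x = double_x(x)
print(vals)   # the sequence settles down; its limit is the canonical height
```

The first doubling already produces x([2]P) = 129/100, and the numerators and denominators then grow roughly like the fourth power at each step, exactly the growth the 4⁻ᵐ normalization is designed to cancel.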

This beautiful structure—a sum of local terms, invariant under coordinate changes thanks to the product formula, and refinable into a perfect quadratic form—is what makes the Weil height one of the most powerful and fundamental tools in modern number theory, connecting the discrete world of integers to the continuous landscapes of geometry. It reveals a hidden unity and a rigid structure in the seemingly chaotic universe of numbers.

Applications and Interdisciplinary Connections

In the previous chapter, we painstakingly constructed a new kind of ruler—the Weil height. We learned that this isn't just any ruler; it's a measure of arithmetic complexity. For a simple fraction, it tells you how large its numerator and denominator are. For a more complicated algebraic number, it captures the size of the coefficients of its defining polynomial. We have built our "complexity-meter". The natural, exhilarating next question is: what can we do with it? What secret structures of the mathematical world does it allow us to see?

Think of the invention of the telescope. It didn't just let us see the moon a bit better; it revealed that Jupiter has its own moons, that Venus has phases, and that the faint haze of the Milky Way is made of countless individual stars. It revealed a hidden architecture to the cosmos. The Weil height is our telescope for the cosmos of numbers. It takes abstract, seemingly intractable problems about the structure of numbers and equations and transforms them into tangible questions about size, distance, and shape. Let us now point our new telescope at a few constellations in the mathematical sky and see what wonders it reveals.

The Architecture of Number Systems: A Structural Tool

Let's start with the internal architecture of number systems themselves. Consider a number field like K = ℚ(√2), the set of all numbers of the form a + b√2 where a and b are rational. Within this field, some numbers play a special role: the units. These are the numbers whose reciprocals are also in the system of "integers" of the field (here, numbers a + b√2 with a, b ∈ ℤ). For instance, 1 + √2 is a unit, because its reciprocal, −1 + √2, is also one of these integers. The units form a group, and a famous theorem by Dirichlet tells us this group has a beautifully simple structure: it's built from a finite set of "fundamental" units. In our case, every unit is just a power of a single fundamental unit, ε = 1 + √2 (up to a sign). So any unit u can be written as u = ±εᵐ.

This is a lovely structural statement, but it raises a practical question. If I hand you a complicated-looking unit, say u = 7 + 5√2, how can you tell if it's fundamental or just a power of something simpler? How do you find the exponent m? You could try to compute powers of ε until you find u, but what if m is huge?

This is where our height function comes to the rescue. The height, being a measure of complexity, behaves beautifully with respect to multiplication. In fact, one of the key properties we can derive is that for a unit u = ±εᵐ, its logarithmic height is simply related to the height of the fundamental unit by the equation h(u) = |m| h(ε). Suddenly, our difficult structural question has become a simple problem of division! To find the magnitude of the exponent, we just need to compute the ratio of the heights: |m| = h(u)/h(ε). The height acts as a "logarithm" for the multiplicative structure of the units, turning a search for an exponent into simple arithmetic. The "smallest" non-trivial unit is the one with the lowest non-zero height, and all other units are built from it, their heights neatly falling on a ladder whose rungs are spaced by h(ε). The abstract algebraic structure is laid bare as a simple, uniformly spaced lattice by our height ruler.
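This recipe is short enough to run. In the sketch below (helper name ours), the height of a unit a + b√2 is computed from the minimal-polynomial formula: the leading coefficient is 1, and the two conjugates are a ± b√2, so the height is the average of log max{1, |conjugate|}:

```python
from math import log, sqrt

def unit_height(a: int, b: int) -> float:
    """h(a + b*sqrt(2)), assuming a + b*sqrt(2) is a degree-2 unit of Z[sqrt(2)]."""
    c1, c2 = a + b * sqrt(2), a - b * sqrt(2)
    return (log(max(1.0, abs(c1))) + log(max(1.0, abs(c2)))) / 2

h_eps = unit_height(1, 1)    # fundamental unit eps = 1 + sqrt(2)
h_u = unit_height(7, 5)      # the "complicated" unit u = 7 + 5*sqrt(2)
print(h_u / h_eps)           # the exponent magnitude |m|; here it is 3
```

The ratio comes out as 3, matching the identity (1 + √2)³ = 7 + 5√2: one division replaces a blind search through powers.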

The Quest for Rational Solutions: A Searchlight in the Dark

One of the oldest and most noble quests in mathematics is the search for integer or rational solutions to polynomial equations—a field known as Diophantine geometry. Here, the height function transforms from a structural tool into a powerful searchlight, allowing us to navigate infinite spaces of possibilities.

Elliptic Curves and the Finite Basis

Consider an equation like y² = x³ + Ax + B, which defines an elliptic curve. The set of its rational solutions has a stunning hidden structure: they form a group! You can "add" two solutions to get a third one. The celebrated Mordell-Weil theorem states that this group is finitely generated. This means there's a finite set of "generator" points from which all of the (possibly infinitely many) rational solutions can be produced through the group operation.

This is a profound existence theorem, but it leaves us in the dark. How can we ever find these generators if there are infinitely many points to check? The task seems impossible. But again, the concept of height provides the light. On an elliptic curve, we can define a refined version of the height, called the canonical height ĥ, which is perfectly adapted to the group structure. It satisfies a beautiful quadratic law: the height of a point [n]P (the point P added to itself n times) is exactly ĥ([n]P) = n² ĥ(P).

This quadratic growth is the key. A deep procedure known as "descent" can, in many cases, provide an absolute upper bound, say B, on the canonical height of the generators we are looking for. But this is a bound on an abstract height value; how does it help us find actual numbers? The crucial link is an inequality that relates the abstract canonical height of a point P = (x, y) to the familiar Weil height of its x-coordinate: |ĥ(P) − ½ h(x)| is bounded by some constant that depends only on the curve.

The chain of logic is therefore complete: the descent gives a bound on ĥ(P), which gives a bound on h(x), which in turn gives a bound on the size of the numerator and denominator of the rational number x. The infinite ocean of potential solutions shrinks to a finite, searchable pond. The height function acts as a powerful searchlight, telling us that even if there are infinitely many fish in the sea, the ones we're looking for—the generators—must be swimming in this small, illuminated patch of water.
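The final search step can be carried out naively. The sketch below is ours (a real implementation would obtain the bound from descent rather than assume it): it tests every x = a/b of naive height at most log 200 on the curve y² = x³ − 2, keeping those where the right-hand side is a rational square:

```python
from fractions import Fraction
from math import gcd, isqrt

A, B, N = 0, -2, 200   # curve y^2 = x^3 - 2; search max(|a|, b) <= 200

points = []
for b in range(1, N + 1):
    for a in range(-N, N + 1):
        if gcd(abs(a), b) != 1:
            continue                      # skip non-lowest-terms pairs
        x = Fraction(a, b)
        rhs = x**3 + A*x + B
        if rhs < 0:
            continue
        num, den = rhs.numerator, rhs.denominator
        # rhs (in lowest terms) is a square iff numerator and denominator are
        if isqrt(num)**2 == num and isqrt(den)**2 == den:
            points.append((x, Fraction(isqrt(num), isqrt(den))))
print(points)
```

The search turns up (3, 5) and its double (129/100, 383/1000): every point of bounded height lies in this finite, fully checkable pond.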

Diophantine Approximation: A Universal Speed Limit

A related question is not just whether solutions exist, but how "special" they are. In Diophantine approximation, we ask: how well can we approximate a given algebraic number α with a "simpler" number β? What makes an approximation "good"? A good approximation is one where |α − β| is very small compared to the complexity of β. And what is our measure of complexity? The height, of course!

For approximating an irrational algebraic number α with rational numbers p/q, Roth's theorem tells us there's a hard limit. The inequality |α − p/q| < 1/q^(2+ε) can only have a finite number of solutions for any ε > 0. The exponent 2 is a sharp threshold.
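The sharpness of the exponent 2 is visible numerically. The continued-fraction convergents p/q of √2 achieve it: q²|√2 − p/q| stays bounded rather than shrinking to zero, so no exponent larger than 2 can admit infinitely many solutions. A short sketch (ours):

```python
from math import sqrt

# Convergents p/q of sqrt(2) = [1; 2, 2, 2, ...], its best rational
# approximations. The quantity q**2 * |sqrt(2) - p/q| stays bounded,
# hovering near 1/(2*sqrt(2)) ~ 0.354.
p0, q0 = 1, 1      # first convergent  1/1
p1, q1 = 3, 2      # second convergent 3/2
ratios = []
for _ in range(10):
    ratios.append(q1**2 * abs(sqrt(2) - p1 / q1))
    p0, q0, p1, q1 = p1, q1, 2*p1 + p0, 2*q1 + q0   # next convergent
print(ratios)
```

Every entry sits between 0.3 and 0.4: the approximations are as good as Roth's theorem permits, but no better.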

What if we try to approximate α with more complex numbers, say, algebraic numbers β of degree at most n? W. M. Schmidt's magnificent generalization of Roth's theorem, a consequence of his Subspace Theorem, gives the answer. Using the multiplicative Weil height H(β) (where our logarithmic height is h(β) = ln H(β)), the new threshold appears. For any ε > 0, the inequality |α − β| < H(β)^(−(n+1)−ε) has only finitely many solutions among all algebraic numbers β of degree at most n.

This is a beautiful and profound result. It establishes a "universal speed limit" for approximation. The more complex the tools you are allowed to use (i.e., the larger the degree nnn of the approximating numbers), the better you can approximate α\alphaα. The height function is what makes this law precise, providing the exact trade-off between the quality of the approximation and the complexity of the approximant.

This theme finds its most powerful expression in Baker's theory of linear forms in logarithms. This theory provides explicit, computable lower bounds for quantities like |Λ| = |b₁ ln α₁ + ⋯ + b_n ln α_n|, preventing them from being too close to zero. These bounds have been used to solve a vast range of Diophantine equations. Unsurprisingly, the heights of the algebraic numbers α_i are critical parameters in these bounds. Curiously, the bounds are often weaker for numbers of larger height, a subtle point which reveals the intricate nature of these deep results.

The Geometer's Perspective: A Unifying Language

So far, we have seen the height as a practical tool for solving problems in algebra and number theory. But its deepest role, and the source of its true power, is as a bridge to geometry. In modern mathematics, number theory and geometry have merged into the field of arithmetic geometry, and the height function is its central language.

The Rhythm of Dynamics

What happens to the height of a number when we repeatedly apply a function to it? This is the domain of arithmetic dynamics. Consider the simplest interesting case: the map f(z) = z^d on the projective line. For any starting point α, we generate a sequence of points α, α^d, α^(d²), α^(d³), …. We can define a "canonical height" for this dynamical system by looking at the long-term growth rate: ĥ_f(α) = lim_(n→∞) d⁻ⁿ h(fⁿ(α)). A quick calculation reveals something wonderful. Since h(fⁿ(α)) = h(α^(dⁿ)) = dⁿ h(α), the limit is trivial! We find that ĥ_f(α) = h(α). For this simplest of maps, the standard Weil height is already the perfect, natural measure of complexity. For more complex maps, the canonical height is a new tool that measures the "arithmetic escape rate" of a point's orbit, blending number theory and chaos theory.
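Both claims are easy to test. The sketch below (helper names ours) checks that for f(z) = z² the normalized heights are constant, and then contrasts this with the genuinely dynamical map f(z) = z² + 1, where the limit converges to a new quantity, the arithmetic escape rate of the orbit:

```python
from fractions import Fraction
from math import log

def h(x: Fraction) -> float:
    """Logarithmic Weil height of a rational number."""
    return log(max(abs(x.numerator), abs(x.denominator)))

# For f(z) = z^d, powering a lowest-terms fraction creates no cancellation,
# so h(f^n(a)) = d^n * h(a) exactly and every normalized term equals h(a).
d, a = 2, Fraction(3, 2)
power_vals = [h(a ** d**n) / d**n for n in range(6)]   # each entry is log 3

# Contrast: the orbit of 1 under f(z) = z^2 + 1 is 2, 5, 26, 677, ...
# The normalized heights still converge, but to a new number.
x, orbit_vals = Fraction(1), []
for n in range(1, 9):
    x = x**2 + 1
    orbit_vals.append(h(x) / 2**n)
print(power_vals, orbit_vals)
```

The first list is flat at log 3 ≈ 1.0986; the second settles down quickly to a limit of its own, a canonical height that is no longer the Weil height itself.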

The Grand Unification: Vojta's Conjectures

The journey culminates in a breathtaking vision of unity. Many of the celebrated results we have mentioned—the theorems of Thue, Siegel, Roth, Faltings, and the conjectures of abc and Szpiro—are believed to be mere facets of a single, unified gemstone: a web of deep conjectures proposed by Paul Vojta. And the language of these conjectures is the language of heights.

Faltings' theorem (formerly the Mordell Conjecture), a monumental achievement of 20th-century mathematics, states that a curve of genus greater than one has only a finite number of rational points. Vojta's conjecture provides a stunningly simple heuristic for why this should be true. It predicts an inequality, a sort of "tug-of-war," between two different types of heights defined on the curve. One height, h_(K_X), grows at a certain rate. The conjecture states that it can't grow much faster than ε times another standard height, h_A. But we know from basic properties that h_(K_X) must grow at least at a fixed rate c times h_A. If we choose our ε to be smaller than c, the only way for both inequalities to hold is if the height h_A is bounded! And by Northcott's property, bounded height means a finite number of points. Finiteness from a fight between inequalities!

Perhaps the most spectacular illustration of this unifying power is the connection to the famous abc conjecture. This simple-sounding but profound conjecture about the equation a + b = c is considered one of the most important open problems in Diophantine analysis. It turns out that if you apply Vojta's grand geometric conjecture to the simplest possible case—the projective line P¹ with the three points 0, 1, and ∞ removed—it yields, as a direct consequence, the abc conjecture. The same machinery, when applied to geometric objects called "elliptic surfaces," likewise implies Szpiro's conjecture, a related statement about elliptic curves.

From Thue equations to Faltings' theorem, from unit groups to the abc conjecture, all these seemingly disparate topics are woven together by the common thread of the height function. It provides a dictionary for translating between algebra, geometry, and dynamics. It allows us to prove that some sets of solutions are finite (as in Faltings' theorem, or Siegel's theorem on Thue equations) and helps us compute others (as in the Mordell-Weil theorem). Sometimes these proofs based on height are "ineffective"—they prove finiteness without giving us a way to find all the solutions, like a map showing a finite number of islands but with no coordinates. Other times, they are wonderfully effective.

Our simple ruler, born from the desire to measure the size of a fraction, has led us on an incredible journey. It has shown us the hidden lattice structure of units, illuminated the hiding places of rational points, and revealed itself as the native language of the deepest conjectures that unite number theory and geometry. The humble act of measuring complexity has unveiled a universe of profound and beautiful connections.