Algebraic number theory


Algebraic number theory is a branch of number theory that uses the techniques of abstract algebra to study the integers, rational numbers, and their generalizations. Number-theoretic questions are expressed in terms of properties of algebraic objects such as algebraic number fields and their rings of integers, finite fields, and function fields. These properties, such as whether a ring admits unique factorization, the behavior of ideals, and the Galois groups of fields, can resolve questions of primary importance in number theory, like the existence of solutions to Diophantine equations.

History of algebraic number theory

Diophantus

The beginnings of algebraic number theory can be traced to Diophantine equations, named after the 3rd-century Alexandrian mathematician, Diophantus, who studied them and developed methods for the solution of some kinds of Diophantine equations. A typical Diophantine problem is to find two integers x and y such that their sum, and the sum of their squares, equal two given numbers A and B, respectively:
Diophantine equations have been studied for thousands of years. For example, the solutions to the quadratic Diophantine equation x2 + y2 = z2 are given by the Pythagorean triples, originally solved by the Babylonians. Solutions to linear Diophantine equations, such as 26x + 65y = 13, may be found using the Euclidean algorithm.
Diophantus' major work was the Arithmetica, of which only a portion has survived.

Fermat

was first conjectured by Pierre de Fermat in 1637, famously in the margin of a copy of Arithmetica where he claimed he had a proof that was too large to fit in the margin. No successful proof was published until 1995 despite the efforts of countless mathematicians during the 358 intervening years. The unsolved problem stimulated the development of algebraic number theory in the 19th century and the proof of the modularity theorem in the 20th century.

Gauss

One of the founding works of algebraic number theory, the Disquisitiones Arithmeticae is a textbook of number theory written in Latin by Carl Friedrich Gauss in 1798 when Gauss was 21 and first published in 1801 when he was 24. In this book Gauss brings together results in number theory obtained by mathematicians such as Fermat, Euler, Lagrange and Legendre and adds important new results of his own. Before the Disquisitiones was published, number theory consisted of a collection of isolated theorems and conjectures. Gauss brought the work of his predecessors together with his own original work into a systematic framework, filled in gaps, corrected unsound proofs, and extended the subject in numerous ways.
The Disquisitiones was the starting point for the work of other nineteenth century European mathematicians including Ernst Kummer, Peter Gustav Lejeune Dirichlet and Richard Dedekind. Many of the annotations given by Gauss are in effect announcements of further research of his own, some of which remained unpublished. They must have appeared particularly cryptic to his contemporaries; we can now read them as containing the germs of the theories of L-functions and complex multiplication, in particular.

Dirichlet

In a couple of papers in 1838 and 1839 Peter Gustav Lejeune Dirichlet proved the first class number formula, for quadratic forms. The formula, which Jacobi called a result "touching the utmost of human acumen", opened the way for similar results regarding more general number fields. Based on his research of the structure of the unit group of quadratic fields, he proved the Dirichlet unit theorem, a fundamental result in algebraic number theory.
He first used the pigeonhole principle, a basic counting argument, in the proof of a theorem in diophantine approximation, later named after him Dirichlet's approximation theorem. He published important contributions to Fermat's last theorem, for which he proved the cases n = 5 and n = 14, and to the biquadratic reciprocity law. The Dirichlet divisor problem, for which he found the first results, is still an unsolved problem in number theory despite later contributions by other researchers.

Dedekind

's study of Lejeune Dirichlet's work was what led him to his later study of algebraic number fields and ideals. In 1863, he published Lejeune Dirichlet's lectures on number theory as Vorlesungen über Zahlentheorie about which it has been written that:
"Although the book is assuredly based on Dirichlet's lectures, and although Dedekind himself referred to the book throughout his life as Dirichlet's, the book itself was entirely written by Dedekind, for the most part after Dirichlet's death."

1879 and 1894 editions of the Vorlesungen included supplements introducing the notion of an ideal, fundamental to ring theory. Dedekind defined an ideal as a subset of a set of numbers, composed of algebraic integers that satisfy polynomial equations with integer coefficients. The concept underwent further development in the hands of Hilbert and, especially, of Emmy Noether. Ideals generalize Ernst Eduard Kummer's ideal numbers, devised as part of Kummer's 1843 attempt to prove Fermat's Last Theorem.

Hilbert

unified the field of algebraic number theory with his 1897 treatise Zahlbericht. He also resolved a significant number-theory problem formulated by Waring in 1770. As with [|the finiteness theorem], he used an existence proof that shows there must be solutions for the problem rather than providing a mechanism to produce the answers. He then had little more to publish on the subject; but the emergence of Hilbert modular forms in the dissertation of a student means his name is further attached to a major area.
He made a series of conjectures on class field theory. The concepts were highly influential, and his own contribution lives on in the names of the Hilbert class field and of the Hilbert symbol of local class field theory. Results were mostly proved by 1930, after work by Teiji Takagi.

Artin

established the Artin reciprocity law in a series of papers. This law is a general theorem in number theory that forms a central part of global class field theory. The term "reciprocity law" refers to a long line of more concrete number theoretic statements which it generalized, from the quadratic reciprocity law and the reciprocity laws of Eisenstein and Kummer to Hilbert's product formula for the norm symbol. Artin's result provided a partial solution to Hilbert's ninth problem.

Modern theory

Around 1955, Japanese mathematicians Goro Shimura and Yutaka Taniyama observed a possible link between two apparently completely distinct, branches of mathematics, elliptic curves and modular forms. The resulting modularity theorem states that every elliptic curve is modular, meaning that it can be associated with a unique modular form.
It was initially dismissed as unlikely or highly speculative, and was taken more seriously when number theorist André Weil found evidence supporting it, but no proof; as a result the "astounding" conjecture was often known as the Taniyama–Shimura-Weil conjecture. It became a part of the Langlands program, a list of important conjectures needing proof or disproof.
From 1993 to 1994, Andrew Wiles provided a proof of the modularity theorem for semistable elliptic curves, which, together with Ribet's theorem, provided a proof for Fermat's Last Theorem. Almost every mathematician at the time had previously considered both Fermat's Last Theorem and the Modularity Theorem either impossible or virtually impossible to prove, even given the most cutting edge developments. Wiles first announced his proof in June 1993 in a version that was soon recognized as having a serious gap at a key point. The proof was corrected by Wiles, partly in collaboration with Richard Taylor, and the final, widely accepted version was released in September 1994, and formally published in 1995. The proof uses many techniques from algebraic geometry and number theory, and has many ramifications in these branches of mathematics. It also uses standard constructions of modern algebraic geometry, such as the category of schemes and Iwasawa theory, and other 20th-century techniques not available to Fermat.

Basic notions

Failure of unique factorization

An important property of the ring of integers is that it satisfies the fundamental theorem of arithmetic, that every integer has a factorization into a product of prime numbers, and this factorization is unique up to the ordering of the factors. This may no longer be true in the ring of integers of an algebraic number field.
A prime element is an element of such that if divides a product, then it divides one of the factors or. This property is closely related to primality in the integers, because any positive integer satisfying this property is either or a prime number. However, it is strictly weaker. For example, is not a prime number because it is negative, but it is a prime element. If factorizations into prime elements are permitted, then, even in the integers, there are alternative factorizations such as
In general, if is a unit, meaning a number with a multiplicative inverse in, and if is a prime element, then is also a prime element. Numbers such as and are said to be associate. In the integers, the primes and are associate, but only one of these is positive. Requiring that prime numbers be positive selects a unique element from among a set of associated prime elements. When K is not the rational numbers, however, there is no analog of positivity. For example, in the Gaussian integers, the numbers and are associate because the latter is the product of the former by, but there is no way to single out one as being more canonical than the other. This leads to equations such as
which prove that in, it is not true that factorizations are unique up to the order of the factors. For this reason, one adopts the definition of unique factorization used in unique factorization domains. In a UFD, the prime elements occurring in a factorization are only expected to be unique up to units and their ordering.
However, even with this weaker definition, many rings of integers in algebraic number fields do not admit unique factorization. There is an algebraic obstruction called the ideal class group. When the ideal class group is trivial, the ring is a UFD. When it is not, there is a distinction between a prime element and an irreducible element. An irreducible element is an element such that if, then either or is a unit. These are the elements that cannot be factored any further. Every element in O admits a factorization into irreducible elements, but it may admit more than one. This is because, while all prime elements are irreducible, some irreducible elements may not be prime. For example, consider the ring. In this ring, the numbers, and are irreducible. This means that the number has two factorizations into irreducible elements,
This equation shows that divides the product. If were a prime element, then it would divide or, but it does not, because all elements divisible by are of the form. Similarly, and divide the product, but neither of these elements divides itself, so neither of them are prime. As there is no sense in which the elements, and can be made equivalent, unique factorization fails in. Unlike the situation with units, where uniqueness could be repaired by weakening the definition, overcoming this failure requires a new perspective.

Factorization into prime ideals

If is an ideal in, then there is always a factorization
where each is a prime ideal, and where this expression is unique up to the order of the factors. In particular, this is true if is the principal ideal generated by a single element. This is the strongest sense in which the ring of integers of a general number field admits unique factorization. In the language of ring theory, it says that rings of integers are Dedekind domains.
When is a UFD, every prime ideal is generated by a prime element. Otherwise, there are prime ideals which are not generated by prime elements. In, for instance, the ideal is a prime ideal which cannot be generated by a single element.
Historically, the idea of factoring ideals into prime ideals was preceded by Ernst Kummer's introduction of ideal numbers. These are numbers lying in an extension field of. This extension field is now known as the Hilbert class field. By the principal ideal theorem, every prime ideal of generates a principal ideal of the ring of integers of. A generator of this principal ideal is called an ideal number. Kummer used these as a substitute for the failure of unique factorization in cyclotomic fields. These eventually led Richard Dedekind to introduce a forerunner of ideals and to prove unique factorization of ideals.
An ideal which is prime in the ring of integers in one number field may fail to be prime when extended to a larger number field. Consider, for example, the prime numbers. The corresponding ideals are prime ideals of the ring. However, when this ideal is extended to the Gaussian integers to obtain, it may or may not be prime. For example, the factorization implies that
note that because, the ideals generated by and are the same. A complete answer to the question of which ideals remain prime in the Gaussian integers is provided by Fermat's theorem on sums of two squares. It implies that for an odd prime number, is a prime ideal if and is not a prime ideal if. This, together with the observation that the ideal is prime, provides a complete description of the prime ideals in the Gaussian integers. Generalizing this simple result to more general rings of integers is a basic problem in algebraic number theory. Class field theory accomplishes this goal when K is an abelian extension of Q.

Ideal class group

Unique factorization fails if and only if there are prime ideals that fail to be principal. The object which measures the failure of prime ideals to be principal is called the ideal class group. Defining the ideal class group requires enlarging the set of ideals in a ring of algebraic integers so that they admit a group structure. This is done by generalizing ideals to fractional ideals. A fractional ideal is an additive subgroup of which is closed under multiplication by elements of, meaning that if. All ideals of are also fractional ideals. If and are fractional ideals, then the set of all products of an element in and an element in is also a fractional ideal. This operation makes the set of non-zero fractional ideals into a group. The group identity is the ideal, and the inverse of is a ideal quotient:
The principal fractional ideals, meaning the ones of the form where, form a subgroup of the group of all non-zero fractional ideals. The quotient of the group of non-zero fractional ideals by this subgroup is the ideal class group. Two fractional ideals and represent the same element of the ideal class group if and only if there exists an element such that. Therefore, the ideal class group makes two fractional ideals equivalent if one is as close to being principal as the other is. The ideal class group is generally denoted,, or .
The number of elements in the class group is called the class number of K. The class number of is 2. This means that there are only two ideal classes, the class of principal fractional ideals, and the class of a non-principal fractional ideal such as.
The ideal class group has another description in terms of divisors. These are formal objects which represent possible factorizations of numbers. The divisor group is defined to be the free abelian group generated by the prime ideals of. There is a group homomorphism from, the non-zero elements of up to multiplication, to. Suppose that satisfies
Then is defined to be the divisor
The kernel of is the group of units in, while the cokernel is the ideal class group. In the language of homological algebra, this says that there is an exact sequence of abelian groups,

Real and complex embeddings

Some number fields, such as, can be specified as subfields of the real numbers. Others, such as, cannot. Abstractly, such a specification corresponds to a field homomorphism or. These are called real embeddings and complex embeddings, respectively.
A real quadratic field, with, and not a perfect square, is so-called because it admits two real embeddings but no complex embeddings. These are the field homomorphisms which send to and to, respectively. Dually, an imaginary quadratic field admits no real embeddings but admits a conjugate pair of complex embeddings. One of these embeddings sends to, while the other sends it to its complex conjugate,.
Conventionally, the number of real embeddings of is denoted, while the number of conjugate pairs of complex embeddings is denoted. The signature of K is the pair. It is a theorem that, where is the degree of.
Considering all embeddings at once determines a function
This is called the Minkowski embedding. The subspace of the codomain fixed by complex conjugation is a real vector space of dimension called Minkowski space. Because the Minkowski embedding is defined by field homomorphisms, multiplication of elements of by an element corresponds to multiplication by a diagonal matrix in the Minkowski embedding. The dot product on Minkowski space corresponds to the trace form.
The image of under the Minkowski embedding is a -dimensional lattice. If is a basis for this lattice, then is the discriminant of. The discriminant is denoted or. The covolume of the image of is.

Places

Real and complex embeddings can be put on the same footing as prime ideals by adopting a perspective based on valuations. Consider, for example, the integers. In addition to the usual absolute value function |·| : QR, there are p-adic absolute value functions |·|p : QR, defined for each prime number p, which measure divisibility by p. Ostrowski's theorem states that these are all possible absolute value functions on Q. Therefore, absolute values are a common language to describe both the real embedding of Q and the prime numbers.
A place of an algebraic number field is an equivalence class of absolute value functions on K. There are two types of places. There is a -adic absolute value for each prime ideal of O, and, like the p-adic absolute values, it measures divisibility. These are called finite places. The other type of place is specified using a real or complex embedding of K and the standard absolute value function on R or C. These are infinite places. Because absolute values are unable to distinguish between a complex embedding and its conjugate, a complex embedding and its conjugate determine the same place. Therefore, there are real places and complex places. Because places encompass the primes, places are sometimes referred to as primes. When this is done, finite places are called finite primes and infinite places are called infinite primes. If is a valuation corresponding to an absolute value, then one frequently writes to mean that is an infinite place and to mean that it is a finite place.
Considering all the places of the field together produces the adele ring of the number field. The adele ring allows one to simultaneously track all the data available using absolute values. This produces significant advantages in situations where the behavior at one place can affect the behavior at other places, as in the Artin reciprocity law.

Places at infinity geometrically

There is a geometric analogy for places at infinity which holds on the function fields of curves. For example, let and be a smooth, projective, algebraic curve. The function field has many absolute values, or places, and each corresponds to a point on the curve. If is the projective completion of an affine curve
then the points in
correspond to the places at infinity. Then, the completion of at one of these points gives an analogue of the -adics.
For example, if then its function field is isomorphic to where is an indeterminant and the field is the field of fractions of polynomials in. Then, a place at a point measures the order of vanishing or the order of a pole of a fraction of polynomials at the point. For example, if, so on the affine chart this corresponds to the point, the valuation measures the order of vanishing of minus the order of vanishing of at. The function field of the completion at the place is then which is the field of power series in the variable, so an element is of the form
for some. For the place at infinity, this corresponds to the function field which are power series of the form

Units

The integers have only two units, and. Other rings of integers may admit more units. The Gaussian integers have four units, the previous two as well as. The Eisenstein integers have six units. The integers in real quadratic number fields have infinitely many units. For example, in, every power of is a unit, and all these powers are distinct.
In general, the group of units of, denoted, is a finitely generated abelian group. The fundamental theorem of finitely generated abelian groups therefore implies that it is a direct sum of a torsion part and a free part. Reinterpreting this in the context of a number field, the torsion part consists of the roots of unity that lie in. This group is cyclic. The free part is described by Dirichlet's unit theorem. This theorem says that rank of the free part is. Thus, for example, the only fields for which the rank of the free part is zero are and the imaginary quadratic fields. A more precise statement giving the structure of O×Z Q as a Galois module for the Galois group of K/Q is also possible.
The free part of the unit group can be studied using the infinite places of. Consider the function
where varies over the infinite places of and |·|v is the absolute value associated with. The function is a homomorphism from to a real vector space. It can be shown that the image of is a lattice that spans the hyperplane defined by The covolume of this lattice is the regulator of the number field. One of the simplifications made possible by working with the adele ring is that there is a single object, the idele class group, that describes both the quotient by this lattice and the ideal class group.

Zeta function

The Dedekind zeta function of a number field, analogous to the Riemann zeta function is an analytic object which describes the behavior of prime ideals in. When is an abelian extension of, Dedekind zeta functions are products of Dirichlet L-functions, with there being one factor for each Dirichlet character. The trivial character corresponds to the Riemann zeta function. When is a Galois extension, the Dedekind zeta function is the Artin L-function of the regular representation of the Galois group of, and it has a factorization in terms of irreducible Artin representations of the Galois group.
The zeta function is related to the other invariants described above by the class number formula.

Local fields

a number field K at a place w gives a complete field. If the valuation is Archimedean, one obtains R or C, if it is non-Archimedean and lies over a prime p of the rationals, one obtains a finite extension a complete, discrete valued field with finite residue field. This process simplifies the arithmetic of the field and allows the local study of problems. For example, the Kronecker–Weber theorem can be deduced easily from the analogous local statement. The philosophy behind the study of local fields is largely motivated by geometric methods. In algebraic geometry, it is common to study varieties locally at a point by localizing to a maximal ideal. Global information can then be recovered by gluing together local data. This spirit is adopted in algebraic number theory. Given a prime in the ring of algebraic integers in a number field, it is desirable to study the field locally at that prime. Therefore, one localizes the ring of algebraic integers to that prime and then completes the fraction field much in the spirit of geometry.

Major results

Finiteness of the class group

One of the classical results in algebraic number theory is that the ideal class group of an algebraic number field K is finite. This is a consequence of Minkowski's theorem since there are only finitely many Integral ideals with norm less than a fixed positive integer page 78. The order of the class group is called the class number, and is often denoted by the letter h.

Dirichlet's unit theorem

Dirichlet's unit theorem provides a description of the structure of the multiplicative group of units O× of the ring of integers O. Specifically, it states that O× is isomorphic to G × Zr, where G is the finite cyclic group consisting of all the roots of unity in O, and r = r1 + r2 − 1 denotes the number of real embeddings. In other words, O× is a finitely generated abelian group of rank r1 + r2 − 1 whose torsion consists of the roots of unity in O.

Reciprocity laws

In terms of the Legendre symbol, the law of quadratic reciprocity for positive odd primes states
A reciprocity law is a generalization of the law of quadratic reciprocity.
There are several different ways to express reciprocity laws. The early reciprocity laws found in the 19th century were usually expressed in terms of a power residue symbol generalizing the quadratic reciprocity symbol, that describes when a prime number is an nth power residue modulo another prime, and gave a relation between and. Hilbert reformulated the reciprocity laws as saying that a product over p of Hilbert symbols, taking values in roots of unity, is equal to 1. Artin's reformulated reciprocity law states that the Artin symbol from ideals to elements of a Galois group is trivial on a certain subgroup. Several more recent generalizations express reciprocity laws using cohomology of groups or representations of adelic groups or algebraic K-groups, and their relationship with the original quadratic reciprocity law can be hard to see.

Class number formula

The class number formula relates many important invariants of a number field to a special value of its Dedekind zeta function.

Related areas

Algebraic number theory interacts with many other mathematical disciplines. It uses tools from homological algebra. Via the analogy of function fields vs. number fields, it relies on techniques and ideas from algebraic geometry. Moreover, the study of higher-dimensional schemes over Z instead of number rings is referred to as arithmetic geometry. Algebraic number theory is also used in the study of arithmetic hyperbolic 3-manifolds.

Introductory texts