A Diophantine equation begins with a severe restriction: the unknowns are not allowed to move through a continuum. A polynomial equation such as $x^2 + y^2 = 1$ is a curve over $\mathbb{R}$, but over $\mathbb{Z}$ it becomes a finite arithmetic question. The same symbols can describe a geometric object, a congruence problem, a search algorithm, or a theorem about the impossibility of solutions.
The subject is built around one recurring tension. Polynomial equations are flexible over large fields, but the integral or rational points on them can be sparse, structured, or absent for reasons that are invisible over $\mathbb{R}$ or $\mathbb{C}$. Diophantine methods try to detect this arithmetic structure without losing sight of the underlying algebraic shape.
[example: A Circle with Few Integral Points]
The notation $\mathbb{Z}[x,y]$ means the ring of polynomials in the variables $x$ and $y$ with integer coefficients. Let $F(x,y)=x^2+y^2-1\in \mathbb{Z}[x,y]$. Over $\mathbb{R}$, the equation $F(x,y)=0$ is
\begin{align*}
x^2+y^2-1=0,
\end{align*}
equivalently
\begin{align*}
x^2+y^2=1,
\end{align*}
which is the unit circle and has infinitely many real solutions, for example $(\cos \theta,\sin \theta)$ for each $\theta\in\mathbb{R}$.
Over $\mathbb{Z}$, suppose $(x,y)\in\mathbb{Z}^2$ satisfies $x^2+y^2=1$. Since $x^2$ and $y^2$ are nonnegative integers whose sum is $1$, one of them is $1$ and the other is $0$. If $x^2=1$ and $y^2=0$, then $x=1$ or $x=-1$, and $y=0$, giving $(1,0)$ and $(-1,0)$. If $x^2=0$ and $y^2=1$, then $x=0$, and $y=1$ or $y=-1$, giving $(0,1)$ and $(0,-1)$. Substituting these four pairs gives $1^2+0^2=1$, $(-1)^2+0^2=1$, $0^2+1^2=1$, and $0^2+(-1)^2=1$, so the integral solution set is exactly
\begin{align*}
\{(1,0),(-1,0),(0,1),(0,-1)\}.
\end{align*}
The same polynomial therefore defines a continuum of real points but only four integral points.
[/example]
This first example shows why the coefficient ring and the allowed solution set must be part of the definition. The equation alone does not determine the problem; the same polynomial asks different questions over $\mathbb{N}$, $\mathbb{Z}$, $\mathbb{Q}$, or a finite ring.
## Definition
A Diophantine equation records a polynomial constraint together with a specified arithmetic domain for its unknowns. This lets us distinguish the algebraic expression from the arithmetic question of which tuples are allowed. In the formal definition, $R$ is the ambient commutative ring in which the polynomial can be evaluated, and $S\subset R$ is the chosen set from which the unknowns are allowed to be drawn.
[definition: Diophantine Equation]
Let $R$ be a commutative ring, and let $S\subset R$. A Diophantine equation over $S$ is an equation
\begin{align*}
F(x_1,\ldots,x_n)=0
\end{align*}
where $n\in \mathbb{N}$, $F\in \mathbb{Z}[x_1,\ldots,x_n]$, and the unknown tuple $(x_1,\ldots,x_n)$ is required to lie in $S^n$.
[/definition]
The most common choices are $S=\mathbb{Z}$, $S=\mathbb{N}$, and $S=\mathbb{Q}$. To compare these choices, we need a named object that records all permitted tuples satisfying the same polynomial equation.
[definition: Solution Set of a Diophantine Equation]
Let $R$ be a commutative ring, let $S\subset R$, let $F\in \mathbb{Z}[x_1,\ldots,x_n]$, and let the Diophantine equation over $S$ be $F(x_1,\ldots,x_n)=0$. Its solution set over $S$ is
\begin{align*}
X_F(S)=\{(a_1,\ldots,a_n)\in S^n : F(a_1,\ldots,a_n)=0\}.
\end{align*}
[/definition]
The notation $X_F(S)$ separates two tasks: understanding the polynomial $F$ and understanding the arithmetic set of its solutions. Many natural problems impose several polynomial conditions at once, so the next step is to name the multi-equation version without pretending it is merely a typographical variant.
[definition: System of Diophantine Equations]
Let $R$ be a commutative ring, and let $S\subset R$. A system of [Diophantine equations](/page/Diophantine%20Equations) over $S$ is a finite collection
\begin{align*}
F_1(x_1,\ldots,x_n)=0,\quad \ldots,\quad F_m(x_1,\ldots,x_n)=0
\end{align*}
with $m,n\in \mathbb{N}$, each $F_j\in \mathbb{Z}[x_1,\ldots,x_n]$, and the unknown tuple $(x_1,\ldots,x_n)$ required to lie in $S^n$.
[/definition]
A system might seem more expressive than a single equation, because each component can impose an independent constraint. The obstruction to collapsing the system is that one equation must remember all the separate vanishing conditions without creating new integer solutions.
Over $\mathbb{Z}$ there is a special way around this obstruction: a sum of integer squares is zero only when every square is zero. Thus the single equation
\begin{align*}
F_1(x)^2+\cdots+F_m(x)^2=0
\end{align*}
forces all the original equations at once, so finite systems and single Diophantine equations define the same kinds of integer solution sets.
[quotetheorem:9916]
The reduction is special to ordered domains such as $\mathbb{Z}$. Over a ring where nonzero squares can sum to zero, the same replacement may add false solutions.
## Linear Diophantine Equations
The first complete theory appears in degree one. Here the problem is not geometric curvature or factorisation, but divisibility: which integer combinations of the coefficients can be produced?
A linear equation in two variables is the prototype because every solution is obtained from one solution by moving along a one-dimensional lattice. The obstruction is the [greatest common divisor](/page/Greatest%20Common%20Divisor) of the coefficients, so the definition isolates the equations for which this gcd test is the right tool.
[definition: Linear Diophantine Equation]
A linear Diophantine equation over $\mathbb{Z}$ is a Diophantine equation of the form
\begin{align*}
a_1x_1+\cdots+a_nx_n=b
\end{align*}
where $n\in \mathbb{N}$ and $a_1,\ldots,a_n,b\in \mathbb{Z}$.
[/definition]
The definition places the constant on the right because the divisibility criterion is easier to state in that form. The central question is now concrete: when does the integer $c$ lie among all integer linear combinations $ax+by$?
[quotetheorem:9917]
Existence is only the first half of the linear story. Once one solution exists, the remaining question is how all other solutions are arranged, and the answer is that they form a translate of the integer kernel of the map $(x,y)\mapsto ax+by$.
[quotetheorem:9918]
[example: Solving $15x+21y=6$]
We compute the greatest common divisor by the Euclidean algorithm:
\begin{align*}
21=1\cdot 15+6,\qquad 15=2\cdot 6+3,\qquad 6=2\cdot 3+0.
\end{align*}
Thus $\gcd(15,21)=3$, and since $6=2\cdot 3$, the divisibility condition in *Bezout Criterion for a Linear Diophantine Equation* is satisfied, so integral solutions exist.
Dividing the equation $15x+21y=6$ by $3$ gives
\begin{align*}
5x+7y=2.
\end{align*}
The identity
\begin{align*}
5\cdot 3+7\cdot(-2)=15-14=1
\end{align*}
gives a Bezout relation for $5$ and $7$. Multiplying both sides by $2$ gives
\begin{align*}
5\cdot 6+7\cdot(-4)=2.
\end{align*}
Hence $(x_0,y_0)=(6,-4)$ is one solution of $5x+7y=2$, and therefore also of $15x+21y=6$ because
\begin{align*}
15\cdot 6+21\cdot(-4)=90-84=6.
\end{align*}
By *Parametrisation of Two-Variable Linear Solutions*, applied to $5x+7y=2$ with $\gcd(5,7)=1$, every integral solution is
\begin{align*}
(x,y)=(6+7t,-4-5t)
\end{align*}
for some $t\in\mathbb{Z}$. Substituting this pair verifies the parametrisation:
\begin{align*}
15(6+7t)+21(-4-5t)=90+105t-84-105t=6.
\end{align*}
Thus the integer solutions form the one-parameter family $(6+7t,-4-5t)$ with $t\in\mathbb{Z}$.
[/example]
Linear equations are the friendly edge of the theory: the obstruction is visible, finite, and computable. Nonlinear equations often keep the same local divisibility flavour but add new phenomena such as factorisation, infinite descent, and rational parametrisation.
## Congruences and Local Obstructions
### Modular Shadows of Integral Equations
Before searching for integral solutions, it is natural to reduce the equation modulo $m$. If there is no solution after reduction, then no integral solution could have existed. This gives a fast way to prove impossibility without estimating the size of all possible integers.
A congruence version of a Diophantine equation replaces equality in $\mathbb{Z}$ by equality in a [quotient ring](/page/Quotient%20Ring). For a positive integer $m$, the notation $\mathbb{Z}/m\mathbb{Z}$ denotes the ring of residue classes modulo $m$, and $(\mathbb{Z}/m\mathbb{Z})^n$ denotes ordered $n$-tuples of such residue classes. The polynomial is the same, but each variable is now a residue class, so we need a separate definition for this finite arithmetic shadow.
[definition: Polynomial Congruence]
Let $m\in \mathbb{N}$. A polynomial congruence modulo $m$ is a condition
\begin{align*}
F(x_1,\ldots,x_n)\equiv 0 \pmod{m}
\end{align*}
where $F\in \mathbb{Z}[x_1,\ldots,x_n]$ and the unknown tuple lies in $(\mathbb{Z}/m\mathbb{Z})^n$.
[/definition]
The next problem is to identify when a finite congruence computation proves a statement about the original integral equation. We therefore name the modulus that detects an impossibility before stating why such a modulus rules out all integer solutions.
[definition: Local Obstruction]
Let $F\in \mathbb{Z}[x_1,\ldots,x_n]$. A modulus $m\in \mathbb{N}$ is a local obstruction to the integral equation $F(x_1,\ldots,x_n)=0$ if the congruence
\begin{align*}
F(x_1,\ldots,x_n)\equiv 0\pmod{m}
\end{align*}
has no solution in $(\mathbb{Z}/m\mathbb{Z})^n$.
[/definition]
The point of naming a local obstruction is that it gives a certificate of global failure. Any integral solution would reduce to a residue-class solution modulo $m$, so absence modulo $m$ forbids existence over $\mathbb{Z}$.
[quotetheorem:4523]
[example: A Modulo $4$ Obstruction]
We show that $x^2+y^2=3$ has no solution in $\mathbb{Z}^2$ by reducing modulo $4$. For any integer $n$, its residue modulo $4$ is one of $0,1,2,3$. Squaring these four possibilities gives
\begin{align*}
0^2\equiv 0,\quad 1^2\equiv 1,\quad 2^2\equiv 0,\quad 3^2\equiv 1 \pmod{4}.
\end{align*}
Thus every square modulo $4$ is congruent to either $0$ or $1$.
If $x,y\in\mathbb{Z}$, then each of $x^2$ and $y^2$ is congruent to $0$ or $1$ modulo $4$. Therefore the possible residues of $x^2+y^2$ modulo $4$ are obtained from
\begin{align*}
0+0\equiv 0,\quad 0+1\equiv 1,\quad 1+0\equiv 1,\quad 1+1\equiv 2 \pmod{4}.
\end{align*}
So $x^2+y^2$ can be congruent only to $0$, $1$, or $2$ modulo $4$, never to $3$. Since any integral solution of $x^2+y^2=3$ would have to satisfy $x^2+y^2\equiv 3\pmod{4}$, no such integral solution exists. Hence the modulus $4$ is a local obstruction.
[/example]
Local testing is powerful because it converts an infinite search into finitely many residue checks. Its limitation is just as important: passing every simple congruence test does not by itself produce an integral point.
### Prime-Power Approximation
A refined local view asks for solutions modulo every power of a prime. These compatible systems lead toward $p$-adic numbers, where congruence information is organised into a complete arithmetic space. Here $\mathbb{Z}_p$ denotes the ring of $p$-adic integers, built to record compatible residues modulo $p,p^2,p^3,\ldots$, while $\mathbb{Q}_p$ denotes its field of fractions, the field of $p$-adic numbers.
[definition: $p$-adic Local Solubility]
Let $p$ be a prime and let $F\in \mathbb{Z}[x_1,\ldots,x_n]$. The equation $F(x_1,\ldots,x_n)=0$ is locally soluble at $p$ if there exists $(a_1,\ldots,a_n)\in \mathbb{Z}_p^n$ such that
\begin{align*}
F(a_1,\ldots,a_n)=0
\end{align*}
in $\mathbb{Z}_p$.
[/definition]
This definition is an integral local condition: it asks for a solution in $\mathbb{Z}_p^n$, not merely in $\mathbb{Q}_p^n$. That distinction matters because allowing $p$-adic denominators changes the problem. The definition is still the bridge between elementary congruences and the local fields used in algebraic number theory, but the next local-to-global question is a rational one rather than an integral one.
Local solubility also exposes the limit of congruence methods. If an equation has no solution modulo some $m$, then it has no integral solution; but the converse direction is false in many important families. The guiding question is whether compatible solutions in all local worlds force a genuine global solution. At this point the discussion changes category: instead of integral points over $\mathbb{Z}$ and integral local points over $\mathbb{Z}_p$, the Hasse principle concerns rational points on varieties over $\mathbb{Q}$.
For the purposes of this note, a variety over a field $K$ may be read as the geometric object cut out by polynomial equations whose coefficients lie in $K$, together with the usual algebraic-geometric refinements needed to treat it intrinsically. If $X$ is such an object, an $L$-point of $X$, where $L$ is a field containing $K$, means a solution of the defining equations with coordinates in $L$. Thus a $\mathbb{Q}$-point is a rational solution, an $\mathbb{R}$-point is a real solution, and a $\mathbb{Q}_p$-point is a $p$-adic solution. The notation $X\in\mathcal{C}$ means that the variety $X$ belongs to the class of varieties under discussion.
With this language in place, the local-to-global question can be stated without referring to a particular equation or choice of coordinates. The principle below packages the hoped-for implication: if a rational variety has points in every completion of $\mathbb{Q}$, then those local witnesses should come from an actual rational point.
[definition: Hasse Principle]
A class $\mathcal{C}$ of varieties over $\mathbb{Q}$ satisfies the Hasse principle if, for every variety $X\in \mathcal{C}$, the existence of points on $X$ over $\mathbb{R}$ and over $\mathbb{Q}_p$ for every prime $p$ implies the existence of a point on $X$ over $\mathbb{Q}$.
[/definition]
The Hasse principle is true for several classical quadratic problems and fails for more complicated varieties. This failure is one reason Diophantine equations cannot be reduced to modular arithmetic alone.
## Quadratic Equations and Descent
### Pythagorean Triples
Quadratic Diophantine equations are the first nonlinear equations with a rich but still accessible theory. They combine congruence restrictions, factorisation, geometry, and descent arguments.
The Pythagorean equation is central because it has infinitely many nonzero integer solutions and a complete parametrisation. It shows how a curve can be controlled by drawing rational lines through one known point.
[definition: Pythagorean Triple]
A Pythagorean triple is a tuple $(a,b,c)\in \mathbb{Z}^3$ satisfying
\begin{align*}
a^2+b^2=c^2.
\end{align*}
It is primitive if $\gcd(a,b,c)=1$.
[/definition]
Primitive triples are the arithmetic version of rational points on the unit circle. Dividing by $c^2$ turns the equation into
\begin{align*}
\left(\frac{a}{c}\right)^2+\left(\frac{b}{c}\right)^2=1,
\end{align*}
but clearing denominators can introduce common factors or duplicate the same triple in different ways. The arithmetic task is therefore to identify exactly which rational parameters produce positive primitive triples, and how the parity and coprimality conditions remove the unwanted repetitions.
A usable parametrisation must do more than produce examples: it should give every positive primitive triple exactly up to the expected interchange of the two legs. The theorem below supplies that classification and makes the necessary coprimality and parity restrictions explicit.
[quotetheorem:4485]
[example: Generating the Triple $(20,21,29)$]
Take $m=5$ and $n=2$. We have $\gcd(5,2)=1$, and $5\not\equiv 2\pmod{2}$ because $5\equiv 1\pmod{2}$ while $2\equiv 0\pmod{2}$. Thus the hypotheses on $m$ and $n$ in *Parametrisation of Primitive Pythagorean Triples* are satisfied.
Now compute each coordinate in the parametrisation:
\begin{align*}
m^2-n^2=5^2-2^2=25-4=21.
\end{align*}
\begin{align*}
2mn=2\cdot 5\cdot 2=20.
\end{align*}
\begin{align*}
m^2+n^2=5^2+2^2=25+4=29.
\end{align*}
Therefore the formula gives the triple $(21,20,29)$, which is the same Pythagorean triple as $(20,21,29)$ after interchanging the two legs. Checking the equation explicitly,
\begin{align*}
20^2+21^2=400+441=841=29^2.
\end{align*}
Since the parametrisation theorem gives a primitive triple under these coprimality and parity conditions, $(20,21,29)$ is a primitive Pythagorean triple.
[/example]
### Descent and Impossibility
Parametrisation explains why some quadratic equations have many solutions. Other equations resist because any proposed primitive solution would force a smaller one, and this descent contradicts the existence of a smallest positive size.
[definition: Infinite Descent]
An infinite descent argument is a proof method in which the existence of a solution with a positive integer size measure implies the existence of another solution with a strictly smaller positive integer size measure.
[/definition]
Descent needs a mechanism that turns one alleged solution into a strictly smaller one. In this example the mechanism starts locally: congruence information forces all variables in a hypothetical primitive solution to share a factor, contradicting primitivity and leaving no minimal solution from which descent could begin.
[quotetheorem:9919]
This theorem can be read as a local obstruction modulo $3$ together with primitivity. It illustrates a frequent pattern: congruence information forces divisibility, and divisibility contradicts the assumption that the solution was primitive.
## Rational Points and Parametrisation
The rest of the page is a transition from elementary Diophantine equations to the standard language used for rational points on curves. The terms introduced below are included to make later theorem cards readable, not to develop algebraic geometry from the ground up.
Integral solutions are rigid because denominators are forbidden. Rational solutions are more flexible, and for curves of low degree the presence of one rational point can generate many others.
To make that idea precise, we separate rational solubility from integral solubility. The same polynomial may have rational solutions but no integral solutions, or integral solutions only after multiplying through by a scale.
[definition: Rational Solution]
Let $F\in \mathbb{Z}[x_1,\ldots,x_n]$. A rational solution of $F(x_1,\ldots,x_n)=0$ is a tuple $(q_1,\ldots,q_n)\in \mathbb{Q}^n$ such that
\begin{align*}
F(q_1,\ldots,q_n)=0.
\end{align*}
[/definition]
The next question is when rational solutions can be described by explicit parameters. For the unit circle, a rational line through one known rational point intersects the circle in exactly one further rational point, producing the following complete formula.
[quotetheorem:4478]
[example: From a Rational Parameter to an Integral Triple]
Take $t=\frac{2}{3}$ in the formula from *Rational Parametrisation of the Unit Circle*. First compute the common denominator:
\begin{align*}
1+t^2=1+\left(\frac{2}{3}\right)^2=1+\frac{4}{9}=\frac{9}{9}+\frac{4}{9}=\frac{13}{9}.
\end{align*}
The $x$-coordinate is
\begin{align*}
x=\frac{1-t^2}{1+t^2}=\frac{1-\frac{4}{9}}{\frac{13}{9}}=\frac{\frac{5}{9}}{\frac{13}{9}}=\frac{5}{9}\cdot\frac{9}{13}=\frac{5}{13}.
\end{align*}
The $y$-coordinate is
\begin{align*}
y=\frac{2t}{1+t^2}=\frac{2\cdot\frac{2}{3}}{\frac{13}{9}}=\frac{\frac{4}{3}}{\frac{13}{9}}=\frac{4}{3}\cdot\frac{9}{13}=\frac{36}{39}=\frac{12}{13}.
\end{align*}
Thus the rational point is $\left(\frac{5}{13},\frac{12}{13}\right)$. Substituting it into the unit-circle equation gives
\begin{align*}
\left(\frac{5}{13}\right)^2+\left(\frac{12}{13}\right)^2=\frac{25}{169}+\frac{144}{169}=\frac{169}{169}=1.
\end{align*}
Multiplying this identity by $13^2$ clears denominators:
\begin{align*}
13^2\left(\left(\frac{5}{13}\right)^2+\left(\frac{12}{13}\right)^2\right)=13^2\cdot 1.
\end{align*}
The left-hand side is
\begin{align*}
13^2\cdot\frac{5^2}{13^2}+13^2\cdot\frac{12^2}{13^2}=5^2+12^2.
\end{align*}
Therefore
\begin{align*}
5^2+12^2=13^2.
\end{align*}
So the rational point with parameter $t=\frac{2}{3}$ produces the integral Pythagorean triple $(5,12,13)$.
[/example]
Parametrisation is a success story, but it is not the general behaviour of Diophantine equations. Curves of higher degree can have finitely many rational points, infinitely many rational points with no simple parametrisation, or no rational points despite having real points.
## Elliptic Curves and Higher-Degree Phenomena
An elliptic curve is the first major case where rational points carry both arithmetic and geometry. In this section, a curve over $\mathbb{Q}$ means the solution set of polynomial equations with rational coefficients, interpreted with the projective points needed to include points at infinity. A rational point on such a curve is a point whose [homogeneous coordinates](/page/Homogeneous%20Coordinates) may be chosen in $\mathbb{Q}$.
The phrase smooth projective curve means that the curve has no singular points after passing to [projective space](/page/Projective%20Space); for this page, it is enough to think of smoothness as excluding cusps and self-intersections, and projective space as adding the points at infinity needed for a complete geometric object. The genus is a nonnegative integer measuring the curve's intrinsic complexity: genus $0$ curves behave like lines or conics, while genus $1$ curves are the first curves with elliptic-curve group structure. A specified rational point is part of the data because it serves as the identity element for that group law. In a short Weierstrass model, this base point is usually the point at infinity, and nonsingularity means that the cubic has no repeated root.
### Cubics with Group Structure
Cubic equations introduce a new kind of structure. Some plane cubic curves carry an [abelian group](/page/Abelian%20Group) law on their rational points, and the arithmetic of that group becomes a central object.
A standard form is needed because not every cubic is presented in a way that exposes its smoothness or group operation. The Weierstrass form gives a manageable model for many arithmetic questions.
[definition: Elliptic Curve over $\mathbb{Q}$]
An elliptic curve over $\mathbb{Q}$ is a smooth projective curve of genus $1$ over $\mathbb{Q}$ with a specified rational point. A short Weierstrass model for such a curve is the projective curve obtained by adjoining the point at infinity to the affine equation
\begin{align*}
y^2=x^3+Ax+B
\end{align*}
where $A,B\in \mathbb{Q}$ and
\begin{align*}
-16(4A^3+27B^2)\ne 0.
\end{align*}
[/definition]
The definition is geometric, but the Diophantine question is arithmetic: describe the rational or integral points satisfying the equation. The specified rational point serves as the identity for the group law, and the key structural theorem says that the rational points have finite algebraic complexity.
[quotetheorem:4508]
[Mordell's theorem](/theorems/4508) does not list the points, but it changes the problem from an arbitrary infinite search into the study of a finitely generated abelian group. This is the beginning of modern arithmetic geometry.
[example: A Cubic with a Visible Rational Point]
Consider the affine cubic
\begin{align*}
E: y^2=x^3-x.
\end{align*}
This is the short Weierstrass equation with $A=-1$ and $B=0$, and its discriminant factor is
\begin{align*}
-16(4A^3+27B^2)=-16(4(-1)^3+27\cdot 0^2)=-16(-4)=64\ne 0.
\end{align*}
Thus the curve is nonsingular in this model.
Now substitute the three proposed points. For $(0,0)$, the left-hand side is $0^2=0$, while the right-hand side is
\begin{align*}
0^3-0=0-0=0.
\end{align*}
For $(1,0)$, the left-hand side is $0^2=0$, while the right-hand side is
\begin{align*}
1^3-1=1-1=0.
\end{align*}
For $(-1,0)$, the left-hand side is $0^2=0$, while the right-hand side is
\begin{align*}
(-1)^3-(-1)=-1+1=0.
\end{align*}
Therefore $(0,0)$, $(1,0)$, and $(-1,0)$ all lie in $E(\mathbb{Q})$.
In the elliptic-curve group law on a short Weierstrass curve, the inverse of an affine point $(x,y)$ is $(x,-y)$. Each of these three points has $y=0$, so each point is equal to its own inverse:
\begin{align*}
(x,0)=-(x,0).
\end{align*}
Adding $(x,0)$ to both sides gives
\begin{align*}
(x,0)+(x,0)=O,
\end{align*}
where $O$ is the point at infinity. Hence these visible rational points are nonidentity points of order $2$, showing that the rational solutions of a cubic Diophantine equation can already contain group-theoretic structure.
[/example]
### Fermat-Type Equations
Higher-degree equations also include some of the most famous impossibility theorems in mathematics. Fermat's equation is simple to state, but its resolution lies far beyond elementary congruence checks.
The exponent-three case is the first genuinely cubic instance of Fermat's equation. It shows a different kind of obstruction from the linear parametrizations and congruence filters above: the equation has many apparent local and algebraic symmetries, but no nonzero integral solution. Including this case here marks the point where Diophantine equations stop being mainly a matter of solving for parameters and become a study of structural impossibility.
[quotetheorem:4514]
[Fermat's Last Theorem](/theorems/4789) is a warning about scale. A polynomial equation with three variables and one line of notation can require centuries of new ideas; even the special exponent-three case already needs ideas beyond the elementary toolkit developed on this page.
## Decidability and Hilbert's Tenth Problem
The examples so far suggest a practical hope: perhaps there is a universal algorithm that decides whether any given Diophantine equation has an integral solution. Hilbert's tenth problem asked for exactly such a procedure over $\mathbb{Z}$.
To state the obstruction, we need the idea of a set of integers being described by the existence of solutions to a polynomial equation. Here $\mathbb{Z}^r$ means the set of ordered $r$-tuples of integers, and $\mathbb{Z}[x_1,\ldots,x_r,y_1,\ldots,y_m]$ means the ring of polynomials in the displayed variables with integer coefficients. The variables $x_i$ record the visible integer inputs, while the auxiliary variables $y_j$ are allowed to witness membership in the set. This turns Diophantine equations into a language for encoding computation.
[definition: Diophantine Set]
A subset $A\subset \mathbb{Z}^r$ is Diophantine if there exist $m\in \mathbb{N}$ and a polynomial $F\in \mathbb{Z}[x_1,\ldots,x_r,y_1,\ldots,y_m]$ such that, for every $(a_1,\ldots,a_r)\in \mathbb{Z}^r$,
\begin{align*}
(a_1,\ldots,a_r)\in A
\end{align*}
if and only if there exists $(b_1,\ldots,b_m)\in \mathbb{Z}^m$ with
\begin{align*}
F(a_1,\ldots,a_r,b_1,\ldots,b_m)=0.
\end{align*}
[/definition]
This definition allows hidden variables. The set $A$ records the visible parameters for which the auxiliary variables can be chosen to solve the equation. Once equations can encode sets, the algorithmic question becomes a precise decision problem.
[definition: Diophantine Decision Problem over $\mathbb{Z}$]
The Diophantine decision problem over $\mathbb{Z}$ asks whether there is an algorithm which, given $n\in \mathbb{N}$ and $F\in \mathbb{Z}[x_1,\ldots,x_n]$, decides whether the equation
\begin{align*}
F(x_1,\ldots,x_n)=0
\end{align*}
has a solution in $\mathbb{Z}^n$.
[/definition]
This decision problem asks for a single method that succeeds on every polynomial input. The obstruction is not a shortage of clever algorithms for particular families, but the expressive power of polynomial equations: over the integers they can encode computational behaviour complicated enough to defeat any universal decision procedure.
[quotetheorem:9920]
This theorem does not say that individual equations are hopeless. It says that no single mechanical procedure can solve every case. The subject therefore splits naturally into families where structure is strong enough for theorems and the global landscape where undecidability is unavoidable.
[example: A Decidable Family inside an Undecidable World]
For a two-variable linear Diophantine equation
\begin{align*}
ax+by=c
\end{align*}
with $a,b,c\in\mathbb{Z}$ and $(a,b)\ne(0,0)$, compute $d=\gcd(a,b)$ by the Euclidean algorithm. By *Bezout Criterion for a Linear Diophantine Equation*, the equation has an integral solution exactly when $d\mid c$. Thus this family is decidable: the algorithm computes $d$, divides $c$ by $d$, and answers yes precisely when the remainder is $0$.
Primitive Pythagorean triples also form a controlled family. By *Parametrisation of Primitive Pythagorean Triples*, after possibly interchanging the two legs, every positive primitive solution of
\begin{align*}
a^2+b^2=c^2
\end{align*}
has the form
\begin{align*}
(a,b,c)=(m^2-n^2,2mn,m^2+n^2)
\end{align*}
where $m,n\in\mathbb{N}$, $m>n>0$, $\gcd(m,n)=1$, and $m\not\equiv n\pmod{2}$. Substituting the formula shows why it satisfies the equation:
\begin{align*}
(m^2-n^2)^2+(2mn)^2=(m^4-2m^2n^2+n^4)+4m^2n^2
\end{align*}
\begin{align*}
=m^4+2m^2n^2+n^4=(m^2+n^2)^2.
\end{align*}
So this quadratic family is not merely solvable case-by-case; its primitive positive solutions are listed by two integer parameters subject to explicit coprimality and parity conditions.
These decidable and parametrised families sit inside the full class of Diophantine equations over $\mathbb{Z}$. By *[Matiyasevich-Robinson-Davis-Putnam Theorem](/theorems/9920)*, there is no algorithm that decides solvability for every polynomial equation
\begin{align*}
F(x_1,\ldots,x_n)=0
\end{align*}
with $F\in\mathbb{Z}[x_1,\ldots,x_n]$ and arbitrary $n$. The contrast shows why degree, number of variables, and arithmetic domain are structural data, not cosmetic labels.
[/example]
## Beyond and Connected Topics
Diophantine equations are a meeting point for several parts of number theory. The elementary route continues through divisibility, congruences, quadratic residues, and arithmetic functions, where many impossibility arguments can be made with modular arithmetic and the Euclidean algorithm. The natural Androma continuation is [Cambridge II Number Theory](/page/Cambridge%20II%20Number%20Theory).
The algebraic route studies solutions using rings, ideals, and fields. Factoring equations in $\mathbb{Z}$ may fail because unique factorisation fails in larger rings; algebraic number theory repairs this by replacing elements with ideals. This is the natural bridge to [Cambridge II Number Fields](/page/Cambridge%20II%20Number%20Fields).
The set-theoretic and foundational route begins with countability, algorithms, and the distinction between finite search and infinite decision problems. Background on functions, countability, and arithmetic sets is developed in [Cambridge IA Numbers and Sets](/page/Cambridge%20IA%20Numbers%20and%20Sets).
The geometric route treats $F=0$ as a variety and asks for rational or integral points. Conics, elliptic curves, and higher-genus curves behave very differently, and the boundary between parametrisation and finiteness is one of the central stories of arithmetic geometry.
The analytic route enters when equations are studied statistically or asymptotically. Counting solutions of bounded height, estimating representation numbers, and using modular forms all turn Diophantine questions into analytic ones.
The computability route begins with Hilbert's tenth problem. It explains why a universal algorithm cannot exist over $\mathbb{Z}$, while leaving many restricted domains and families as active areas of research.
## References
Androma, [Cambridge IA Numbers and Sets](/page/Cambridge%20IA%20Numbers%20and%20Sets).
Androma, [Cambridge IA Differential Equations](/page/Cambridge%20IA%20Differential%20Equations).
Androma, [Cambridge II Number Fields](/page/Cambridge%20II%20Number%20Fields).
Androma, [Cambridge II Number Theory](/page/Cambridge%20II%20Number%20Theory).
Hardy and Wright, *An Introduction to the Theory of Numbers* (1938).
Ireland and Rosen, *A Classical Introduction to Modern Number Theory* (1982).
Silverman and Tate, *Rational Points on Elliptic Curves* (1992).
Matiyasevich, *Hilbert's Tenth Problem* (1993).
Diophantine Equation
Also known as: Diophantine equations, Integral polynomial equations, Integer solutions of polynomial equations, Arithmetic equations, Equations over the integers