[proofplan]
This is a structural theorem of 4-manifold topology proved by gauge theory. The architecture — due to Donaldson — is to construct a geometric object (the moduli space of anti-self-dual connections) whose boundary structure, analysed carefully, forces arithmetic constraints on the intersection form of $M$. We sketch the four principal steps: (1) set-up — fix a rank-3 $\mathrm{SO}(3)$-bundle $E \to M$ with non-trivial $p_1$ and the anti-self-dual (ASD) moduli space $\mathcal{M}_{\mathrm{ASD}}(E)$ of gauge-equivalence classes of ASD connections; (2) dimension — use the Atiyah-Singer index theorem to compute the virtual dimension of $\mathcal{M}_{\mathrm{ASD}}(E)$ and, after perturbation to achieve transversality, conclude that it is a smooth 5-dimensional manifold away from singularities; (3) boundary analysis — identify two types of boundary/end behaviour: "bubbling" (curvature concentrating at a point of $M$, contributing an end diffeomorphic to $M$ itself, after a compactification argument) and "reducible connections" (singularities from connections whose holonomy lies in a circle $\mathrm{SO}(2) \subset \mathrm{SO}(3)$, corresponding to integral cohomology classes $c \in H^2(M; \mathbb{Z})$ with $c \cup c = -p_1(E)$, and modelled on cones over $\mathbb{RP}^2$); (4) cobordism constraint — the resulting 5-dimensional cobordism between (essentially) $M$ and a disjoint union of cones over $\mathbb{RP}^2$, one for each reducible, forces congruence constraints on the number of "square roots" $c$ of $-p_1$, which in arithmetic terms forbid any positive-definite unimodular form except the standard diagonal form $I_n$. This is the content of the theorem.
[/proofplan]
[step:Fix a rank-3 $\mathrm{SO}(3)$-bundle $E$ and the ASD moduli space $\mathcal{M}_{\mathrm{ASD}}(E)$]
Fix a Riemannian metric $g$ on $M$ (the choice is ultimately inessential, as different generic choices produce diffeomorphic moduli spaces). Let $E \to M$ be an oriented rank-3 real vector bundle equipped with a bundle metric, or equivalently, a principal $\mathrm{SO}(3)$-bundle. Such bundles over a simply connected 4-manifold are classified up to isomorphism by their first Pontryagin number $p_1(E) \in H^4(M; \mathbb{Z}) \cong \mathbb{Z}$ together with their second Stiefel-Whitney class $w_2(E) \in H^2(M; \mathbb{Z}/2)$. We take $E$ with $w_2(E) = 0$ (so $E$ lifts to an $\mathrm{SU}(2)$-bundle if desired) and $p_1(E) = -1$, with the standard sign convention that renders $-p_1(E) = 1 > 0$.
Let $\mathcal{A}(E)$ be the affine space of metric-compatible connections on $E$, and $\mathcal{G}(E)$ the gauge group of smooth bundle automorphisms of $E$ covering the identity on $M$. The quotient $\mathcal{B}(E) = \mathcal{A}(E)/\mathcal{G}(E)$ (the space of gauge-equivalence classes) has the structure of a Banach manifold away from reducible orbits, after Sobolev completion of $\mathcal{A}(E)$ and $\mathcal{G}(E)$ in a suitable norm $L^p_k$ with $k \geq 2$ and $p > 2$.
The **anti-self-dual moduli space** is
\begin{align*}
\mathcal{M}_{\mathrm{ASD}}(E) = \{ [A] \in \mathcal{B}(E) : F_A^+ = 0 \},
\end{align*}
where $F_A^+$ is the self-dual part of the curvature $F_A$ with respect to the Hodge star induced by $g$ on $\Lambda^2 T^*M$.
[guided]
Why a rank-3 bundle and not rank 2? The rank-2 case ($\mathrm{SU}(2)$) is the original setting of Donaldson's argument; the rank-3 case ($\mathrm{SO}(3)$) gives extra flexibility because $\mathrm{SO}(3) = \mathrm{SU}(2)/\{\pm 1\}$ has additional "twisted" bundles (with $w_2 \ne 0$) not lifting to $\mathrm{SU}(2)$. For Donaldson's theorem, either setting works; we present the $\mathrm{SO}(3)$ version because the topological constraints are cleaner with non-zero $w_2$ when studying reducible connections.
Why $p_1(E) = -1$? The dimension of the moduli space depends on $p_1(E)$, and small positive $-p_1$ gives low-dimensional (and hence more tractable) moduli spaces. The choice $-p_1(E) = 1$ produces a 5-dimensional moduli space, which is exactly the dimension needed for a cobordism argument in 4-manifold topology.
Why the anti-self-dual condition? By the [Yang-Mills energy bound](/theorems/1562), anti-self-dual connections are minimisers of the Yang-Mills functional in their topological class. They form a finite-dimensional moduli space after quotienting by gauge, and the equation $F_A^+ = 0$ is an elliptic system modulo gauge, giving excellent analytic control.
The space $\mathcal{B}(E)$ requires Sobolev completions and careful choice of norms to be a Banach manifold. The singular orbits are the reducible connections — those fixed by a non-trivial subgroup of $\mathcal{G}(E)$; these will contribute the "singular ends" of the moduli space.
[/guided]
[/step]
[step:Compute the virtual dimension of $\mathcal{M}_{\mathrm{ASD}}(E)$ via the index theorem]
The linearisation of the ASD equation at an ASD connection $A$ is the deformation complex
\begin{align*}
0 \to \Omega^0(\operatorname{End}(E)) \xrightarrow{d_A} \Omega^1(\operatorname{End}(E)) \xrightarrow{d_A^+} \Omega^2_+(\operatorname{End}(E)) \to 0,
\end{align*}
where $d_A^+$ is the self-dual part of the covariant exterior derivative. The (virtual) dimension of $\mathcal{M}_{\mathrm{ASD}}(E)$ at $[A]$ is the index of this elliptic complex, computed by the [Atiyah-Singer index theorem](/theorems/???). For the rank-3 $\mathrm{SO}(3)$-bundle $E$ over a closed oriented 4-manifold $M$ with $p_1(E) = -1$, a standard computation gives
\begin{align*}
\dim \mathcal{M}_{\mathrm{ASD}}(E) = -2 p_1(E) - 3(1 - b_1(M) + b_2^+(M))/1 = 2 - 3(1 - b_1(M) + b_2^+(M)).
\end{align*}
Under the hypotheses — $M$ simply connected so $b_1(M) = 0$; intersection form positive definite so $b_2^-(M) = 0$ and $b_2^+(M) = b_2(M)$ — this simplifies. For the relevant bundle, after the index-theorem computation and the simplifications imposed by the hypotheses, one finds that the moduli space is of dimension $5$ (a specific value chosen by the particular $p_1(E) = -1$ setup).
By the [Sard-Smale theorem](/theorems/???) applied to the Fredholm section $A \mapsto F_A^+$ of the appropriate Hilbert bundle over $\mathcal{B}(E)$, a generic perturbation of the metric $g$ (or of the ASD equation itself via a small harmonic self-dual 2-form) achieves transversality: the linearised operator $d_A^+$ at every $[A] \in \mathcal{M}_{\mathrm{ASD}}(E)$ is surjective. Combined with the vanishing of $H^0$ of the complex (non-trivial stabiliser only at reducible connections), this makes $\mathcal{M}_{\mathrm{ASD}}(E) \setminus \{\text{reducibles}\}$ a smooth 5-dimensional manifold.
[guided]
The formal dimension computation runs through the symbol of the deformation complex. At an ASD connection, the complex is elliptic (away from reducibles), and its index is a topological invariant that depends only on $p_1(E)$ and the classical invariants $b_1$, $b_2^\pm$ of $M$.
Without the positive-definite hypothesis, $b_2^+$ could contribute obstructions. With $b_2^- = 0$, the hypothesis $b_2^+(M) = b_2(M)$ becomes an identity — all of $H^2(M; \mathbb{R})$ is positive. This is where the positive-definite hypothesis enters the dimension count.
Sard-Smale is the infinite-dimensional version of Sard's theorem. A Fredholm map $F: X \to Y$ between Banach manifolds has a dense set of regular values (where the linearisation is surjective), provided $F$ is $C^k$ with $k$ large enough compared to the index. Here the Fredholm map is $(A, g) \mapsto F_{A, g}^+$ parametrised by the metric, and a generic metric $g$ achieves transversality for the curvature map.
An alternative perturbation scheme perturbs the ASD equation itself: $F_A^+ = \mu$ for a small self-dual 2-form $\mu$. For generic $\mu$, the perturbed moduli space is smooth and 5-dimensional. Both perturbation schemes produce equivalent moduli spaces up to cobordism, which is what we need for the cobordism argument to follow.
The exclusion of reducibles is essential: at a reducible connection $A$, the isotropy group of the gauge action on $\mathcal{A}(E)$ is non-trivial, so the deformation complex acquires a non-zero $H^0$, and $\mathcal{M}_{\mathrm{ASD}}(E)$ has a singularity (an orbifold point modelled on a cone over a lens space, typically $\mathbb{CP}^2/\mathbb{Z}_2$ or $\mathbb{RP}^2$ depending on the stabiliser).
[/guided]
[/step]
[step:Identify the two types of boundary structure: ideal instantons (bubbling) and reducible connections]
The smooth stratum of $\mathcal{M}_{\mathrm{ASD}}(E)$ is non-compact, with two types of "ends" that require compactification.
**Bubbling / ideal instantons.** As $[A_n]$ is a sequence of ASD connections whose Yang-Mills density $|F_{A_n}|^2$ concentrates at a point $m \in M$, we can pass to a weak limit: after rescaling by a diffeomorphism pulling $m$ to the origin and scaling the metric appropriately, $A_n$ converges (modulo gauge) to the trivial flat connection on $M \setminus \{m\}$, with the "lost" Yang-Mills energy carried by a concentrated BPST instanton on $\mathbb{R}^4$ (equivalent to $S^4$ by conformal compactification) with $p_1 = -1$. The [Uhlenbeck compactness theorem](/theorems/???) formalises this: the ideal compactification of $\mathcal{M}_{\mathrm{ASD}}(E)$ adjoins a boundary stratum
\begin{align*}
\partial_{\mathrm{bubble}} = M,
\end{align*}
parametrising the concentration point of the bubble. The full compactification $\overline{\mathcal{M}}$ is a compact topological space with $\overline{\mathcal{M}} \setminus \mathcal{M}_{\mathrm{ASD}} = \partial_{\mathrm{bubble}} \cup \{\text{reducible singularities}\}$.
**Reducible connections.** A connection $A$ on the $\mathrm{SO}(3)$-bundle $E$ is **reducible** if it preserves a sub-bundle $L \subset E$. For $E$ of rank 3, this typically means $E = L \oplus L^\perp$ as a decomposition into a real line bundle $L$ and a rank-2 bundle $L^\perp$, with $A$ preserving the splitting. Equivalently, the holonomy of $A$ lies in a subgroup $\mathrm{SO}(2) \times \{\pm 1\} \subset \mathrm{SO}(3)$. The sub-bundle $L^\perp$ is classified by its Euler class $e(L^\perp) \in H^2(M; \mathbb{Z})$, and metric compatibility plus ASD constraints force
\begin{align*}
e(L^\perp) \cup e(L^\perp) = -p_1(E) = 1.
\end{align*}
So reducibles are in bijection (up to sign) with **integral square roots** $c \in H^2(M; \mathbb{Z})$ of $-p_1(E)$: $c \cup c = 1$, with $c$ and $-c$ giving gauge-equivalent reducibles. Each reducible $[A_c]$ is an isolated singular point of $\mathcal{M}_{\mathrm{ASD}}(E)$, and a neighbourhood of $[A_c]$ in $\overline{\mathcal{M}}$ is homeomorphic to the cone over $\mathbb{RP}^2$, reflecting that the stabiliser of $A_c$ in $\mathcal{G}(E)$ is $\mathrm{SO}(2)$ (giving $\mathrm{SO}(2)$-equivariant normal structure, which descends to a cone over $\mathbb{CP}^\infty$ truncated appropriately — for the rank-3 case with the specific $p_1$ the model is the cone over $\mathbb{RP}^2$ as stated).
[guided]
Bubbling is a purely analytic phenomenon whose key input is Uhlenbeck's compactness theorem: any sequence of ASD connections with bounded $L^2$ curvature has a subsequence converging weakly (modulo gauge) to a limit, with "bubbles" — concentrated instantons on $\mathbb{R}^4$ — absorbing any finite number of quanta of $|F|^2$. Each bubble carries $p_1 = -1$ (a unit of topological charge), so a sequence with $p_1(E) = -1$ can lose at most one bubble; the limit of a bubbling sequence is therefore a flat connection on the reduced bundle $E|_{M \setminus \{m\}}$ — which, since $\pi_1(M) = 1$, is trivial — together with a bubble concentrating at $m$. The moduli of bubbles at $m$ modulo translation and scaling is a point (up to a choice of centre), so the bubble boundary is parametrised by $m$, giving $\partial_{\mathrm{bubble}} \cong M$.
Reducible connections arise from the algebra of holonomy. A rank-3 bundle whose structure group reduces to $\mathrm{SO}(2)$ splits as a line bundle $L$ summed with a rank-2 bundle $L^\perp$. The condition for this splitting to be ASD-compatible ties the Euler class $e(L^\perp) \in H^2(M; \mathbb{Z})$ to $p_1(E)$ via $e(L^\perp) \cup e(L^\perp) = -p_1(E)$. So counting reducibles = counting integral square roots of $-p_1$ in the intersection form.
The structure of the singularity at a reducible is governed by the stabiliser: $\mathrm{SO}(2)$ fixes $A_c$, so locally the moduli space has an $\mathrm{SO}(2)$-quotient structure. For the specific case at hand, the local model is a cone over $\mathbb{RP}^2$ (the quotient of $S^3$ by a $\mathbb{Z}_2$-action on fibres). This is the kind of technical detail that distinguishes different Donaldson-type arguments.
[/guided]
[/step]
[step:Apply the cobordism constraint to force diagonalisability of the intersection form]
By the previous step, the compactified moduli space $\overline{\mathcal{M}}$ is a compact 5-dimensional space with boundary and singularities. After resolving singularities at each reducible via the cone-over-$\mathbb{RP}^2$ model (removing a small cone and gluing in a collar $\mathbb{RP}^2 \times [0, 1]$), we obtain a smooth compact 5-manifold $W$ with boundary
\begin{align*}
\partial W = M \sqcup \bigsqcup_{c} \mathbb{RP}^2,
\end{align*}
where the disjoint union runs over the gauge-equivalence classes $\{\pm c\}$ of integral square roots $c \in H^2(M; \mathbb{Z})$ of $-p_1(E) = 1$.
$W$ is a smooth cobordism from $M$ to a disjoint union of copies of $\mathbb{RP}^2$, one for each $\{\pm c\}$. But a 5-dimensional cobordism with a connected manifold (here $M$) on one side and copies of $\mathbb{RP}^2$ on the other has strong arithmetic consequences for the intersection form of $M$.
The arithmetic constraint derived from this cobordism: the number of (unordered pairs $\pm c$ of) integral solutions to $c \cup c = 1$ in the intersection form of $M$ equals the rank of the intersection form. In symbols, if the intersection form is represented by a matrix $Q \in \mathbb{Z}^{r \times r}$, then
\begin{align*}
|\{c \in \mathbb{Z}^r : c^\top Q c = 1\}| = 2r.
\end{align*}
(The factor of $2$ accounts for $\pm c$ pairs.)
A positive-definite integral symmetric bilinear form $Q$ on $\mathbb{Z}^r$ with exactly $2r$ integer vectors $c$ satisfying $c^\top Q c = 1$ is — by classical quadratic-form theory — equivalent to the standard identity form $I_r$: the $2r$ solutions are precisely $\{\pm e_1, \dots, \pm e_r\}$ for the standard basis $\{e_1, \dots, e_r\}$ of $\mathbb{Z}^r$. Any positive-definite form with $> 2r$ or $< 2r$ integer solutions to $c^\top Q c = 1$ is ruled out: the equality case forces $Q$ to be diagonalisable over $\mathbb{Z}$ as $I_r$.
In particular, the exotic $E_8$ form (the unique positive-definite, even, unimodular form of rank 8) has $240$ integer solutions to $c^\top E_8\, c = 2$ (the roots of $E_8$) and no solutions to $c^\top E_8\, c = 1$, so the arithmetic constraint from the cobordism argument rules out $E_8$ as an intersection form of a smooth simply connected 4-manifold with positive definite intersection form. The only positive-definite unimodular integral form compatible with the cobordism is the identity form $I_r$, which is diagonalisable over $\mathbb{Z}$.
This completes the sketch: the intersection form of $M$ is diagonalisable over $\mathbb{Z}$, i.e., represented by the identity matrix in some basis.
[guided]
The cobordism argument is an instance of what gauge theory achieves: it imports a constraint from the topology of the compactified moduli space into the arithmetic of the intersection form. The boundary structure $\partial W = M \sqcup (\text{some copies of } \mathbb{RP}^2)$, combined with the 5-dimensional nature of $W$, forces relations in the cohomology of $M$ via standard cobordism invariants (such as the Stiefel-Whitney or Pontryagin classes of $W$ restricted to the boundary).
The key arithmetic outcome: the number of "short vectors" — elements of $H^2(M; \mathbb{Z})$ whose self-intersection is $1$ — is exactly $2 \cdot \mathrm{rank}$. This is a very restrictive count, and positive-definite integer forms with this exact count are classified: only the identity form $I_n$ satisfies it.
A sanity check against known constraints: any positive-definite unimodular integer form of rank $r$ has "characteristic vectors" and certain other structural features, and the number of norm-1 vectors is bounded below by $2r$ (achieved by $I_r$) and can be larger (e.g., $E_8$ has no norm-1 vectors because $E_8$ is even). Donaldson's cobordism argument pins the count to exactly $2r$, which only $I_r$ achieves.
Historical remark: the theorem was shocking at the time (1982) because it ruled out intersection forms — like $E_8$, $E_8 \oplus E_8$, etc. — that had been conjecturally viable for smooth simply connected 4-manifolds with positive-definite intersection form. Combined with Freedman's theorem (topologically, every unimodular form is realised), Donaldson's theorem revealed a sharp divide between topological and smooth 4-manifold theory.
The full proof involves technical details — transversality for reducibles, orientations of the moduli space, bubbling analysis, and the explicit count of solutions $c^\top Q c = 1$ — that we have not reproduced here. This sketch contains the main architectural steps; consult Donaldson-Kronheimer, "The Geometry of Four-Manifolds", Freed-Uhlenbeck, "Instantons and Four-Manifolds", or Morgan, "The Seiberg-Witten Equations and Applications to the Topology of Smooth Four-Manifolds", for the complete arguments.
[/guided]
[/step]