In a [metric space](/page/Metric%20Space), boundedness is one of the simplest conditions one can impose: a set is bounded if it fits inside a ball of finite radius. In finite-dimensional Euclidean space $\mathbb{R}^n$, boundedness (together with closedness) suffices to characterise [compactness](/page/Compact%20Space) via the [Heine-Borel theorem](/theorems/309). But in infinite-dimensional spaces --- the natural setting for partial differential equations, functional analysis, and the calculus of variations --- boundedness is far too weak. The closed unit ball in any infinite-dimensional [Banach space](/page/Banach%20Space) is bounded and closed, yet never compact: the sequence of standard basis vectors $\{e_k\}_{k=1}^\infty$ in $\ell^2$ satisfies $\|e_j - e_k\|_{\ell^2} = \sqrt{2}$ for $j \neq k$, so no subsequence is Cauchy, let alone convergent.
What goes wrong? The issue is not that the unit ball is "too large" in a naive sense --- its diameter is finite. The problem is that it is *too spread out at fine scales*: no matter how small a radius $\varepsilon > 0$ one chooses, the ball cannot be covered by finitely many balls of radius $\varepsilon$. This quantitative refinement of boundedness --- the requirement that the space admit finite approximations at every scale --- is the notion of **total boundedness**.
[example: Boundedness Without Total Boundedness in $\ell^2$]
Consider the [Hilbert space](/page/Hilbert%20Space) $\ell^2 = \ell^2(\mathbb{N})$ with norm $\|x\|_{\ell^2} = \left(\sum_{k=1}^\infty x_k^2\right)^{1/2}$, and let $S = \{e_k\}_{k=1}^\infty$ be the set of standard basis vectors. The set $S$ is bounded: $\|e_k\|_{\ell^2} = 1$ for all $k$, so $S \subset \overline{B}(0, 1)$.
We claim $S$ is not totally bounded. Choose $\varepsilon = 1/2$. Suppose for contradiction that $S \subset \bigcup_{i=1}^N B(x_i, 1/2)$ for some finite set $\{x_1, \ldots, x_N\} \subset \ell^2$. Since $S$ is infinite and the cover is finite, by the pigeonhole principle there exist distinct indices $j \neq k$ with $e_j, e_k \in B(x_i, 1/2)$ for some $i$. But then
\begin{align*}
\sqrt{2} = \|e_j - e_k\|_{\ell^2} \le \|e_j - x_i\|_{\ell^2} + \|x_i - e_k\|_{\ell^2} < \frac{1}{2} + \frac{1}{2} = 1,
\end{align*}
a contradiction. In fact, the same argument shows that no finite collection of balls of radius $r < \sqrt{2}/2$ can cover $S$.
By contrast, any bounded subset of $\mathbb{R}^n$ is totally bounded: a bounded set fits inside a cube, which can be subdivided into finitely many sub-cubes of any prescribed diameter. The failure of total boundedness for bounded sets is a purely infinite-dimensional phenomenon, and it is the precise mechanism by which compactness breaks down in infinite dimensions.
[/example]
Total boundedness occupies a critical position in the architecture of compactness theory. It is the *metric* ingredient that, together with [completeness](/page/Complete%20Metric%20Space), characterises compactness in metric spaces. A metric space is compact if and only if it is complete and totally bounded. When compactness fails, total boundedness identifies the specific mode of failure: either the space has "gaps" (incompleteness, where Cauchy sequences escape) or it is "too spread out" (failure of total boundedness, where no finite approximation exists). Understanding this decomposition is essential for both verifying compactness in concrete spaces and for recognising why standard compactness arguments fail in infinite-dimensional settings.
## Definition
The fundamental challenge that total boundedness addresses is the question of *finite approximability*: when can an infinite set be described, to any desired accuracy, by a finite list of points? In $\mathbb{R}^n$, every bounded set admits such approximations --- one can lay down a grid of mesh size $\varepsilon$ and select the finitely many grid points that lie near the set. But this grid construction relies on the finite dimensionality of $\mathbb{R}^n$: in $n$ dimensions, a cube of side length $L$ can be covered by $(L/\varepsilon)^n$ sub-cubes of side length $\varepsilon$. When $n = \infty$, no such finite covering exists.
[definition: Total Boundedness]
Let $(M, d)$ be a [metric space](/page/Metric%20Space). A subset $A \subset M$ is **totally bounded** if for every $\varepsilon > 0$, there exists a finite set $F = \{x_1, \ldots, x_N\} \subset M$ such that
\begin{align*}
A \subset \bigcup_{i=1}^N B(x_i, \varepsilon).
\end{align*}
The finite set $F$ is called an **$\varepsilon$-net** (or **$\varepsilon$-covering**) for $A$, and the minimum cardinality of an $\varepsilon$-net is denoted $\mathcal{N}(A, \varepsilon)$, the **covering number** of $A$ at scale $\varepsilon$.
The metric space $(M, d)$ itself is totally bounded if $M$ is a totally bounded subset of itself.
[/definition]
Several features of this definition deserve immediate comment.
[remark: $\varepsilon$-Nets Need Not Lie in $A$]
The definition requires $F \subset M$, not $F \subset A$. However, one can always replace an external $\varepsilon$-net with an internal $2\varepsilon$-net: if $A \subset \bigcup_{i=1}^N B(x_i, \varepsilon)$, choose $a_i \in A \cap B(x_i, \varepsilon)$ for each $i$ with $A \cap B(x_i, \varepsilon) \neq \varnothing$. Then for any $a \in A$, we have $a \in B(x_i, \varepsilon)$ for some $i$, so $d(a, a_i) \le d(a, x_i) + d(x_i, a_i) < 2\varepsilon$. Thus $\{a_1, \ldots, a_N\}$ is a $2\varepsilon$-net for $A$ lying in $A$. It follows that $A$ is totally bounded if and only if for every $\varepsilon > 0$, there exists a finite set $F \subset A$ with $A \subset \bigcup_{x \in F} B(x, \varepsilon)$.
[/remark]
A second subtlety concerns the relationship between total boundedness and the more familiar notion of boundedness.
[remark: Total Boundedness Versus Boundedness]
Every totally bounded set is bounded: if $\{x_1, \ldots, x_N\}$ is a $1$-net for $A$, then $A \subset \bigcup_{i=1}^N B(x_i, 1)$, so
\begin{align*}
\operatorname{diam}(A) \le \max_{1 \le i, j \le N} d(x_i, x_j) + 2.
\end{align*}
The converse holds in $\mathbb{R}^n$ but fails in every infinite-dimensional normed space, as the opening example demonstrates. The distinction is that boundedness controls the *global extent* of the set (it fits inside one large ball), while total boundedness controls the *local complexity* (it can be covered by finitely many small balls at every scale).
[/remark]
## Equivalent Characterisations
The definition of total boundedness as the existence of finite $\varepsilon$-nets is the most directly applicable formulation. But in practice, one often needs to verify total boundedness through different means, or to exploit it in ways that go beyond the covering property. The following characterisations are all equivalent and each illuminates a different facet of the concept.
The first alternative characterisation connects total boundedness to sequences --- the same objects that drive most convergence arguments in analysis.
[quotetheorem:1087]
The implication $(1) \Rightarrow (2)$ is proved by a greedy pigeonhole argument: given a sequence $\{a_k\}_{k=1}^\infty$ in $A$ and a $1$-net, at least one ball of the net contains infinitely many terms; restrict to those terms and repeat with a $1/2$-net, then a $1/3$-net, and take a diagonal subsequence. The resulting subsequence is Cauchy. The implication $(2) \Rightarrow (1)$ is proved by contraposition: if $A$ is not totally bounded, there exists $\varepsilon_0 > 0$ for which no finite $\varepsilon_0$-net exists, and a greedy construction produces a sequence with $d(a_j, a_k) \ge \varepsilon_0$ for all $j \neq k$; no subsequence of such a sequence can be Cauchy.
This characterisation explains the role of total boundedness in the compactness theorem: a metric space is [sequentially compact](/page/Sequential%20Compactness) if and only if every sequence has a *convergent* subsequence, which decomposes into two requirements --- every sequence has a *Cauchy* subsequence (total boundedness) and every Cauchy sequence *converges* (completeness).
The second characterisation replaces the covering condition with a density condition, connecting total boundedness to [separability](/page/Separable%20Space).
[quotetheorem:1088]
The equivalence of (1) and (3) is particularly useful: it shows that total boundedness is a property of the "rough shape" of a set, unaffected by closure. A set and its closure are simultaneously totally bounded or not. This means that to verify total boundedness of a dense subset, it suffices to check it for the closure, and vice versa.
A key consequence of total boundedness is the existence of countable dense subsets.
[quotetheorem:1089]
The proof is constructive: for each $n \in \mathbb{N}$, choose a finite $1/n$-net $F_n$ for $M$. The countable set $D = \bigcup_{n=1}^\infty F_n$ is dense in $M$: for any $x \in M$ and $\varepsilon > 0$, choose $n$ with $1/n < \varepsilon$; then some point of $F_n \subset D$ lies within distance $1/n < \varepsilon$ of $x$.
The converse fails: a separable metric space need not be totally bounded. The real line $\mathbb{R}$ is separable (the rationals $\mathbb{Q}$ are countable and dense) but not totally bounded ($\mathbb{R}$ cannot be covered by finitely many balls of radius $1$, for instance). Separability controls the *countable complexity* of the space; total boundedness controls the *finite complexity at every scale*. The gap between these two notions is precisely the gap between countable and finite.
## Total Boundedness and Compactness
The deepest application of total boundedness is its role in characterising compactness for metric spaces. In a general topological space, compactness is defined by the open-cover property and has no direct metric characterisation. But in a metric space, compactness decomposes cleanly into two independent conditions --- completeness and total boundedness --- each of which is easier to verify in isolation than compactness itself.
[quotetheorem:316]
Each equivalence carries distinct content. The equivalence of (1) and (2) asserts that, in metric spaces, the open-cover property and the sequential property are interchangeable --- something that fails in general topological spaces, as shown by the product $\{0, 1\}^{[0,1]}$ (compact but not sequentially compact) and the ordinal space $\omega_1$ (sequentially compact but not compact). The equivalence with (3) gives a *constructive decomposition* of compactness into two checkable ingredients.
The characterisation (3) reveals why compactness fails in the two canonical examples:
- **The open interval $(0, 1)$** with the standard metric is totally bounded (it inherits total boundedness from the compact set $[0, 1]$) but not complete: the [Cauchy sequence](/page/Cauchy%20Sequence) $x_n = 1/n$ has no limit in $(0, 1)$. The space fails compactness through incompleteness alone.
- **The real line $\mathbb{R}$** is complete but not totally bounded: no finite collection of unit balls covers $\mathbb{R}$. The space fails compactness through lack of total boundedness alone.
- **The unit ball in $\ell^2$** is complete (it is a closed subset of the complete space $\ell^2$) but not totally bounded (as shown in the opening example). This is the infinite-dimensional failure mode.
[example: Verifying Compactness via Completeness and Total Boundedness]
We verify that the [Cantor set](/page/Cantor%20Set) $\mathcal{C} \subset [0, 1]$ is compact by checking the two conditions of the characterisation.
**Completeness.** The Cantor set is a closed subset of $\mathbb{R}$ (it is the intersection of closed sets $C_n$, where $C_n$ is obtained by removing the middle third of each interval at stage $n$). Since $\mathbb{R}$ is complete and closed subsets of [complete metric spaces](/page/Complete%20Metric%20Space) are complete, $\mathcal{C}$ is complete.
**Total boundedness.** Fix $\varepsilon > 0$ and choose $n$ large enough that $3^{-n} < \varepsilon$. At stage $n$ of the Cantor construction, $C_n$ consists of $2^n$ closed intervals, each of length $3^{-n}$. The midpoint of each interval serves as the centre of a ball of radius $3^{-n}/2 < \varepsilon$ covering that interval. Thus $\mathcal{C} \subset C_n$ is covered by $2^n$ balls of radius $\varepsilon$, giving a finite $\varepsilon$-net of cardinality at most $2^n$.
Therefore $\mathcal{C}$ is complete and totally bounded, hence compact.
The covering number satisfies $\mathcal{N}(\mathcal{C}, 3^{-n}) = 2^n$, so $\log \mathcal{N}(\mathcal{C}, \varepsilon) \sim \frac{\log 2}{\log 3} \log(1/\varepsilon)$ as $\varepsilon \to 0$. The ratio $\frac{\log 2}{\log 3}$ is the Hausdorff dimension of the Cantor set, illustrating how the growth rate of covering numbers encodes geometric information about the set.
[/example]
The decomposition into completeness and total boundedness also gives a practical two-step strategy for proving compactness of subsets of function spaces, which we develop in the section on function spaces below.
## Total Boundedness in Normed Spaces
In normed vector spaces, total boundedness interacts with the linear structure in ways that reveal a sharp divide between finite and infinite dimensions. The central result is that total boundedness of the unit ball *characterises* finite dimensionality.
### The Finite-Dimensional Dichotomy
Every bounded set in $\mathbb{R}^n$ is totally bounded --- this is the content of the forward direction of the [Heine-Borel theorem](/theorems/309), proved by covering a bounded set with a grid of cubes whose side length depends on $\varepsilon$ and $n$. The grid has $(2R\sqrt{n}/\varepsilon)^n$ cubes, which is finite for each fixed $\varepsilon$ and $n$. The crucial point is that $n$ is fixed; when $n = \infty$, no such grid exists.
[quotetheorem:1090]
The equivalence of (2) and (3) follows from the compactness characterisation, since $X$ is a Banach space (or can be completed to one) and $\overline{B}(0, 1)$ is complete when $X$ is. The real content is that (1) and (2) are equivalent: total boundedness of the unit ball fails in every infinite-dimensional normed space.
The proof of the non-trivial direction (not finite-dimensional implies not totally bounded) uses [Riesz's lemma](/page/Riesz's%20Lemma): if $Y$ is a proper closed subspace of a normed space $X$, then for every $\theta \in (0, 1)$, there exists $x \in X$ with $\|x\| = 1$ and $\operatorname{dist}(x, Y) \ge \theta$. By induction, one constructs a sequence $\{x_k\}_{k=1}^\infty$ in $\overline{B}(0, 1)$ with $\|x_j - x_k\| \ge \theta$ for all $j \neq k$. Such a sequence has no Cauchy subsequence, so the unit ball is not totally bounded (by the sequential characterisation).
This result has a profound consequence for the strategy of existence proofs in infinite-dimensional spaces: one cannot extract norm-convergent subsequences from bounded sequences using compactness of the unit ball. One must either pass to a weaker topology (weak convergence, where the [Banach-Alaoglu theorem](/theorems/212) restores compactness) or impose additional regularity conditions (equicontinuity, control of derivatives) that recover total boundedness in a weaker norm.
### Scaling and Translation Invariance
In a normed space, total boundedness inherits the symmetries of the linear structure.
[quotetheorem:1091]
Part (1) follows by scaling an $\varepsilon/|\lambda|$-net for $A$ by $\lambda$. Part (2) uses the fact that if $\{a_1, \ldots, a_N\}$ is an $\varepsilon/2$-net for $A$ and $\{b_1, \ldots, b_K\}$ is an $\varepsilon/2$-net for $B$, then $\{a_i + b_j\}_{i,j}$ is an $\varepsilon$-net for $A + B$ (with $NK$ elements). Part (3) is subtler: it uses the fact that a point in $\operatorname{conv}(A)$ is a finite convex combination of points in $A$, and one can approximate any such combination by a combination of points from a finite $\varepsilon$-net, with the error controlled by the convexity structure. Part (4) was noted above.
Part (3) is noteworthy because the convex hull of a totally bounded set in an infinite-dimensional space need not be *closed*, even though it is totally bounded. The closure of the convex hull, $\overline{\operatorname{conv}(A)}$, is both totally bounded and closed; whether it is compact depends on completeness of the ambient space.
## Hereditary Properties and Subsets
A natural question is: how does total boundedness behave under the standard operations of metric space theory --- passing to subsets, products, and completions? The answers are clean and useful, and they underpin the practical applications of total boundedness in function space theory.
### Subsets and Inheritance
Total boundedness is *hereditary*: every subset of a totally bounded set is totally bounded. This is immediate from the definition --- any $\varepsilon$-net for $A$ is also an $\varepsilon$-net for any $B \subset A$. The importance of this property is that it provides a simple sufficient condition for total boundedness: to show a set $A$ is totally bounded, it suffices to exhibit a totally bounded superset $A \subset S$.
In the reverse direction, if a set $A$ is *not* totally bounded, then no subset of $A$ containing an $\varepsilon_0$-separated sequence (for the critical $\varepsilon_0$) is totally bounded either. This is how one typically *disproves* total boundedness: by exhibiting a separated sequence.
### Products
A fundamental question for applications is whether total boundedness is preserved when one forms products of metric spaces. In finite dimensions, this is used implicitly whenever one covers a cube in $\mathbb{R}^n$ by taking Cartesian products of interval covers. The following result confirms that the property extends to arbitrary (including countable) products.
[quotetheorem:1092]
For the countable product, the argument proceeds by tail truncation: given $\varepsilon > 0$, choose $K$ large enough that $\sum_{k > K} 2^{-k} < \varepsilon/2$. Then cover $M_1 \times \cdots \times M_K$ by finitely many balls of radius $\varepsilon/2$ (using total boundedness of the finite product, which follows by taking Cartesian products of $\varepsilon$-nets for each factor). Extend each centre to an arbitrary point in $\prod_{k > K} M_k$. The resulting balls of radius $\varepsilon$ cover $M$, since the tail contributes at most $\varepsilon/2$ regardless of the choice of coordinates.
For finite products, the construction is simpler: if $\{a_1, \ldots, a_N\}$ is an $\varepsilon$-net for $M_1$ and $\{b_1, \ldots, b_K\}$ is an $\varepsilon$-net for $M_2$, then $\{(a_i, b_j)\}_{i,j}$ is an $\varepsilon$-net for $M_1 \times M_2$ under the $\ell^\infty$ product metric, with $NK$ elements.
### Completions
When attempting to construct compact spaces, one natural approach is to start with a totally bounded space and pass to its [completion](/page/Complete%20Metric%20Space), hoping that the result is both complete and totally bounded --- hence compact. The following result confirms that total boundedness survives the completion process.
[quotetheorem:1093]
The proof uses density: a finite $\varepsilon/2$-net for $M$ is an $\varepsilon$-net for $\hat{M}$, since $M$ is dense in $\hat{M}$ and every point of $\hat{M}$ is within distance $\varepsilon/2$ of a point of $M$, which in turn lies within distance $\varepsilon/2$ of some net point. The triangle inequality then gives the desired $\varepsilon$-coverage.
This result provides a clean route to constructing compact metric spaces: start with any totally bounded space and complete it. The Cantor set, for instance, can be viewed as the completion of the "endpoints" of the middle-thirds construction. More significantly, this result underpins the proof that a closed and bounded subset of $\mathbb{R}^n$ is compact: it is totally bounded (by the grid argument) and complete (as a closed subset of a complete space).
## Total Boundedness in Function Spaces
The most significant applications of total boundedness arise in the study of [function spaces](/page/Function%20Space), where it provides the mechanism for extracting convergent subsequences from families of functions. The abstract compactness characterisation --- compact iff complete and totally bounded --- translates into concrete criteria involving uniform bounds, equicontinuity, and control of derivatives.
### The Arzel\`a-Ascoli Mechanism
Consider the space $C(K)$ of continuous real-valued functions on a [compact metric space](/page/Compact%20Space) $(K, d_K)$, equipped with the supremum norm $\|f\|_\infty = \sup_{x \in K} |f(x)|$. A subset $\mathcal{F} \subset C(K)$ is compact (in the sup-norm topology) if and only if it is closed, uniformly bounded, and equicontinuous --- this is the [Arzel\`a-Ascoli theorem](/theorems/66). In terms of the completeness-and-total-boundedness decomposition, the theorem identifies equicontinuity and uniform boundedness as precisely the conditions that ensure total boundedness.
[quotetheorem:1094]
This result makes the role of each condition transparent. Uniform boundedness prevents the values from "escaping to infinity," and equicontinuity prevents the functions from "oscillating too rapidly." Together, they ensure that $\mathcal{F}$ can be approximated at every scale by a finite set of functions --- exactly what total boundedness demands.
The proof of the forward direction (total boundedness implies uniform boundedness and equicontinuity) is an exercise in unpacking the definition. The reverse direction --- the heart of the theorem --- proceeds by discretising the domain $K$ using a finite $\delta$-net (which exists because $K$ is compact, hence totally bounded), discretising the range $[-M, M]$ using a grid of mesh size $\varepsilon$, and observing that the number of "approximate function profiles" (assignments of grid values to net points) is finite.
[example: Total Boundedness of Lipschitz Families]
Let $K = [0, 1]$ with the standard metric, and consider the family
\begin{align*}
\mathcal{F} = \{f \in C([0, 1]) : \|f\|_\infty \le 1 \text{ and } |f(x) - f(y)| \le L|x - y| \text{ for all } x, y \in [0, 1]\}
\end{align*}
for a fixed Lipschitz constant $L > 0$. We verify the Arzel\`a-Ascoli conditions.
**Uniform boundedness.** By definition, $\|f\|_\infty \le 1$ for all $f \in \mathcal{F}$.
**Equicontinuity.** For any $\varepsilon > 0$, set $\delta = \varepsilon / L$. Then $|x - y| < \delta$ implies $|f(x) - f(y)| \le L|x - y| < \varepsilon$ for every $f \in \mathcal{F}$.
By the theorem, $\mathcal{F}$ is totally bounded in $C([0, 1])$. Since $\mathcal{F}$ is also closed (the pointwise limit of functions in $\mathcal{F}$ satisfies the same Lipschitz condition and bound), it is compact.
We can estimate the covering number explicitly. Fix $\varepsilon > 0$ and set $\delta = \varepsilon/(3L)$. Choose an equally spaced $\delta$-net $\{t_0, t_1, \ldots, t_N\}$ for $[0, 1]$ with $N = \lceil 1/\delta \rceil = \lceil 3L/\varepsilon \rceil$. Any function $f \in \mathcal{F}$ is determined up to error $\varepsilon/3$ by its values at these $N + 1$ points (by the Lipschitz condition, $|f(x) - f(t_i)| \le L\delta = \varepsilon/3$ for $x$ in the interval around $t_i$). The values $f(t_i)$ lie in $[-1, 1]$ and can be discretised to multiples of $\varepsilon/3$, giving at most $(6/\varepsilon + 1)$ choices per grid point. The total number of approximate profiles is at most $(6/\varepsilon + 1)^{N+1}$, so
\begin{align*}
\mathcal{N}(\mathcal{F}, \varepsilon) \le \left(\frac{6}{\varepsilon} + 1\right)^{\lceil 3L/\varepsilon \rceil + 1}.
\end{align*}
The covering number grows exponentially in $1/\varepsilon$, reflecting the fact that $\mathcal{F}$ is an infinite-dimensional object (it is not contained in any finite-dimensional subspace of $C([0, 1])$) that is nonetheless "finite-dimensional at every scale."
[/example]
### Sobolev Embeddings and the Rellich-Kondrachov Mechanism
In PDE theory, the dominant application of total boundedness is through the [Rellich-Kondrachov theorem](/theorems/64): bounded subsets of [Sobolev spaces](/page/Sobolev%20Space) $W^{1,p}(U)$ are precompact (totally bounded with compact closure) in $L^q(U)$ for subcritical exponents $q < p^*$. The mechanism is analogous to Arzel\`a-Ascoli: control of the $W^{1,p}$ norm provides uniform bounds on both the function and its gradient, and the gradient bound (combined with Sobolev embedding) yields a form of equicontinuity that prevents oscillation.
The key point is that the total boundedness holds *in the $L^q$ norm*, not in the $W^{1,p}$ norm. The bounded set $\mathcal{F} = \{u \in W^{1,p}(U) : \|u\|_{W^{1,p}} \le 1\}$ is not totally bounded in $W^{1,p}$ (since $W^{1,p}$ is infinite-dimensional and the unit ball is never totally bounded in an infinite-dimensional normed space). But $\mathcal{F}$, viewed as a subset of $L^q(U)$, *is* totally bounded when $q < p^*$. This is the meaning of a *compact embedding*: the identity map $\iota: W^{1,p}(U) \hookrightarrow L^q(U)$ sends bounded sets to totally bounded sets.
[example: Failure of Total Boundedness at the Critical Exponent]
Let $U = B(0, 1) \subset \mathbb{R}^n$ with $n \ge 3$, and let $p = 2$, so $p^* = 2n/(n - 2)$. We construct a bounded sequence in $W^{1,2}(U)$ that has no convergent subsequence in $L^{p^*}(U)$, demonstrating that total boundedness fails at the critical exponent.
For $\lambda > 0$, define the **Aubin-Talenti functions** (concentrating profiles):
\begin{align*}
u_\lambda(x) = \frac{\lambda^{(n-2)/2}}{(\lambda^2 + |x|^2)^{(n-2)/2}}.
\end{align*}
These functions satisfy $\|\nabla u_\lambda\|_{L^2(\mathbb{R}^n)}^2 = C_n$ (a constant depending only on $n$) and $\|u_\lambda\|_{L^{p^*}(\mathbb{R}^n)}^{p^*} = C_n'$ for universal constants $C_n, C_n'$. As $\lambda \to 0$, the function $u_\lambda$ concentrates at the origin: it becomes taller and narrower, with $u_\lambda(0) = \lambda^{-(n-2)/2} \to \infty$, while the mass in $L^{p^*}$ remains constant.
Introducing smooth cut-offs $\varphi_\lambda = \eta \cdot u_\lambda$ where $\eta \in C^\infty_c(U)$ is a fixed cut-off with $\eta = 1$ near the origin, one obtains a bounded sequence in $W^{1,2}_0(U)$ (with $\|\nabla \varphi_\lambda\|_{L^2} \le C$) that concentrates at the origin in $L^{p^*}(U)$. The concentration prevents any subsequence from converging in $L^{p^*}$: the $L^{p^*}$ norm does not vanish (it converges to $C_n'$), yet $\varphi_\lambda \to 0$ pointwise away from the origin.
This example shows that the Rellich-Kondrachov compact embedding $W^{1,2}(U) \hookrightarrow\hookrightarrow L^q(U)$ cannot extend to $q = p^*$. The bounded set $\{\varphi_\lambda : \lambda > 0\} \cap \overline{B}(0, C)$ is not totally bounded in $L^{p^*}(U)$, precisely because the concentrating sequence has no convergent subsequence.
[/example]
## Covering Numbers and Metric Entropy
The covering number $\mathcal{N}(A, \varepsilon)$ --- the minimum number of balls of radius $\varepsilon$ needed to cover $A$ --- provides a quantitative measure of the "size" or "complexity" of a totally bounded set. The logarithm $\log \mathcal{N}(A, \varepsilon)$ is called the **metric entropy** (or Kolmogorov entropy) of $A$ at scale $\varepsilon$. While qualitative total boundedness (finiteness of $\mathcal{N}(A, \varepsilon)$ for each $\varepsilon > 0$) is what matters for compactness, the *rate of growth* of $\mathcal{N}(A, \varepsilon)$ as $\varepsilon \to 0$ carries significant geometric and analytic information.
[definition: Covering Number and Metric Entropy]
Let $(M, d)$ be a metric space and $A \subset M$.
The **covering number** $\mathcal{N}(A, \varepsilon)$ is the minimum cardinality of an $\varepsilon$-net for $A$:
\begin{align*}
\mathcal{N}(A, \varepsilon) = \min\{N \in \mathbb{N} : \exists x_1, \ldots, x_N \in M \text{ with } A \subset \textstyle\bigcup_{i=1}^N B(x_i, \varepsilon)\}.
\end{align*}
The **metric entropy** of $A$ at scale $\varepsilon$ is $H(A, \varepsilon) = \log_2 \mathcal{N}(A, \varepsilon)$.
[/definition]
<!-- NOTATION PROPOSAL: The covering number $\mathcal{N}(A, \varepsilon)$ and metric entropy $H(A, \varepsilon)$ are standard in approximation theory, learning theory, and geometric measure theory (cf. Kolmogorov-Tikhomirov). They do not appear on the current Androma Notation page. -->
The covering number is closely related to another combinatorial quantity: the **packing number** $\mathcal{P}(A, \varepsilon)$, defined as the maximum cardinality of an $\varepsilon$-separated subset of $A$ (a set $S \subset A$ with $d(x, y) \ge \varepsilon$ for all distinct $x, y \in S$). These two quantities are comparable:
[quotetheorem:1095]
The upper bound $\mathcal{N}(A, \varepsilon) \le \mathcal{P}(A, \varepsilon)$ holds because a maximal $\varepsilon$-separated set is automatically an $\varepsilon$-net: if some point $a \in A$ were at distance $\ge \varepsilon$ from every point of the set, we could add $a$ to the set, contradicting maximality. The lower bound holds because any $\varepsilon$-net must contain at least one point in each ball $B(x_i, \varepsilon)$ of a $2\varepsilon$-separated set, and these balls are disjoint.
This duality means that, up to a factor of $2$ in the radius, covering and packing are equivalent descriptions of the "metric complexity" of $A$. It also gives a practical method for bounding covering numbers: construct a maximal separated set (by a greedy algorithm) and count its elements.
[example: Covering Numbers of Cubes in $\mathbb{R}^n$]
Let $Q = [0, 1]^n \subset \mathbb{R}^n$ with the Euclidean metric. We compute the covering number $\mathcal{N}(Q, \varepsilon)$ up to constants depending on $n$.
**Upper bound.** Partition $[0, 1]$ into $\lceil 1/\varepsilon \rceil$ subintervals of length at most $\varepsilon$. This induces a partition of $Q$ into $\lceil 1/\varepsilon \rceil^n$ sub-cubes, each with diameter at most $\varepsilon\sqrt{n}$. Placing a ball of radius $\varepsilon\sqrt{n}$ at the centre of each sub-cube covers $Q$. Thus $\mathcal{N}(Q, \varepsilon\sqrt{n}) \le \lceil 1/\varepsilon \rceil^n$, and by relabelling, $\mathcal{N}(Q, \varepsilon) \le \lceil \sqrt{n}/\varepsilon \rceil^n$.
**Lower bound.** A packing argument: the volume of $Q$ is $1$, and each ball $B(x_i, \varepsilon)$ intersects $Q$ in a set of Lebesgue measure at most $\omega_n \varepsilon^n$ (where $\omega_n$ is the volume of the unit ball in $\mathbb{R}^n$). Any $\varepsilon$-cover of $Q$ must satisfy $N \cdot \omega_n \varepsilon^n \ge 1$, giving $\mathcal{N}(Q, \varepsilon) \ge (1/\omega_n) \varepsilon^{-n}$.
Combining: there exist constants $c_n, C_n > 0$ depending only on $n$ such that
\begin{align*}
c_n \varepsilon^{-n} \le \mathcal{N}(Q, \varepsilon) \le C_n \varepsilon^{-n}.
\end{align*}
The metric entropy scales as $H(Q, \varepsilon) \asymp n \log(1/\varepsilon)$. The dimension $n$ appears as the exponent governing the growth rate --- this is no coincidence, as the Minkowski (box-counting) dimension of $Q$ is defined precisely as $\lim_{\varepsilon \to 0} \frac{\log \mathcal{N}(Q, \varepsilon)}{\log(1/\varepsilon)} = n$.
[/example]
The connection between covering numbers and dimension extends to fractals and irregular sets, where the Minkowski dimension may be non-integer (as with the Cantor set example above, where $\dim_M(\mathcal{C}) = \log 2 / \log 3$). In probability and statistical learning theory, the metric entropy of function classes controls the complexity of empirical approximation and appears in fundamental results on uniform convergence (Vapnik-Chervonenkis theory) and Gaussian process theory (Dudley's entropy integral).
## Common Techniques
Total boundedness enters mathematical arguments through several recurring patterns. Recognising these patterns --- and understanding which variant of total boundedness is being applied --- is essential for reading and writing proofs in analysis and functional analysis.
### Constructing $\varepsilon$-Nets
The most direct technique: to prove a set $A$ is totally bounded, explicitly construct a finite $\varepsilon$-net for each $\varepsilon > 0$.
**Strategy.** Identify a finite-dimensional "skeleton" that captures $A$ to within accuracy $\varepsilon$. Typical skeletons include:
- A grid of points in $\mathbb{R}^n$ (for subsets of Euclidean space).
- A finite set of "representative functions" obtained by discretising the domain and range (for families of equicontinuous functions --- this is the Arzel\`a-Ascoli mechanism).
- Finite-rank truncations of operators or series (for compact operators, where the image of the unit ball is approximated by finite-dimensional images).
[example: $\varepsilon$-Net Construction for Hilbert-Schmidt Operators]
Let $H$ be a separable [Hilbert space](/page/Hilbert%20Space) with orthonormal basis $\{e_k\}_{k=1}^\infty$, and let $T: H \to H$ be a compact operator. We construct an $\varepsilon$-net for $T(\overline{B}(0, 1))$.
Since $T$ is compact, it is the norm limit of finite-rank operators. For each $n$, define $T_n = P_n T$ where $P_n: H \to \operatorname{span}\{e_1, \ldots, e_n\}$ is the orthogonal projection. Then $\|T - T_n\| \to 0$ as $n \to \infty$ (this is a characterisation of compact operators on separable Hilbert spaces). Choose $n$ large enough that $\|T - T_n\| < \varepsilon/2$.
The image $T_n(\overline{B}(0, 1))$ lies in the $n$-dimensional subspace $\operatorname{span}\{e_1, \ldots, e_n\}$ and is bounded (by $\|T_n\|$). By total boundedness in $\mathbb{R}^n$, there exists a finite $\varepsilon/2$-net $\{y_1, \ldots, y_N\}$ for $T_n(\overline{B}(0, 1))$.
For any $x \in \overline{B}(0, 1)$, choose $y_i$ with $\|T_n x - y_i\| < \varepsilon/2$. Then
\begin{align*}
\|Tx - y_i\| \le \|Tx - T_n x\| + \|T_n x - y_i\| < \|T - T_n\| + \frac{\varepsilon}{2} < \varepsilon.
\end{align*}
So $\{y_1, \ldots, y_N\}$ is an $\varepsilon$-net for $T(\overline{B}(0, 1))$, confirming total boundedness.
The construction reveals the mechanism: a compact operator maps the unit ball to a set that is "almost finite-dimensional" at every scale $\varepsilon$, because the operator is well-approximated by a finite-rank operator at that scale.
[/example]
### The Pigeonhole-and-Refine Argument
To extract a Cauchy subsequence from a sequence in a totally bounded set (establishing the $(1) \Rightarrow (2)$ direction of the sequential characterisation), one applies the pigeonhole principle iteratively.
**Structure.**
1. Cover $A$ by finitely many balls of radius $1$. By the pigeonhole principle, infinitely many terms of the sequence lie in one ball; restrict to that subsequence.
2. Cover $A$ by finitely many balls of radius $1/2$. Infinitely many terms of the current subsequence lie in one ball; restrict again.
3. At step $k$, cover by balls of radius $1/k$ and extract a further subsequence.
4. Take the diagonal: the $k$-th term of the diagonal subsequence lies in a ball of radius $1/k$ together with all subsequent terms, so the diagonal subsequence is Cauchy.
This is the standard proof that total boundedness implies sequential pre-compactness, and it is the mechanism behind every "extract a convergent subsequence" argument in analysis.
### Reduction to Finite Dimensions
A powerful strategy for proving total boundedness of subsets of infinite-dimensional spaces: approximate the set by its projection onto a finite-dimensional subspace, verify total boundedness of the projection, and control the approximation error.
**Structure.** Let $A \subset X$ where $X$ is a Banach space. To show $A$ is totally bounded:
1. Find a sequence of finite-dimensional subspaces $X_n \subset X$ and projections $P_n: X \to X_n$ such that $\sup_{a \in A} \|a - P_n a\| \to 0$.
2. For each $n$, the projected set $P_n(A) \subset X_n$ is bounded (hence totally bounded, since $X_n$ is finite-dimensional).
3. Combine: for given $\varepsilon$, choose $n$ with $\sup_{a \in A} \|a - P_n a\| < \varepsilon/2$, then cover $P_n(A)$ by finitely many balls of radius $\varepsilon/2$ in $X_n$. The corresponding balls of radius $\varepsilon$ in $X$ cover $A$.
This is the strategy used in the proof of the Rellich-Kondrachov theorem (via mollification and Fourier truncation) and in the study of compact operators (as illustrated in the Hilbert-Schmidt example above).
### Total Boundedness by Comparison
To prove a set $A$ is totally bounded, embed it (or its elements) into a space where total boundedness is already known, and pull back the covering.
[example: Total Boundedness via Continuous Injection]
Let $f: (M_1, d_1) \to (M_2, d_2)$ be a uniformly continuous map, and let $A \subset M_1$ be totally bounded. We show $f(A)$ is totally bounded in $M_2$.
Fix $\varepsilon > 0$. By uniform continuity of $f$, there exists $\delta > 0$ such that $d_1(x, y) < \delta$ implies $d_2(f(x), f(y)) < \varepsilon$. By total boundedness of $A$, there exists a finite $\delta$-net $\{x_1, \ldots, x_N\}$ for $A$. For any $a \in A$, choose $x_i$ with $d_1(a, x_i) < \delta$. Then $d_2(f(a), f(x_i)) < \varepsilon$. So $\{f(x_1), \ldots, f(x_N)\}$ is an $\varepsilon$-net for $f(A)$.
In particular, the continuous image of a totally bounded set under a uniformly continuous map is totally bounded. Since every continuous function on a compact (hence totally bounded) metric space is uniformly continuous, this recovers the classical result that continuous images of totally bounded sets are totally bounded when the domain is compact. Without compactness, the result requires uniform continuity --- a merely continuous map need not preserve total boundedness.
[/example]
## References
1. Munkres, J. R., *Topology* (2000). Chapter 7 for total boundedness and its role in the compactness characterisation for metric spaces.
2. Rudin, W., *Principles of Mathematical Analysis* (1976). Chapter 2 for compact sets in $\mathbb{R}^n$ and total boundedness.
3. Conway, J. B., *A Course in Functional Analysis* (1990). Chapters II and VI for compact operators, total boundedness in normed spaces, and the Riesz characterisation.
4. Kolmogorov, A. N. and Tikhomirov, V. M., "$\varepsilon$-entropy and $\varepsilon$-capacity of sets in functional spaces," *American Mathematical Society Translations* (1961). The foundational paper on metric entropy and covering numbers.
5. Brezis, H., *Functional Analysis, Sobolev Spaces and Partial Differential Equations* (2011). Chapter 4 for the relationship between total boundedness, compact embeddings, and variational methods.
6. Evans, L. C., *Partial Differential Equations* (2010). Appendix C for function spaces and compactness in Sobolev spaces.
7. Vershynin, R., *High-Dimensional Probability* (2018). Chapter 4 for covering numbers and their applications in probability and statistics.