[motivation]
### Why Functions Are Not Enough
Consider the one-dimensional [wave equation](/page/Wave%20Equation) $u_{tt} - u_{xx} = 0$ on $\mathbb{R} \times (0, \infty)$ with initial data $u(x, 0) = f(x)$ and $u_t(x, 0) = 0$. D'Alembert's formula gives $u(x,t) = \tfrac{1}{2}(f(x+t) + f(x-t))$. If $f \in C^2(\mathbb{R})$, this is a classical solution: $u_{tt}$ and $u_{xx}$ exist pointwise and are equal. But if $f$ is merely continuous — say, a triangular pulse with a corner — then $u$ is continuous but not $C^2$, and the equation $u_{tt} = u_{xx}$ has no pointwise meaning at the corner. Yet the formula still describes the physically correct propagation of the wave. We need a framework in which $u$ "solves" the wave equation without requiring pointwise derivatives to exist.
The situation is worse for nonlinear conservation laws. The inviscid Burgers equation $u_t + uu_x = 0$ with smooth initial data can develop discontinuities (shock waves) in finite time. After the shock forms, no classical solution exists, but the physics continues — the shock propagates according to the Rankine–Hugoniot conditions. The equation must be interpreted in a sense that allows discontinuous solutions and their "derivatives" to make sense.
### The Integration-by-Parts Idea
The key observation is that *testing against smooth functions* can replace pointwise evaluation. Suppose $f \in C^{|\alpha|}(\Omega)$ and $\varphi \in C_c^\infty(\Omega)$. [Integration by parts](/theorems/210) gives
\begin{align*}
\int_\Omega (\partial^\alpha f)(x)\, \varphi(x) \, d\mathcal{L}^n(x) &= (-1)^{|\alpha|} \int_\Omega f(x)\, (\partial^\alpha \varphi)(x) \, d\mathcal{L}^n(x),
\end{align*}
with no [boundary](/page/Boundary) terms because $\varphi$ has compact support in $\Omega$. The right-hand side makes sense even when $f$ is merely locally integrable — one never differentiates $f$, only the smooth test function $\varphi$. This suggests defining the "derivative" of $f$ as the *rule* that assigns to each $\varphi$ the number $(-1)^{|\alpha|} \int f \, \partial^\alpha \varphi \, d\mathcal{L}^n$. More generally, any linear functional on test functions that is continuous in a suitable sense can serve as a "generalised function" — a distribution.
### From Weak Derivatives to Distributions
The [weak derivative](/page/Weak%20Derivative), central to [Sobolev space](/page/Sobolev%20Spaces) theory, is a special case: $v \in L^p(\Omega)$ is the weak derivative $\partial^\alpha f$ if $\int v \varphi \, d\mathcal{L}^n = (-1)^{|\alpha|} \int f \, \partial^\alpha \varphi \, d\mathcal{L}^n$ for all $\varphi \in C_c^\infty(\Omega)$. But weak derivatives are required to be *functions* in $L^p$. Distributions remove this restriction: the distributional derivative of $T_f$ is the functional $\varphi \mapsto (-1)^{|\alpha|} \int f \, \partial^\alpha \varphi \, d\mathcal{L}^n$, which need not be representable by any locally integrable function. The Heaviside function $H$ has no weak derivative in any $L^p$ (its distributional derivative is the Dirac delta, which is not a function), but $T_H$ has a perfectly well-defined distributional derivative. The theory of distributions is the completion of the weak derivative idea: every locally integrable function generates a distribution whose distributional derivatives of all orders exist, regardless of the regularity of the original function.
[/motivation]