Linearity of the Second Derivative — Statement & Proof

Linearity of the Second Derivative (Theorem # 8709)

Theorem

Edit Issues Pull Requests Attributions Admin

Discussion

Proof

[proofplan] We use the definition of twice differentiability at a point: the derivative maps $x \mapsto Df_x$ and $x \mapsto Dg_x$ are defined near $a$ and differentiable at $a$. First we prove directly from the first-order expansion that $\alpha f+\beta g$ is differentiable near $a$ and that its derivative is the corresponding linear combination of $Df_x$ and $Dg_x$. Then we differentiate this derivative map at $a$ and translate the resulting [linear map](/page/Linear%20Map) into the associated bilinear [second derivative](/page/Second%20Derivative). [/proofplan] [step:Choose a neighbourhood where the derivative maps are defined] Since $f$ and $g$ are twice differentiable at $a$, there exists $r>0$ such that $B(a,r) \subset U$, both $f$ and $g$ are differentiable at every point of $B(a,r)$, and the derivative maps \begin{align*} F: B(a,r) \to \operatorname{Lin}(\mathbb{R}^m,\mathbb{R}^n) \end{align*} are defined as follows. Here $B(a,r):=\{x \in \mathbb{R}^m: |x-a|<r\}$ is the open Euclidean ball centered at $a$ with radius $r$, and $\operatorname{Lin}(\mathbb{R}^m,\mathbb{R}^n)$ denotes the real [vector space](/page/Vector%20Space) of linear maps from $\mathbb{R}^m$ to $\mathbb{R}^n$. The first derivative map of $f$ is and the first derivative map of $g$ is \begin{align*} G: B(a,r) \to \operatorname{Lin}(\mathbb{R}^m,\mathbb{R}^n). \end{align*} They are defined by $F(x)=Df_x$ and $G(x)=Dg_x$, and they are differentiable at $a$. Let \begin{align*} H: U \to \mathbb{R}^n \end{align*} be the map $H=\alpha f+\beta g$. [/step] [step:Differentiate the linear combination once] Fix $x \in B(a,r)$. Since $f$ and $g$ are differentiable at $x$, for $q \in \mathbb{R}^m$ with $x+q \in U$ and $q \to 0$ there are remainder functions $\rho_f$ and $\rho_g$ satisfying \begin{align*} f(x+q)=f(x)+Df_x(q)+\rho_f(q) \end{align*} with $\rho_f(q)=o(|q|)$, and \begin{align*} g(x+q)=g(x)+Dg_x(q)+\rho_g(q) \end{align*} with $\rho_g(q)=o(|q|)$. Therefore \begin{align*} H(x+q)=H(x)+\bigl(\alpha Df_x+\beta Dg_x\bigr)(q)+\alpha\rho_f(q)+\beta\rho_g(q). \end{align*} Because $\alpha\rho_f(q)+\beta\rho_g(q)=o(|q|)$, the map $H$ is differentiable at $x$ and \begin{align*} DH_x=\alpha Df_x+\beta Dg_x. \end{align*} Since $x \in B(a,r)$ was arbitrary, $H$ is differentiable throughout $B(a,r)$. [guided] Fix a point $x \in B(a,r)$. The goal is to compute the derivative of $H=\alpha f+\beta g$ at $x$ directly from the defining first-order expansions. Because $f$ is differentiable at $x$, there is a remainder function $\rho_f$ such that \begin{align*} f(x+q)=f(x)+Df_x(q)+\rho_f(q) \end{align*} and $\rho_f(q)=o(|q|)$ as $q \to 0$. Likewise, because $g$ is differentiable at $x$, there is a remainder function $\rho_g$ such that \begin{align*} g(x+q)=g(x)+Dg_x(q)+\rho_g(q) \end{align*} and $\rho_g(q)=o(|q|)$. Now multiply the first expansion by $\alpha$, multiply the second expansion by $\beta$, and add them. This gives \begin{align*} H(x+q)=H(x)+\alpha Df_x(q)+\beta Dg_x(q)+\alpha\rho_f(q)+\beta\rho_g(q). \end{align*} The map $\alpha Df_x+\beta Dg_x$ is linear from $\mathbb{R}^m$ to $\mathbb{R}^n$, since it is a linear combination of linear maps. The remaining error term is still negligible compared with $|q|$, because \begin{align*} \frac{|\alpha\rho_f(q)+\beta\rho_g(q)|}{|q|}\le |\alpha|\frac{|\rho_f(q)|}{|q|}+|\beta|\frac{|\rho_g(q)|}{|q|} \end{align*} for $q \ne 0$, and the right-hand side tends to $0$ as $q \to 0$. Therefore $\alpha\rho_f(q)+\beta\rho_g(q)=o(|q|)$, so the defining expansion for differentiability gives \begin{align*} DH_x=\alpha Df_x+\beta Dg_x. \end{align*} Since this argument applies to every $x \in B(a,r)$, the derivative map of $H$ is defined on $B(a,r)$. [/guided] [/step] [step:Differentiate the derivative map at $a$] Let \begin{align*} A: \mathbb{R}^m \to \operatorname{Lin}(\mathbb{R}^m,\mathbb{R}^n) \end{align*} denote the derivative of $F$ at $a$, and let \begin{align*} B: \mathbb{R}^m \to \operatorname{Lin}(\mathbb{R}^m,\mathbb{R}^n) \end{align*} denote the derivative of $G$ at $a$. Equip $\operatorname{Lin}(\mathbb{R}^m,\mathbb{R}^n)$ with the operator norm \begin{align*} \|T\|_{\mathrm{op}}:=\sup\{|T(v)|: v \in \mathbb{R}^m, |v|\le 1\} \end{align*} for each $T \in \operatorname{Lin}(\mathbb{R}^m,\mathbb{R}^n)$. Thus, as $h \to 0$ in $\mathbb{R}^m$, \begin{align*} F(a+h)=F(a)+A(h)+R_f(h) \end{align*} where $\|R_f(h)\|_{\mathrm{op}}=o(|h|)$, and \begin{align*} G(a+h)=G(a)+B(h)+R_g(h) \end{align*} where $\|R_g(h)\|_{\mathrm{op}}=o(|h|)$. From the first-derivative formula in the previous step, the derivative map of $H$ on $B(a,r)$ is $\alpha F+\beta G$. Hence \begin{align*} DH_{a+h}=DH_a+\bigl(\alpha A+\beta B\bigr)(h)+\alpha R_f(h)+\beta R_g(h). \end{align*} Moreover, \begin{align*} \|\alpha R_f(h)+\beta R_g(h)\|_{\mathrm{op}}\le |\alpha|\|R_f(h)\|_{\mathrm{op}}+|\beta|\|R_g(h)\|_{\mathrm{op}}, \end{align*} so $\|\alpha R_f(h)+\beta R_g(h)\|_{\mathrm{op}}=o(|h|)$. Therefore the derivative map $x \mapsto DH_x$ is differentiable at $a$, with derivative \begin{align*} D(DH)_a=\alpha A+\beta B. \end{align*} Thus $H=\alpha f+\beta g$ is twice differentiable at $a$. [/step] [step:Identify the resulting second derivative as a bilinear map] By the definition of the bilinear second derivative as the derivative of the first derivative map evaluated on a second direction, for $h,k \in \mathbb{R}^m$ we have \begin{align*} D^2H_a(h,k)=\bigl(D(DH)_a(h)\bigr)(k). \end{align*} Using $D(DH)_a=\alpha A+\beta B$, we obtain \begin{align*} D^2H_a(h,k)=\bigl(\alpha A(h)+\beta B(h)\bigr)(k). \end{align*} Since scalar multiplication and addition in $\operatorname{Lin}(\mathbb{R}^m,\mathbb{R}^n)$ are pointwise, \begin{align*} D^2H_a(h,k)=\alpha A(h)(k)+\beta B(h)(k). \end{align*} Again using the same identification, $A(h)(k)=D^2f_a(h,k)$ and $B(h)(k)=D^2g_a(h,k)$. Therefore \begin{align*} D^2(\alpha f+\beta g)_a(h,k)=\alpha D^2f_a(h,k)+\beta D^2g_a(h,k) \end{align*} for all $h,k \in \mathbb{R}^m$, which is the desired formula. [/step]

Prerequisites (0/3 completed)

Prerequisites Graph

Interactive dependency map showing how this theorem builds on foundational concepts

Loading dependency graph...

Definitions & Concepts

Explore Further

Second Derivative Definition Derivative Definition Function Definition Moment-Cumulant Formula for Free Cumulants Analysis Clarkson's Inequalities Functional Analysis Jensen Inequality Measure Theory Closed Graph Theorem Functional Analysis Energy Conservation for the Homogeneous Wave Equation Partial Differential Equations Characterisation of Twice Differentiability Multivariable Calculus Semiclassical Calderón-Vaillancourt Theorem Analysis Integrability Criterion Integration Analysis Area

What brings you to Androma?

Start with a route through the knowledge graph.

Linearity of the Second Derivative (Theorem # 8709)

Discussion

Proof

Prerequisites (0/3 completed)

Prerequisites Graph

Explore Further

Sign in to Androma

Check your inbox

One last step

Linearity of the Second Derivative (Theorem # 8709)

Discussion

Proof

Prerequisites (0/3 completed)

Prerequisites Graph

Explore Further