[guided]The variance computation is the point where the scaling $\sqrt{nh_n}$ is determined. We need the total variance of the triangular array to converge to a finite non-zero quantity, and the natural candidate is the local value of the density times the squared $L^2$ size of the kernel.
For each $n\in\mathbb N$, define
\begin{align*}
\sigma_n^2:=\sum_{i=1}^{n}\operatorname{Var}(Y_{n,i}).
\end{align*}
Because $X_1,\dots,X_n$ are identically distributed, the random variables $Y_{n,1},\dots,Y_{n,n}$ also have the same distribution. Therefore
\begin{align*}
\sigma_n^2=n\operatorname{Var}(Y_{n,1})=n\operatorname{Var}\left(\sqrt{\frac{h_n}{n}}W_{n,1}\right)=h_n\operatorname{Var}(W_{n,1}),
\end{align*}
where subtracting the mean inside the definition of $Y_{n,1}$ does not change the variance except for centring.
We now compute $\mathbb E[W_{n,1}]$ and $\mathbb E[W_{n,1}^2]$ from the density of $X_1$. Since $X_1$ has density $f$ with respect to $\mathcal L^1$,
\begin{align*}
\mathbb E[W_{n,1}]
&=\int_{\mathbb R} h_n^{-1}K\left(\frac{x-u}{h_n}\right)f(u)\,d\mathcal L^1(u).
\end{align*}
Use the substitution $t=(x-u)/h_n$, so that $u=x-h_nt$. The map $t\mapsto x-h_nt$ sends $\mathbb R$ onto $\mathbb R$, and the one-dimensional Lebesgue measure transforms by
\begin{align*}
d\mathcal L^1(u)=h_n\,d\mathcal L^1(t).
\end{align*}
Thus
\begin{align*}
\mathbb E[W_{n,1}]
&=\int_{\mathbb R}K(t)f(x-h_nt)\,d\mathcal L^1(t).
\end{align*}
The same substitution applied to the second moment starts from
\begin{align*}
h_n\mathbb E[W_{n,1}^2]
=h_n\int_{\mathbb R}h_n^{-2}K\left(\frac{x-u}{h_n}\right)^2f(u)\,d\mathcal L^1(u).
\end{align*}
After substituting $t=(x-u)/h_n$, this becomes
\begin{align*}
h_n\mathbb E[W_{n,1}^2]
=\int_{\mathbb R}K(t)^2f(x-h_nt)\,d\mathcal L^1(t).
\end{align*}
Now we justify the limiting passage. Since $K$ has compact support, choose $A>0$ such that $\operatorname{supp}K\subset[-A,A]$. Since $f$ is continuous at $x$, it is bounded in a neighbourhood of $x$: there exist $\delta>0$ and $B<\infty$ such that $f(y)\le B$ whenever $|y-x|<\delta$. For all sufficiently large $n$, $h_nA<\delta$. Hence, whenever $t\in\operatorname{supp}K$, we have
\begin{align*}
|x-h_nt-x|\le h_nA<\delta,
\end{align*}
and therefore $f(x-h_nt)\le B$.
The functions $t\mapsto K(t)^2f(x-h_nt)$ are then dominated by the integrable function $t\mapsto B K(t)^2$ supported on $[-A,A]$. Also $f(x-h_nt)\to f(x)$ for each fixed $t\in\mathbb R$, because $h_n\to0$ and $f$ is continuous at $x$. By the dominated convergence theorem,
\begin{align*}
\int_{\mathbb R}K(t)^2f(x-h_nt)\,d\mathcal L^1(t)
\to f(x)\int_{\mathbb R}K(t)^2\,d\mathcal L^1(t)
=f(x)R(K).
\end{align*}
For the first moment, the possible sign changes of $K$ require domination by the absolute value. Since $K$ is bounded and compactly supported, the function $t\mapsto B|K(t)|$ is integrable with respect to $\mathcal L^1$, and
\begin{align*}
|K(t)f(x-h_nt)|\le B|K(t)|
\end{align*}
for all sufficiently large $n$ and all $t\in\operatorname{supp}K$. Applying the dominated convergence theorem again gives
\begin{align*}
\int_{\mathbb R}K(t)f(x-h_nt)\,d\mathcal L^1(t)
\to f(x)\int_{\mathbb R}K(t)\,d\mathcal L^1(t).
\end{align*}
In particular, $\mathbb E[W_{n,1}]$ is bounded as $n\to\infty$. Therefore
\begin{align*}
h_n(\mathbb E[W_{n,1}])^2\to0,
\end{align*}
because $h_n\to0$. Finally,
\begin{align*}
\sigma_n^2=h_n\operatorname{Var}(W_{n,1}).
\end{align*}
Using the variance identity gives
\begin{align*}
\sigma_n^2=h_n\mathbb E[W_{n,1}^2]-h_n(\mathbb E[W_{n,1}])^2.
\end{align*}
Combining the two limits proved above,
\begin{align*}
\sigma_n^2\to f(x)R(K).
\end{align*}
This proves that the total variance of the centred triangular array converges to the variance appearing in the claimed normal limit.[/guided]