[guided]We first need to justify that the expression defining $Y_t$ is not merely formal. Define the centered random variable $Z_t: \Omega \to \mathbb{C}$ by
\begin{align*}
Z_t = X_t - \mu_X.
\end{align*}
Weak stationarity means, in particular, that all $Z_t$ have the same finite second moment:
\begin{align*}
\mathbb{E}[|Z_t|^2] = \gamma_X(0) < \infty.
\end{align*}
Since $X_t = Z_t + \mu_X$, each $X_t$ also belongs to $L^2(\Omega)$.
For a finite set $F \subset \mathbb{Z}$, define the finite partial filter
\begin{align*}
S_{t,F}: \Omega \to \mathbb{C}, \qquad
S_{t,F} = \sum_{j \in F} a_j X_{t-j}.
\end{align*}
To show that these partial sums converge in $L^2$, compare two finite sums indexed by finite sets $F,G \subset \mathbb{Z}$. By the triangle inequality in $L^2(\Omega)$,
\begin{align*}
\|S_{t,F} - S_{t,G}\|_{L^2(\Omega)}
&=
\left\|\sum_{j \in F \triangle G} \varepsilon_j a_j X_{t-j}\right\|_{L^2(\Omega)} \\
&\leq \sum_{j \in F \triangle G} |a_j|\,\|X_{t-j}\|_{L^2(\Omega)},
\end{align*}
where each $\varepsilon_j \in \{-1,1\}$ records whether the term comes from $F \setminus G$ or $G \setminus F$. Weak stationarity gives a common value for $\|X_{t-j}\|_{L^2(\Omega)}$, namely $\|X_0\|_{L^2(\Omega)}$. Therefore
\begin{align*}
\|S_{t,F} - S_{t,G}\|_{L^2(\Omega)}
\leq \|X_0\|_{L^2(\Omega)} \sum_{j \in F \triangle G} |a_j|.
\end{align*}
Because $a \in \ell^1(\mathbb{Z})$, the right-hand side tends to $0$ as the finite sets exhaust $\mathbb{Z}$. Thus the partial sums converge in $L^2(\Omega)$, and $Y_t$ is well-defined.
Now compute the mean. Since $L^2$ convergence implies $L^1$ convergence for probability measures, expectation passes to the limit of the finite partial sums. Hence
\begin{align*}
\mathbb{E}[Y_t]
= \sum_{j \in \mathbb{Z}} a_j \mathbb{E}[X_{t-j}]
= \sum_{j \in \mathbb{Z}} a_j \mu_X
= A(0)\mu_X.
\end{align*}
This quantity does not depend on $t$, which is the first part of weak stationarity for $(Y_t)$.[/guided]