An ARMA(p,q)-GARCH(r,s) model specifies the conditional distribution of a time series $x_t$:
\begin{aligned} x_t &= \mu_t + u_t, \\ \mu_t &= \varphi_0 + \varphi_1 x_{t-1} + \dots + \varphi_p x_{t-p} + \theta_1 u_{t-1} + \dots + \theta_q u_{t-q}, \\ u_t &= \sigma_t \varepsilon_t, \\ \sigma_t^2 &= \omega + \alpha_1 u_{t-1}^2 + \dots + \alpha_s u_{t-s}^2 + \beta_1 \sigma_{t-1}^2 + \dots + \beta_r \sigma_{t-r}^2, \\ \varepsilon_t &\sim iid(0,1). \end{aligned}
Suppose based on ARMA(p,q) model we predicted conditional mean for the forecast horizon $h$, i.e., we estimated $\hat{\mu}_{t+1}, \hat{\mu}_{t+2}, ..., \hat{\mu}_{t+h}$. Further, from GARCH(r,s) model we estimated conditional variance for the forecast horizon $h$, i.e., $\hat{\sigma}^{2}_{t+1},\hat{\sigma}^{2}_{t+2},..., \hat{\sigma}^{2}_{t+h}$.
The problem is to come up with forecasts for the process $x$, i.e., $\hat{x}_{t+1}, \hat{x}_{t+2}, ..., \hat{x}_{t+h}$. Accepted approach is to take predicted conditional means $(\hat{\mu}_{t+1}, \hat{\mu}_{t+2}, ..., \hat{\mu}_{t+h})$ for point forecast for time series ${x}$.
Question: Why not utilize predicted conditional variance to come up with point forecasts? Is there anything wrong to estimate point forecasts as follows: $\hat{x}_{t+1} = \hat{\mu}_{t+1} + \hat{\sigma}^{2}_{t+1}\epsilon_{t+1}, \hat{x}_{t+2} = \hat{\mu}_{t+2} + \hat{\sigma}^{2}_{t+2}\epsilon_{t+2}, ..., \hat{x}_{t+h} = \hat{\mu}_{t+h} + \hat{\sigma}^{2}_{t+h}\epsilon_{t+h}$?
P.S. The reason that I am suggesting the above described approach is that to "save" volatility clustering effect in forecasted values as well. If we utilize only mean equation (ARMA part), we remove volatility clustering effect from predicted values.