If an interaction term in a regression model leads to a lower AIC (indicating a better model fit), but the p-value for the interaction term itself is not statistically significant, should the interaction still be retained in the model?
From one perspective, a lower AIC suggests the model with the interaction better explains the data. However, a non-significant interaction implies that there is no strong evidence that the effect of one variable on the outcome differs across levels of the other variable.
In this case, how should one balance the improvement in AIC against the lack of statistical significance? Does a non-significant interaction still have any practical or interpretive value, or is it better to exclude it to maintain a more parsimonious model?