I understand that it is a convention for statisticians to control for mediators to observe the direct effects of an exposure. However, is it appropriate to do so?
Let's look at an example below: 
In Scenario A, X is the exposure, and M is the mediator between X and Y. In Scenario B, the diagram is flipped horizontally, making M the exposure and X the confounder between M and Y.
We can then build a regression model Y = Intercept + mM + xX, with m = effect size of M and x = effect size of X.
The model depicts the relationship in both scenarios. However, the interpretation is different. In Scenario A, we would say that the effect of X on Y is x adjusted for M. In Scenario B, we would say that the effect of M on Y is m adjusted for X. But is it okay to treat a mediator the same way we would treat a confounder (i.e., adjustment)?