Interpretation of Interaction when the results in one of the two groups is not statistically significant

Question

Here is the question. Suppose we have a drug A, and we have conducted an RCT evaluating the risk of death in patients randomised to receive A or not. The results of a Cox regression model show HR: 0.60 (95%CI: 0.50-0.70) (please don't focus on the numbers, this is only for demonstrative purposes).

Then we want to explore whether there is an interaction between sex and the effect of A. Females were approximately 35% of the cohort. We therefore perform a Cox-Regression including an interaction term:

outcome~(Sex*A)+A+Sex

and the results are the following:

                coef  exp(coef)       p
    A            -0.6733     0.5100  0.0032
    sexFemale    -0.2543     0.7754  0.2834
    A:sexFemale   0.2689     1.3085  0.4503

Manipulating the coefficients, we can have:

A*sex=Female: HR: 0.67 (95%CI: 0.40-1.20)

A*sex=Male: HR: 0.51 (95%CI: 0.43-0.82) [again, don't focus on the number]

interaction p: 0.450

Now, here's the question, related to the interpretation of the results presented: I think one can conclude that no statistically significant interaction was observed between sex and the effect of A (with the effect being broadly consistent?). However, how can one interpret the non-statistically significant results observed for females, while the effect remained statistically significant in males? My guess is that one should not speculate on this, given that it is mainly due to the smaller sample size in the female group, but I'd like to hear more expert opinions.

Please provide more details on the Cox model that you used. With a dichotomous predictor like sex one typically only has a single "main"-effect coefficient and a single "interaction" effect with a treatment. Without seeing the model as written, it's hard to know how to interpret the separate A*sex=Female and A*sex=Male coefficients. Please provide that information by editing the question, as comments are easy to overlook and can be deleted. — EdM
– EdM, Commented Jan 9, 2023 at 14:34
@EdM done, I have added more details as requested; hopefully my question is now understandable — user89547235
– user89547235, Commented Jan 9, 2023 at 15:18
Your calculation of the A*sex=Female coefficient is based on the sum of the A:sexFemale and the A coefficients. In estimating the CI, did you take into account the covariance between those coefficients or just use their individual standard errors? Please edit the question to show those details. You might still get a "statistically insignificant" result but it's important to start with knowing whether the CI were calculated properly. — EdM
– EdM, Commented Jan 9, 2023 at 16:31
Yes, it was. I am rather interested in the interpretation of such results from a general point of view rather than in the specific scenario, as this (non-statistically significant interaction with different statistical significance in the two groups) is one common situation, so that would be particularly interesting — user89547235
– user89547235, Commented Jan 9, 2023 at 16:45
"However, how can one interpret the non-statistically significant results observed for females, while the effect remained statistically significant in males? " What does this mean? How do you differentiate between 'for males' and 'for females'? — Sextus Empiricus
– Sextus Empiricus, Commented Jan 13, 2023 at 22:35

EdM · Accepted Answer · 2023-01-13 22:21:41Z

how can one interpret the non-statistically significant results observed for females, while the effect remained statistically significant in males?

Fundamentally, the tests on the interaction coefficient and on the estimated (combinations of) coefficients for the effects of intervention A on each sex separately are evaluating different things. Then you add on evaluating "significance" based on a fixed p-value cutoff and you have a recipe for confusion. I'll focus on the (combinations of) coefficients here, as those are the primary result of the regression and they have an asymptotic multivariate normal distribution.

The test on the individual coefficient for A, representing its association with outcome in males, is whether you can distinguish its value from 0 at p < 0.05. In this case, it can.

The test on the A:sexFemale interaction coefficient is effectively whether you can distinguish the effect of A between males and females at p < 0.05. In this case, you can't.

A plot of the probability distribution of coefficient estimates for males and females, based on the values that you provided, shows what's going on. There's a lot of overlap in the estimates for the 2 sexes, but only that for males is sufficiently far from the value of 0 to qualify as "significant."

The above is based on: the coefficient for A of -0.6733 with a standard error of 0.2284 estimated from the p-value, and the coefficient combination providing the effect of A on females (log of 0.67 = -0.4005) with a standard error of 0.28 estimated from the reported 95% CI.
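The two standard errors quoted above can be recovered from the reported numbers alone. The following is a minimal sketch, assuming a normal (Wald) approximation on the log-hazard scale and using only the Python standard library; the variable names are illustrative and the inputs are the "demonstrative" values from the question, so the output is illustrative too.

```python
from math import log
from statistics import NormalDist

std_normal = NormalDist()  # standard normal: mean 0, sd 1

# Values reported in the question (log-hazard scale unless noted).
coef_male = -0.6733        # coefficient for A = effect of A in males
p_male = 0.0032            # two-sided p-value reported for A
hr_female = 0.67           # reported HR for A in females
ci_female = (0.40, 1.20)   # reported 95% CI for that HR

# Males: recover |z| from the two-sided p-value, then SE = |coef| / |z|.
z_male = std_normal.inv_cdf(1 - p_male / 2)
se_male = abs(coef_male) / z_male

# Females: a 95% CI spans 2 * 1.96 standard errors on the log scale.
z_975 = std_normal.inv_cdf(0.975)
coef_female = log(hr_female)
se_female = (log(ci_female[1]) - log(ci_female[0])) / (2 * z_975)


def two_sided_p(estimate, se):
    """Wald test of an estimate against 0 under a normal approximation."""
    return 2 * (1 - std_normal.cdf(abs(estimate) / se))


p_male_check = two_sided_p(coef_male, se_male)
p_female = two_sided_p(coef_female, se_female)

print(f"males:   coef={coef_male:.4f} se={se_male:.4f} p={p_male_check:.4f}")
print(f"females: coef={coef_female:.4f} se={se_female:.4f} p={p_female:.4f}")
```

Each estimate is tested against 0 on its own here; neither test says anything about whether the two estimates differ from each other, which is what the interaction coefficient addresses.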

Although this type of thing might sometimes result from differences in sample sizes, that's not required. Sample sizes primarily affect the widths of the distributions. The coefficient mean-value estimates also matter. If the two curves overlap like this but only one is far enough away from 0 to be "significant," then you will get the same phenomenon with equal sample sizes.
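The point about equal sample sizes can be illustrated with made-up numbers. This is a sketch under stated assumptions: both groups are given the same hypothetical standard error (standing in for equal sample sizes), the two log-hazard-ratio estimates are invented for illustration, and the groups are treated as independent so that the variance of the difference is the sum of the variances.

```python
from math import sqrt
from statistics import NormalDist

std_normal = NormalDist()


def two_sided_p(estimate, se):
    """Wald test of an estimate against 0 under a normal approximation."""
    return 2 * (1 - std_normal.cdf(abs(estimate) / se))


# Hypothetical values, not taken from the question.
se = 0.25               # same standard error in both groups
log_hr_group1 = -0.60   # treatment effect in group 1 (log scale)
log_hr_group2 = -0.40   # treatment effect in group 2 (log scale)

# Each group's effect tested against 0.
p_group1 = two_sided_p(log_hr_group1, se)
p_group2 = two_sided_p(log_hr_group2, se)

# Difference between the groups (the "interaction"), assuming independence.
diff = log_hr_group2 - log_hr_group1
se_diff = sqrt(se**2 + se**2)
p_diff = two_sided_p(diff, se_diff)

print(f"group 1 vs 0: p={p_group1:.3f}")
print(f"group 2 vs 0: p={p_group2:.3f}")
print(f"group 1 vs group 2: p={p_diff:.3f}")
```

With identical precision in both groups, one effect falls below the 0.05 cutoff, the other does not, and the difference between them is nowhere near the cutoff.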

Thank you for this answer. So, it is wrong to make inference on the significance of the effect in Females, but rather one should focus on the interaction between the two sexes. Am I right? — user89547235
– user89547235, Commented Jan 19, 2023 at 15:03
@user89547235 for inference, don't focus on the individual coefficients that you show. You need to do tests on A and on sex that include both their individual coefficients and their interaction term, while taking the covariances among the coefficient estimates into account. The default Type II anova performed by the Anova() (note the capital "A") function in the R car package can do that. Such tests on all coefficients involving A or sex are the best way to evaluate whether either is associated with outcome overall. — EdM
– EdM, Commented Jan 19, 2023 at 15:27
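To make the role of the covariance concrete, here is a rough sketch that backs out the covariance implied by the numbers in the question and then runs a joint Wald test on the two coefficients involving A. It is not a reproduction of what Anova() in the car package computes. It assumes a normal approximation, relies on the question's demonstrative (rounded) values, and uses the fact that a chi-square variable with 2 degrees of freedom has survival function exp(-x/2).

```python
from math import exp, log, sqrt
from statistics import NormalDist

std_normal = NormalDist()

# Values reported in the question.
b_a, p_a = -0.6733, 0.0032       # coefficient for A (effect in males)
b_int, p_int = 0.2689, 0.4503    # coefficient for A:sexFemale
ci_female = (0.40, 1.20)         # 95% CI for the HR of A in females


def se_from_p(coef, p):
    """Standard error implied by a two-sided Wald p-value."""
    return abs(coef) / std_normal.inv_cdf(1 - p / 2)


var_a = se_from_p(b_a, p_a) ** 2
var_int = se_from_p(b_int, p_int) ** 2
z_975 = std_normal.inv_cdf(0.975)
var_female = ((log(ci_female[1]) - log(ci_female[0])) / (2 * z_975)) ** 2

# Var(b_a + b_int) = var_a + var_int + 2 * cov, solved for cov.
cov = (var_female - var_a - var_int) / 2

# What the SE for females would look like if the covariance were ignored.
se_naive = sqrt(var_a + var_int)
se_with_cov = sqrt(var_female)

# Joint Wald statistic W = b' V^{-1} b for the 2x2 covariance matrix V.
det = var_a * var_int - cov**2
wald = (b_a**2 * var_int - 2 * b_a * b_int * cov + b_int**2 * var_a) / det
p_joint = exp(-wald / 2)  # chi-square survival function, 2 df

print(f"implied cov(A, A:sexFemale) = {cov:.4f}")
print(f"SE for females: naive={se_naive:.3f}, with covariance={se_with_cov:.3f}")
print(f"joint Wald test on A and A:sexFemale: W={wald:.2f}, p={p_joint:.4f}")
```

The implied covariance is negative and close to minus the variance of the A coefficient, so ignoring it would make the confidence interval for the effect in females much wider than the one reported.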

Graham Wright · Accepted Answer · 2023-01-14 01:52:14Z

Interaction terms are tricky! But there is nothing out of the ordinary here.

Note: I'm assuming that the variables "A" and "female" are both coded 1/0 (where 0 means "didn't get the treatment" or "male"). If not, then you need to recode them that way or the interpretation of the main effects in the interaction model will be all messed up.

But assuming they are coded that way...here is what your results mean.

The coefficient for "A" tells you the estimated effect of getting the treatment (vs not getting it) FOR MALES (that is, for people whose "sexFemale" value is ZERO).

The coefficient for "sexFemale" tells you the estimated effect of BEING FEMALE (vs male) for people who did NOT get the treatment (i.e. for people whose "A" value is ZERO).

The coefficient for the interaction term tells you whether the estimated effect of A is significantly different for females, compared to the effect for males (which is what the first coefficient showed you).

This means that (looking purely in terms of significance at the 95% confidence level):

-MEN who got the treatment have a significantly lower hazard than men who did not (p<.01)

-Among the control group, being female was not significantly associated with the hazard rate.

-The effect of the treatment for females is not significantly different from the effect estimated for males.

In short: The treatment seems to work, and it seems to work basically the same way for males and females. Also gender itself doesn't seem to impact the hazard of dying in this population.
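The reading of the three coefficients given above can be checked numerically. This is a small sketch assuming the 0/1 coding described in the note, with untreated males as the reference group; the function and variable names are illustrative.

```python
from math import exp

# Coefficients reported in the question (log-hazard scale).
b_a = -0.6733        # A
b_female = -0.2543   # sexFemale
b_int = 0.2689       # A:sexFemale


def log_hazard_ratio(a, female):
    """Log HR relative to untreated males, with a and female coded 0/1."""
    return b_a * a + b_female * female + b_int * a * female


# Hazard ratio of each of the four cells versus untreated males.
hr = {(a, f): exp(log_hazard_ratio(a, f)) for a in (0, 1) for f in (0, 1)}

# Treatment effect within each sex.
effect_a_males = hr[(1, 0)] / hr[(0, 0)]     # equals exp(b_a)
effect_a_females = hr[(1, 1)] / hr[(0, 1)]   # equals exp(b_a + b_int)

# The interaction is the ratio of those two treatment effects.
ratio_of_effects = effect_a_females / effect_a_males  # equals exp(b_int)

print(f"HR for A in males:   {effect_a_males:.2f}")
print(f"HR for A in females: {effect_a_females:.2f}")
print(f"ratio of the two HRs: {ratio_of_effects:.2f}")
```

The sexFemale coefficient cancels out of the within-female comparison, which is why the effect of A in females depends only on the A and A:sexFemale coefficients.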
