PyTorch `torch.no_grad` vs `torch.inference_mode`

Question

As of v1.9, PyTorch provides torch.inference_mode, which is "analogous to torch.no_grad... Code run under this mode gets better performance by disabling view tracking and version counter bumps."

If I am just evaluating my model at test time (i.e. not training), is there any situation where torch.no_grad is preferable to torch.inference_mode? I plan to replace every instance of the former with the latter, and I expect to use runtime errors as a guardrail (i.e. I trust that any issue would reveal itself as a runtime error, and if it doesn't surface as a runtime error then I assume it is indeed preferable to use torch.inference_mode).

More details on why inference mode was developed are mentioned in the PyTorch Developer Podcast.

Current link explaining the differences between these: Locally disabling gradient computation. In short, no_grad disables gradient tracking but still lets you use the resulting values in later gradient computations, while inference_mode does not, so the advice is to use inference_mode for things like data processing and model evaluation.
– javidcf, Commented Sep 16, 2024 at 8:50
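
The difference described in this comment can be sketched as follows. This is a minimal illustration that is not part of the original thread: the names `w`, `a`, and `b` are hypothetical, it assumes PyTorch 1.9 or later, and the exact error text may differ between versions.

```python
import torch

# A parameter-like tensor that we want gradients for later.
w = torch.ones(3, requires_grad=True)

with torch.no_grad():
    a = torch.ones(3) * 2   # ordinary tensor with requires_grad=False

with torch.inference_mode():
    b = torch.ones(3) * 2   # inference tensor

# The no_grad result can take part in a later autograd computation.
(w * a).sum().backward()
grad_from_no_grad = w.grad.clone()   # d/dw sum(w * a) equals a

# The inference-mode result cannot be saved for backward, so the
# multiplication raises a RuntimeError when autograd tries to record it.
try:
    (w * b).sum().backward()
    inference_error = None
except RuntimeError as exc:
    inference_error = str(exc)

print("a is inference tensor:", a.is_inference())
print("b is inference tensor:", b.is_inference())
print("gradient via no_grad tensor:", grad_from_no_grad)
print("error via inference tensor:", inference_error)
```

If a value computed during evaluation is later fed back into training code, this is the case where no_grad still works and inference_mode does not.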

kayak · Accepted Answer · 2022-12-22 06:02:16Z

27

Yes, torch.inference_mode is indeed preferable to torch.no_grad in all situations where inference mode does not throw a runtime error. Check here.

answered Oct 13, 2021 at 16:54 by efthimio · edited Dec 22, 2022 at 6:02 by kayak


4 Comments

carbocation Over a year ago

FYI the first link has now rotted.

efthimio Over a year ago

link is now removed

simplyPTA Over a year ago

a more insightful answer: stackoverflow.com/a/74197846/10805680

efthimio Over a year ago

@simplyPTA Both my question and the link in my answer already mention view tracking, the version counter, and the PyTorch podcast episode. The answer you link does not mention anything additional that is responsible for the performance improvement.
