Does it call forward() in nn.Module? I thought that when we call the model, the forward method is used. Why do we need to call train()?
6 Answers
model.train() tells your model that you are training the model. This informs layers such as Dropout and BatchNorm, which are designed to behave differently during training and evaluation. For instance, in training mode BatchNorm updates a moving average on each new batch, whereas in evaluation mode these updates are frozen.
More details: model.train() sets the mode to train (see source code). You can call either model.eval() or model.train(mode=False) to indicate that you are testing. It might be intuitive to expect the train function to train the model, but it does not do that; it just sets the mode.
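A small sketch of that difference (the layer size and input shape here are arbitrary): in train mode a BatchNorm layer updates its running statistics on every forward pass, while in eval mode they stay frozen.

```python
import torch
import torch.nn as nn

bn = nn.BatchNorm1d(3)
x = torch.randn(8, 3)

bn.train()                       # training mode (the default)
bn(x)
print(bn.running_mean)           # updated from the batch statistics

frozen = bn.running_mean.clone()
bn.eval()                        # evaluation mode
bn(x)
print(torch.equal(bn.running_mean, frozen))  # True: no update in eval mode
```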
9 Comments
Is there something like mdl.is_eval()?
There is the model.training flag. It is False when in eval mode.
model.training is False? I think by default it is True, and that is why they omit the model.train() call. As for your result, I cannot say much without knowing what the data was and whether you measured test or train accuracy, etc.
Here is the code for nn.Module.train():
```python
def train(self, mode=True):
    r"""Sets the module in training mode."""
    self.training = mode
    for module in self.children():
        module.train(mode)
    return self
```

Here is the code for nn.Module.eval():
```python
def eval(self):
    r"""Sets the module in evaluation mode."""
    return self.train(False)
```

By default, the self.training flag is set to True, i.e., modules are in train mode by default. When self.training is False, the module is in the opposite state, eval mode.
Of the most commonly used layers, only Dropout and BatchNorm care about that flag.
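Your own layers can branch on the same flag. For example (a hypothetical custom module, not from the PyTorch source):

```python
import torch
import torch.nn as nn

class NoisyLinear(nn.Module):
    """Hypothetical layer: adds noise only while training."""
    def __init__(self, in_features, out_features):
        super().__init__()
        self.linear = nn.Linear(in_features, out_features)

    def forward(self, x):
        out = self.linear(x)
        if self.training:                       # the flag set by train()/eval()
            out = out + 0.1 * torch.randn_like(out)
        return out

layer = NoisyLinear(4, 2)
layer.train()   # noisy outputs
layer.eval()    # deterministic outputs
```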
4 Comments
What about the self.training flag now?
Does model.eval() affect the backward pass?
model.eval() is just a switch that turns off dropout and batch-norm training behaviour. I have a nice intro to PyTorch training where you can check the forward and backward pass, and a deep intro to PyTorch AD where you can confidently understand the details of PyTorch AD.

| model.train() | model.eval() |
|---|---|
| Sets model in training mode, i.e. • BatchNorm layers use per-batch statistics • Dropout layers activated, etc. | Sets model in evaluation (inference) mode, i.e. • BatchNorm layers use running statistics • Dropout layers de-activated, etc. Equivalent to model.train(False). |
Note: neither of these function calls runs a forward or backward pass. They tell the model how to act when it is run.
This is important because some modules (layers), e.g. Dropout and BatchNorm, are designed to behave differently during training vs inference, and hence the model will produce unexpected results if run in the wrong mode.
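A quick way to see this (the dropout probability and tensor shape are arbitrary): the same input gives different outputs depending on the mode.

```python
import torch
import torch.nn as nn

drop = nn.Dropout(p=0.5)
x = torch.ones(1, 6)

drop.train()
print(drop(x))   # roughly half the values zeroed, the rest scaled by 1/(1-p)

drop.eval()
print(drop(x))   # identity: the input passes through unchanged
```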
Comments
There are two ways of letting the model know your intention, i.e. whether you want to train the model or use it for evaluation. With model.train() the model knows it has to learn the layers, and with model.eval() you indicate to the model that nothing new is to be learnt and that it is being used for testing. model.eval() is also necessary because, in PyTorch, if we are using BatchNorm and want to pass just a single image at test time, PyTorch throws an error if model.eval() is not specified.
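A minimal sketch of that error case (BatchNorm1d and the feature size are arbitrary choices): a batch of size one cannot provide per-batch statistics in train mode, but eval mode falls back to the running statistics.

```python
import torch
import torch.nn as nn

bn = nn.BatchNorm1d(4)
single = torch.randn(1, 4)       # one sample only

bn.train()
try:
    bn(single)                   # fails: cannot compute batch statistics from one value per channel
except ValueError as e:
    print("train mode:", e)

bn.eval()
print("eval mode:", bn(single))  # works: running statistics are used
```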
Comments
Consider the following model:

```python
import torch
import torch.nn.functional as F
from torch_geometric.nn import GCNConv

class GraphNet(torch.nn.Module):
    def __init__(self, num_node_features, num_classes):
        super(GraphNet, self).__init__()
        self.conv1 = GCNConv(num_node_features, 16)
        self.conv2 = GCNConv(16, num_classes)

    def forward(self, data):
        x, edge_index = data.x, data.edge_index
        x = self.conv1(x, edge_index)
        x = F.dropout(x, training=self.training)  # look here
        x = self.conv2(x, edge_index)
        return F.log_softmax(x, dim=1)
```

Here, the behaviour of dropout differs between the modes of operation. As you can see, it is applied only when self.training == True. So, when you call model.train(), the model's forward function will perform dropout; otherwise it will not (e.g. after model.eval() or model.train(mode=False)).
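A short usage sketch (assuming torch_geometric is installed; the feature and class counts below are made up) showing how the two calls flip the flag that forward() reads:

```python
model = GraphNet(num_node_features=3, num_classes=2)  # hypothetical sizes

model.train()
print(model.training)   # True  -> F.dropout is applied in forward()

model.eval()
print(model.training)   # False -> F.dropout is a no-op in forward()
```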
Comments
The current official documentation states the following:
This has any [sic] effect only on certain modules. See documentations of particular modules for details of their behaviors in training/evaluation mode, if they are affected, e.g. Dropout, BatchNorm, etc.
You can also set self.training directly via self.training = training, but self.train(False) sets it recursively for all modules. In fact, that is what self.train does: it changes the flag recursively for all modules. See code: github.com/pytorch/pytorch/blob/…
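A small illustration of that recursion (the container and layers are arbitrary): assigning the attribute directly only touches the parent module, while train(False) / eval() also walk the children.

```python
import torch.nn as nn

model = nn.Sequential(nn.Linear(4, 4), nn.Dropout(p=0.5))

model.training = False        # direct assignment: children untouched
print(model[1].training)      # True

model.train(False)            # recursive: same effect as model.eval()
print(model[1].training)      # False
```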