Why it's necessary to frozen all inner state of a Batch Normalization layer when fine-tuning

Question

The following content comes from Keras tutorial

This behavior has been introduced in TensorFlow 2.0, in order to enable layer.trainable = False to produce the most commonly expected behavior in the convnet fine-tuning use case.

Why we should freeze the layer when fine-tuning a convolutional neural network? Is it because some mechanisms in tensorflow keras or because of the algorithm of batch normalization? I run an experiment myself and I found that if trainable is not set to false the model tends to catastrophic forgetting what has been learned before and returns very large loss at first few epochs. What's the reason for that?

Ma SG · Accepted Answer · 2021-02-17 17:14:31Z

-1

There is a good explanation and an end-to-end example here:

answered Feb 17, 2021 at 17:14

Ma SG

1

Add a comment |

Stack Exchange Network

Why it's necessary to frozen all inner state of a Batch Normalization layer when fine-tuning

1 Answer 1

Hot Network Questions

Why it's necessary to frozen all inner state of a Batch Normalization layer when fine-tuning

1 Answer 1

Related

Hot Network Questions