Reason for not shrinking the bias (intercept) term in regression
For a linear model, $y=\beta_0+x\beta+\varepsilon$, the shrinkage (penalty) term always takes the form $P(\beta)$.

What is the reason that we do not shrink the bias (intercept) term $\beta_0$? Should we shrink the bias terms in neural network models?
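To make the asymmetry concrete, here is a minimal sketch (function and variable names are my own) of ridge regression in which only the slope coefficients $\beta$ are penalized: centering the data lets the intercept $\beta_0$ be recovered from the sample means afterwards, outside the penalty.

```python
import numpy as np

def ridge_unpenalized_intercept(X, y, lam):
    """Ridge regression that shrinks only the slopes, not the intercept."""
    # Center predictors and response; on centered data the intercept is 0,
    # so the ridge penalty lam * ||beta||^2 touches only the slopes.
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc, yc = X - x_mean, y - y_mean
    # Closed-form ridge solution on centered data:
    # beta = (Xc' Xc + lam * I)^{-1} Xc' yc
    beta = np.linalg.solve(Xc.T @ Xc + lam * np.eye(X.shape[1]), Xc.T @ yc)
    # Recover the intercept from the means, with no shrinkage applied to it.
    beta0 = y_mean - x_mean @ beta
    return beta0, beta
```

Note that with `lam = 0` this reduces to ordinary least squares, and as `lam` grows the slopes shrink toward zero while the fitted intercept still tracks the mean of $y$, which is the behavior penalizing $\beta_0$ would destroy.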