I am running a Poisson GLM on insurance claim data. I use L1/L2 regularisation to account for potential lack of full credibility in my dataset. I also have an industry table, which (log of it) I use as an offset term.
In such a setting, should I penalise the intercept?
Earlier discussions typically advise against this - eg see here or here.
However, my understanding is that a fully regularised model, with its intercept penalised, would make predictions in line with the industry table. Which is my expectation - if not strongly justified by my data, then rely on the industry.
On the other hand, by keeping the intercept unpenalised, the coefficients for the individual risk factors could be shrunk / eliminated, but the overall claim count from my data would still stay higher than the industry table implies - which I do not expect to be justified.
Thoughts on this appreciated!