Skip to main content

You are not logged in. Your edit will be placed in a queue until it is peer reviewed.

We welcome edits that make the post easier to understand and more valuable for readers. Because community members review edits, please try to make the post substantially better than how you found it, for example, by fixing grammar or adding additional resources and hyperlinks.

Required fields*

5
  • 2
    $\begingroup$ thanks. I believe the coeff are returned back on the original scale. So one would need to re scale them (I assume by using the technique I posted for example). $\endgroup$ Commented Sep 1, 2011 at 18:18
  • $\begingroup$ user6129 is right! you don't get any means of ranking the variables selected. It's an active area of research. $\endgroup$ Commented Sep 1, 2011 at 18:31
  • 4
    $\begingroup$ @B_Miner: you are right, if called with "standardize=TRUE" glmnet returns coefficients on the original scale. One way to get around that is to standardize the explanatory variables outside (e.g. using "scale()" function) and call glmnet with "standardize=FALSE". The resulting coefficients could then be ranked by magnitude to judge their importance. $\endgroup$ Commented Sep 1, 2011 at 18:57
  • $\begingroup$ @suncoolsu: pls see my updated answer above $\endgroup$ Commented Sep 1, 2011 at 19:14
  • 1
    $\begingroup$ @Yevgeny I have a question. Then technically, should the performance results (e.g. area under the curve) be the same whether we set 'standardize=FALSE' and standardize the variables ourselves or we just use 'standardize=TRUE'? (Only the beta-coefficients returned would be different). This is what I theoretically think, but in practice, I get slightly better results when I use 'standardize=TRUE'. Hence, both the coefficients and performance are different. Is this how it should be? $\endgroup$ Commented Sep 11, 2017 at 6:21