Behavior of Lasso Estimator with More Predictors Than Observations (p > n) and Identical Correlations?

Asked 1 year, 5 months ago

Viewed 109 times

What is the behavior of a Lasso estimator if it is used in a dataset with more predictors (p) than observations (n), where all predictors are uncorrelated but highly relevant to 𝑦 y with exactly the same correlation with 𝑦 y? Which predictors does the Lasso estimator shrink to zero and which does it retain?

A consistent estimator would not reduce any of the 𝑝 p variables to zero. However, as I understand, the Lasso estimator would select at most 𝑛 n predictors. My question is: given these conditions, which predictors does Lasso select and why?

asked Jun 10, 2024 at 15:04

Joe94

5372 silver badges8 bronze badges

$\begingroup$ The answer will likely depend on floating point rounding error and be algorithm dependent, because the mathematical answer is "it's perfectly arbitrary." See our posts about the Lasso for an explanation. $\endgroup$

whuber
– whuber ♦

2024-06-10 15:21:11 +00:00
Commented Jun 10, 2024 at 15:21
3

$\begingroup$ I'm glad to see this compilation of lasso issues. When $p > n$ no one should accept the results without a simulation showing stability and reliability of the approach. The results of such simulations are typically quite disconcerting. $\endgroup$

Frank Harrell
– Frank Harrell

2024-06-10 15:25:30 +00:00
Commented Jun 10, 2024 at 15:25
$\begingroup$ Why would the lasso select at most $n$ predictors? $\endgroup$

Richard Hardy
– Richard Hardy

2024-06-10 19:29:02 +00:00
Commented Jun 10, 2024 at 19:29
1

$\begingroup$ @RichardHardy check out my answer here: stats.stackexchange.com/a/631944/341520 1) because it answers your and also kind of this question and 2) because I'm very proud of it :) $\endgroup$

Lukas Lohse
– Lukas Lohse

2024-06-10 20:29:05 +00:00
Commented Jun 10, 2024 at 20:29
$\begingroup$ @whuber thanks. Do you have a link to the post that described that the variable selection would be arbitrary in this case? $\endgroup$

Joe94
– Joe94

2024-06-11 08:04:25 +00:00
Commented Jun 11, 2024 at 8:04

| Show 2 more comments

0

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.

Stack Exchange Network

Behavior of Lasso Estimator with More Predictors Than Observations (p > n) and Identical Correlations?

0

Linked

Hot Network Questions

Behavior of Lasso Estimator with More Predictors Than Observations (p > n) and Identical Correlations?

0

Know someone who can answer? Share a link to this question via email, Twitter, or Facebook.

Linked

Related

Hot Network Questions