Timeline for How to peform clustering on heavily right skewed data and zero inflated data
Current License: CC BY-SA 4.0
6 events
| when toggle format | what | by | license | comment | |
|---|---|---|---|---|---|
| Sep 24 at 17:24 | comment | added | Christian Hennig | This will depend on what in your specific application are the relevant characteristics of a cluster. There is software for mixtures of skew normal and skew t-distributions, however these may have difficulties with a large percentage of zero values. One consideration is whether having a zero on such a variable is distinctive enough a feature of observations that you may want to have these separated by clustering from the others. Another consideration is whether a transformation will do something good. | |
| Sep 24 at 16:56 | comment | added | Peter Flom | I edited your question to make it clearer and more grammatical. Please check that I did not change what you intended to ask. | |
| Sep 24 at 16:52 | history | edited | Peter Flom | CC BY-SA 4.0 | edited tags, fixed grammar. |
| Sep 24 at 16:52 | comment | added | Peter Flom | Welcome to CV Please spell out AOV and RPV. Also, please tell us what you want the clusters to be like. That is, how do you want these skewed variables to be treated? What will you do with the clusters? What do you mean by "optimal"? | |
| S Sep 24 at 12:43 | review | First questions | |||
| Sep 24 at 16:02 | |||||
| S Sep 24 at 12:43 | history | asked | Rishab | CC BY-SA 4.0 |