I am currently working on clustering continuous variables (such as AOV, RPV, and conversions(conversion/visits)). The variables are heavily right skewed with long tails and one variable is dominated by zeroes meaning with more than 50% of values zeroes. And overall most of my data is concentrated near origin. The variables are also on different scales. Traditional clustering like k means is not performing well as data is clearly not spherical to cluster using k means.
I need suggestions for how to proceed with optimal clustering approach, data transformation and handle zero inflated data where cluster numbers are not pre-defined but rather are dynamic and adjust as per the data