Questions tagged [interpretation]
56 questions
3 votes
0 answers
58 views
Is there an alternative to Shapley values for tree-based models where the sum of feature contributions is equal to the prediction?
I'm currently working on a project where the ultimate goal is to reduce the quantity of a bad thing, b. I've been tasked with assigning the blame between several different features that are believed ...
5 votes
1 answer
439 views
Why do Shapley value solutions remain consistent when the value function of the empty set changes in the ML context?
Hey there data science stack exchange - question about SHAP. In the original Shapley value formulation from Lloyd, one assumption is that the value function of the empty set equals zero, $v(\emptyset) ...
1 vote
2 answers
69 views
Am I supposed to do any statistical test on results?
I am running an email campaign. Before sending emails to users, I divided my user base into treatment and control groups (50-50). I divided the groups in such a way that there is no difference in user behaviour between them. I ...
0 votes
0 answers
88 views
Help with multinomial logistic regression
I am a data science student and have the opportunity to work on an article regarding cardiac arrests in our country. So far I have fitted a multinomial regression model and I also plan on doing a ...
0 votes
1 answer
2k views
ValueError: X has 54 features, but DecisionTreeClassifier is expecting 53 features as input
I am analysing and predicting the 2023 Cricket World Cup based on a previously given dataset. This is the exploratory analysis: Feature selection and training the model: Applying the random forest classifier algorithm: ...
0 votes
0 answers
44 views
Classification with rejection and how to interpret ambiguous 2D data
Let's say we're going to train a classifier with the full data set. There's also a reject logic for ambiguous regions in the data. So, at the end, the final system outputs reject or 0 or 1. That is, ...
0 votes
1 answer
66 views
Efficient ways of clustering for big data
I have a customer segmentation task with 120k users and a record of their purchases, which comes to over 3 million records. The approach I want to use is clustering algorithms like k-means ...
1 vote
1 answer
65 views
Interpreting a model
Suppose I trained a model (say, logistic regression) using train, test and validation sets. During interpretation, which dataset (test or validation) should I base the interpretation on? If test and validation show ...