Questions tagged [xgboost]
For questions related to the eXtreme Gradient Boosting algorithm.
702 questions
4 votes
0 answers
63 views
What makes XGBoost so much more dominant with structured data?
XGBoost has a history of being dominant in Kaggle competitions, but why is it so competitive on structured data compared to other ML algorithms? I’ve read that sparse matrix utilization improves the ...
5 votes
1 answer
69 views
Does using test data in eval_set argument for xgboost cause data leakage?
I'm using an early stopping for XGBClassifier. The fitting looks like this (simplified): ...
6 votes
2 answers
720 views
What are the typical GPU requirements for training a classic predictions Model like XGBoost or Random Forest?
I am training a machine learning model to predict a score based on some behavioral client data. The model would be something classic like a random forest, XGBoost or multilinear Regression. Depending ...
2 votes
0 answers
143 views
Evaluating model performance when used in targeting decisions
I have a logistic regression model, the output of which is used to make decisions. I am testing an improved version of this model. In testing, it has substantially improved logloss vs old model. When ...
2 votes
1 answer
79 views
New Variables to Add to Model GLM/GBM
I already have a GLM model in place to predict claims frequency. I know have access to many new variables (a mix of categorical and continuous variables, some of which are likely correlated). I wish ...
8 votes
1 answer
284 views
Machine learning model for ranking that outputs probabilities
Traditionally ML algorithms for ranking take the features as input and then output a "ranking score" which do not have a natural probabilistic interpretation. For example, suppose we have ...
3 votes
2 answers
2k views
XGBoost __sklearn_tags__ Method Error in Python When Loading Model [closed]
I'm getting this error when trying to load a saved XGBRegressor model locally: ...
5 votes
1 answer
129 views
Comparing probabilities of two models
Consider a dataset and two binary classes CLASS_A and CLASS_B. These two classes are not necessarely independent. Let's say that CLASS_A = "buy an apple" and CLASS_B = "buy an orange&...