Questions tagged [hyperparameter-tuning]
Hyperparameter tuning (also called hyperparameter optimization) refers to the process of finding the optimal set of hyperparameters for a given machine learning algorithm.
277 questions
0 votes
0 answers
10 views
How to handle unstable best_iteration in LightGBM when using Optuna for hyperparameter optimization?
I'm using Optuna to optimize LightGBM hyperparameters, and I'm running into an issue with the variability of best_iteration across different random seeds. Current ...
0 votes
1 answer
20 views
Should Hyperparameter Optimization Be Equalized by Trials or Compute Time?
Say I have two different models with different hyperparameters and I want to compare the performance of both models on some dataset. One model is much simpler than the other and, therefore, if I were ...
1 vote
0 answers
11 views
How to interpret an unstable learning curve on a model tuned with Hyberband Tuning?
I have used Hyperband automatic tuning for an ANN model to predict price. After running the model with the automatic tuning, I am obtaining an R2 score of 1.00 that suggests overfitting, however, I am ...
4 votes
3 answers
105 views
Can cross validation for tuning and LOO for evaluation on the exact same dataset cause bias?
I read two articles by the same guy where he uses the whole dataset for hyperparameter optimisation using with CV and then evaluates the model with the best hyperparameters using leave one out on the ...
0 votes
0 answers
44 views
Which hyperparameters for a standard LLM provide the most benefit vs performance cost?
GPT3 has several hyper-parameters that define the network architecture. My question is: which of these hyper-parameters, when increased, provide the most performance benefit vs computational cost? ...
1 vote
1 answer
137 views
Should I Use the Same Hyperparameters for Different Datasets in ML Models?
I am a student and am looking for your help. I have two datasets, including pre-treatment CT scan and post-treatment CT scan. I want to compare these datasets to determine which yields the best ...
1 vote
0 answers
33 views
Error in plotting Gaussian Process for 3 models that use Bayesian Optimization
I'm writing a python script for Orange Data Mining to plot the gaussian processes in order to find the best hyperparameters for the 5-FoldCrossValidation Accuracy metric. The three models are SVC, ...
1 vote
0 answers
55 views
What is the standard ML pipeline for training and testing? [closed]
I have a dataframe containing 1324 rows and 28 columns and I'm kinda lost on which approach to go for when training regression models. Currently I perform a data split and run GridSearchCV to pick the ...