Skip to main content

Questions tagged [k-fold-cv]

For questions related to the k-fold cross-validation technique, where you split the dataset into k folds (subsets), train the model on k - 1 of these folds and test it on the remaining (test) fold; then repeat this procedure for each of the k folds, such that we compute the test performance for each fold; finally, we can average these test performances.

1 vote
1 answer
74 views

We know that in machine learning the dataset is divided into 3 parts: training data, validation data and test data. On the other hand, K-fold cross-validation is defined as follows: the dataset is ...
DSPinfinity's user avatar
  • 1,223
0 votes
1 answer
61 views

Some of you may be familiar with the unusual split scheme used for time-series data. In short, there is a saying that one should only consider a split where the training set comes prior to the testing ...
Hadar Sharvit's user avatar
0 votes
1 answer
128 views

i want to pretrain some model on P100 which is provided by kaggle. Pretraining on 3 A100 is about 1.5 day. I have 2 questions: Can I put the same seed everywhere so that the results match, train the ...
kali_xyyali's user avatar
0 votes
1 answer
135 views

I got this feedback for my thesis paper. The improvement shown in the results section could be the result of random initialization. There should be multiple runs with means and standard deviations. ...
Md. Asif Iqbal Fahim's user avatar
0 votes
1 answer
1k views

I have a multi-label classification task I am solving. I have done hyperparameter tuning (with Keras Tuner) to determine the best configuration for my neural network. Is it valid to do this (determine ...
user9317212's user avatar
7 votes
4 answers
2k views

I want to prevent my model from overfitting. I think that k-fold cross-validation (because it is doing this each time with different datasets) may be more effective than splitting the dataset into ...
jennifer ruurs's user avatar
7 votes
1 answer
573 views

I wanted to ask about the methodology of testing the ML models against overfitting. Please note that I don't mean any overfitting reducing methods like regularisation, just a measure to judge whether ...
GKozinski's user avatar
  • 1,280
1 vote
1 answer
273 views

I am dealing with an intent classification task on an Italian customer service data set. I've more or less 1.5k sentences and 29 classes (imbalanced). According to the literature, a good choice is to ...
Alfonso's user avatar
  • 65