Skip to main content

Questions tagged [isolation-forest]

1 vote
0 answers
38 views

I am using sklearn's Isolation Forest as a model to detect anomalies. My dataset is relatively small, 50 records with only 2-3 features. To prevent any overfitting, what would you recommend to tune ...
Mar's user avatar
  • 165
5 votes
1 answer
148 views

I am using sklearn's IsolationForest for unsupervised anomaly detection task. According to the docs, https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.IsolationForest.html, there are ...
Mar's user avatar
  • 165
5 votes
1 answer
95 views

I am looking to better understand sklearn IsolationForest decision_function. My understanding is that if the metric is closer to -1 then the model is more confident ...
Mar's user avatar
  • 165
2 votes
0 answers
60 views

I have implemented an Isolation Forest algorithm for anomaly detection (unsupervised learning), where I divided my dataset into 1000 subsets, and for each subset, there is one isolation tree. This ...
Learner's user avatar
  • 71
0 votes
0 answers
23 views

I'm performing some outlier detection and am plotting an IsolationForest with varying levels of contamination against PCA variance reduction. I want to maximize the outlier count while minimizing loss ...
user178671's user avatar
3 votes
1 answer
244 views

In unsupervised anomaly detection, does including the contamination percentage turn isolation forest into supervised instead of unsupervised when I fit the data after?
roaa's user avatar
  • 31
7 votes
2 answers
253 views

I have recently came across on this algorithm and was working on my graduation project. As per my understanding, we creates sub trees for each sub samples. Then we calculates the scores for each ...
Mayank Singh's user avatar
2 votes
0 answers
76 views

When using Isolation Forest for outlier detection (an unsupervised case), how to evaluate the performance of isolation forest?
Ram's user avatar
  • 21
0 votes
2 answers
107 views

I am trying to do unsupervised anomaly detection on a dataset with a dozen of variables. None of them have descriptions, and the dataset doesn't have any labels or class variable. I have tried using a ...
ggtb's user avatar
  • 1
0 votes
0 answers
121 views

Let say, I have the anomaly detection (unsupervised learning) dataset with 10 observations (two features). The datasets is like below: After executing the model, following are the results (anomalies ...
Bits's user avatar
  • 131
0 votes
0 answers
44 views

[I am a total beginner in machine learning algorithms] I have 10 spectrograms (lines) for phytoplankton (each composed of 288 points). Each spectrogram is associated with a phytoplankton dendity data ...
Marianne's user avatar
0 votes
1 answer
113 views

I have a dataset which has a column of prices, a column of dates, and various other columns of numerical and categorical values. I would like to find outlier prices based on all the columns in the ...
ImNotSureAboutStats's user avatar
2 votes
1 answer
2k views

im trying to build an outlier detector to find outliers in test data. That data varies a bit (more test channels, longer/shorter testing). First im applying the train test split because i want to use ...
arooki's user avatar
  • 23
0 votes
0 answers
32 views

I'm trying to build an anomaly detection model using Isolation Forest. I currently have 12 features, about half of them depends on the presence of a particular data field, say ...
Rayne's user avatar
  • 131
0 votes
0 answers
129 views

I'm running sklearn's IsolationForest on a dataset containing 2 classes of data, one that I know is the anomaly (~1.5% of the entire dataset), the other is the normal dataset. I'm using this (shuffled)...
Rayne's user avatar
  • 131

15 30 50 per page