Questions tagged [data-mining]

Question 1

I have a simple question of how would you measure the logicality of a programming language? EDIT: I was asked to specify the term "logicality". Hence I will try and provide a stipulation. By ...

Question 2

I want to make a Galaxy schema of a Restaurant. There are 2 fact tables sales and purchases. sales are related to customer and purchases to supplier of ingredients. Now my question is how can i make ...

Question 3

I saw a post on Reddit (https://www.reddit.com/r/math/comments/ci50d3/visualizing_mathematical_subjects/) that utilizes label propagation, Fruchterman-Reingold algorithm, and edge betweenness ...

Question 4

I have a project to match two groups of people. Under insurance, if the initial sales agent leaves the insurer, their customers will become so-called “orphan customers”. I've given a big data set ...

Question 5

I have used different machine learning algorithms to predict solar panels' power output. There are ten independent features for weather data. In all models, I set time as an index and have used the ...

Question 6

In the apriori algorithm there's the self join step, So, say we have 1 3 2 3 3 5 2 5 If I were to do an exhaustive join I'd end up with a tuple including (1, 3, ...

Question 7

We were doing project work for plagiarism checking. For this purpose, we have taken a term frequency vector of two documents and measured the similarity using a cosine similarity measure. The value of ...

Question 8

Suppose I have a list that reflects the priority of web pages for recrawling: l1 = [3, 2, 1, 4, 2, 5] Now, I have tried to estimate the priorities with two ...

Question 9

I am working on building ML/DL solution for a problem where that data is considered, naturally similar and I am worried if that would be considered as data redundancy. My question is, is that so? and ...

Question 10

I am studying data mining and I stumbled upon types of attributes. They are Nominal Ordinal Interval Ratio Data mining book by Tan,Steinbech,Kumar says Permissible transformations for-: nominal-: ...

Question 11

I'm currently going through past paper questions and was wondering if I could get some help answering this one? 'Consider a classification model which is applied to a set of records, of which 100 ...

Question 12

I don't understand two parts in this paper: The min notion on page 4 line 357 (equation 10d): I understand this as to find all the $M_{10}$, $M_{11}$, $M_{01}$ first and then try to minimize the ...

Question 13

I am working on a Fraudulent Cash Transaction Detection System using DBSCAN and I want to know what is the proper way to identify outliers? Thank you ##Edite## I had a problem how to represent the ...

Question 14

So, for the sake of simplicity, I am going to use English characters for this example. Let's say I have a set of strings of characters in English ranked by difficulty: Easy, Intermediate, Advanced. So ...

Question 15

I'm a student studying a data mining course and have come across a problem. I need to explain the problem with the help of an example scenario as I do not know how to explain the problem in any other ...

Stack Exchange Network

Questions tagged [data-mining]

Measuring logicality of programming languages?

Restaurant Galaxy schema

What books are there to learn to implement these graph algorithms?

Persona matching algorithm

Machine learning and test split for time series data

Efficient way to do self join with minimum support?

How can we express value of cosine similarity of two documents into percentage?

Difference between C-Index and Spearman correlation

Would samples be considered data redundancy if they are similar to each other fairly naturally?

What do we mean by permissible transformations in types of attributes-:nominal,ordinal,interval,ratio? [closed]

What are the confusion matrix values?

About the paper Privacy-preserving in association rule mining using an improved discrete binary artificial bee colony

How to detect outliers using DBSCAN?

In a set of sentences, how could I determine the fewest sentences that contains all characters?

How to handle distribution of values with same attributes into different classes

Hot Network Questions