
Questions tagged [data-mining]

Data mining is the process of analyzing large amounts of data in order to find patterns and commonalities.

8 votes
1 answer
266 views

I have tried following the formulas for Term frequency–Inverse document frequency (TF-IDF) calculation and Cosine similarity calculation, and translated them into code. The results I get seem to be ...
Malde
  • 81
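
For readers unfamiliar with the two formulas named in the excerpt above, here is a minimal sketch of TF-IDF weighting followed by cosine similarity. It is not the asker's code: the function names `tf_idf_vectors` and `cosine_similarity`, the sample documents, and the choice of a plain `log(N / df)` inverse document frequency without smoothing are all assumptions made for illustration.

```python
import math
from collections import Counter


def tf_idf_vectors(documents):
    """Return one {term: weight} dict per tokenized document.

    Assumed weighting: tf = count / document length, idf = log(N / df).
    """
    n_docs = len(documents)
    # Document frequency: in how many documents each term occurs.
    doc_freq = Counter(term for doc in documents for term in set(doc))
    vectors = []
    for doc in documents:
        counts = Counter(doc)
        total = len(doc)
        vectors.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return vectors


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an all-zero vector has no direction
    return dot / (norm_a * norm_b)


# Hypothetical sample data, already tokenized on whitespace.
docs = [
    "the cat sat on the mat".split(),
    "the cat chased the mouse".split(),
    "stock prices fell sharply".split(),
]
vectors = tf_idf_vectors(docs)
```

With this weighting, a term that occurs in every document gets an IDF of zero and drops out of the similarity, which is one common reason results look surprising on very small corpora.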
4 votes
1 answer
215 views

This is my implementation of the FPGrowth algorithm where, as an optimisation, I avoid re-creating the tree at each extension of the prefix, while I use a view representation that I think would be ...
jackb
  • 113
1 vote
1 answer
330 views

From a public dataset of film ratings I created a contingency table as follows. Honestly, I don't like all these "for-loops"; I think the quality of the code can definitely be improved ...
Andrea Ciufo
2 votes
0 answers
204 views

I wrote this code to make a non-blocking manager along with pipeline operations using asyncio. My main concern is to catch received items producer, and when the received operation is complete. I want ...
etyzz
  • 21
2 votes
2 answers
163 views

(See the next iteration.) Introduction Suppose Evil Lottery Inc is interested in not paying millions of dollars back to players. They gather the drawn lottery rows first, after which they mine rows ...
coderodde
  • 32.3k
2 votes
1 answer
525 views

I work in the population health industry and get contracts from commercial companies to conduct research on their products. This is the general code to identify target patient groups from a provincial ...
KubiK888
  • 225
2 votes
2 answers
328 views

I've written a Python program to rank the names that appear in the file(s) based on their frequency. In other words, there are multiple files and I want to rank the frequency of the names that appear ...
nsivakr
  • 163
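
The frequency ranking described in the excerpt above could be sketched roughly as follows. This is an illustration under stated assumptions, not the asker's program: the helper name `rank_names` is hypothetical, each file is assumed to hold one name per line, and the files are represented as already-read lists of lines so the sketch stays self-contained.

```python
from collections import Counter


def rank_names(lines_per_file):
    """Count names across several files and rank them by frequency.

    `lines_per_file` is an iterable of iterables of lines; each line is
    assumed (hypothetically) to contain exactly one name.
    """
    counts = Counter()
    for lines in lines_per_file:
        for line in lines:
            name = line.strip()
            if name:  # skip blank lines
                counts[name] += 1
    # Most frequent first; ties are broken alphabetically for stable output.
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical contents of two files.
ranking = rank_names([
    ["Ann", "Bob", "Ann"],
    ["Bob", "Ann", "Cy"],
])
```

To use it with real files, each open file object can be passed in directly, since iterating over a file yields its lines.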
5 votes
1 answer
6k views

This is a k-means implementation using Python (numpy). I believe there is room for improvement when it comes to computing distances (given I'm using a list comprehension, maybe I could also pack it in a ...
Adel Redjimi
