Timeline for Cluster documents and identify the prominent document in the cluster?
Current License: CC BY-SA 3.0
9 events
| when toggle format | what | by | license | comment | |
|---|---|---|---|---|---|
| Jul 12, 2017 at 2:38 | comment | added | Smith | Please let me know how to print the document name of the prominent document? | |
| Jul 9, 2017 at 3:49 | comment | added | Smith | Please let me know if you know an answer for this datascience.stackexchange.com/questions/20255/… | |
| Jul 7, 2017 at 7:56 | comment | added | Bogas | @Smith, according to the sklearn.DBSCAN fit documentation you should use a distance matrix as input, not a similarity matrix.. You will need to play around with the min_samples, eps parameters according to your data.. | |
| Jul 7, 2017 at 3:45 | comment | added | Smith | Please let me know if I should use cosine distance or 1 - cosine distance (in other words cosine similarity) in the fit parameter of DBSCAN? DBSCAN(min_samples=1, metric='precomputed').fit(pairwise_distances(feature_matrix, metric='cosine')) | |
| Jul 7, 2017 at 1:28 | comment | added | Emre | Use items instead. | |
| Jul 7, 2017 at 0:26 | comment | added | Smith | However, while running the code I get an error saying "AttributeError: 'dict' object has no attribute 'iteritems'". Do you know how to fix it? :) | |
| Jul 7, 2017 at 0:11 | vote | accept | Smith | ||
| Jul 6, 2017 at 13:53 | history | edited | Bogas | CC BY-SA 3.0 | added 2 characters in body |
| Jul 6, 2017 at 10:48 | history | answered | Bogas | CC BY-SA 3.0 |