Timeline for What algorithms can I use to detect if articles or posts are duplicates?
Current License: CC BY-SA 3.0
2 events
| when toggle format | what | by | license | comment | |
|---|---|---|---|---|---|
| Oct 8, 2012 at 0:38 | comment | added | Roland Mai | n-grams based measures are much better than md5 hashes especially for semi-structured data such as html. | |
| Oct 8, 2012 at 0:34 | history | answered | gam3 | CC BY-SA 3.0 |