Finding near-duplicates with Jaccard similarity and MinHash (blog.nelhage.com)
247 points by brianyu8 2 years ago