Show HN: Using Simhash and DOM trees to scrape HN with 3 lines of configurationgist.github.com2 pointsbartolsthoorn12 years ago