Show HN: Identifying unstructured junk data using machine learning techniquesgithub.com/rectangletangle8 pointsrectangletangle12 years ago